Next Article in Journal
Properties of Statistical Depth with Respect to Compact Convex Random Sets: The Tukey Depth
Next Article in Special Issue
A Mutual Information-Based Network Autoregressive Model for Crude Oil Price Forecasting Using Open-High-Low-Close Prices
Previous Article in Journal
SIP-UNet: Sequential Inputs Parallel UNet Architecture for Segmentation of Brain Tissues from Magnetic Resonance Images
Previous Article in Special Issue
Macroeconomic Effects of Energy Price: New Insight from Korea?
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Forecasting Crude Oil Future Volatilities with a Threshold Zero-Drift GARCH Model

1
School of General Education, Guizhou University of Commerce, Guiyang 550014, China
2
Department of Actuarial Studies and Business Analytics, Macquarie University, Sydney, NSW 2109, Australia
*
Author to whom correspondence should be addressed.
Submission received: 4 July 2022 / Revised: 25 July 2022 / Accepted: 2 August 2022 / Published: 3 August 2022
(This article belongs to the Special Issue Mathematical Methods in Energy Economy)

Abstract

:
The recent price crash of the New York Mercantile Exchange (NYMEX) crude oil futures contract, which occurred on 20 April 2020, has caused history-writing movements of relative prices. For instance, the West Texas Intermediate (WTI) experienced a negative price. Explosive heteroskedasticity is also evidenced in associated products, such as the Intercontinental Exchange Brent (BRE) and Shanghai International Energy Exchange (INE) crude oil futures. Those movements indicate potential non-stationarity in the conditional volatility with an asymmetric influence of negative shocks. To incorporate those features, which cannot be accommodated by the existing generalized autoregressive conditional heteroskedasticity (GARCH) models, we propose a threshold zero-drift GARCH (TZD-GARCH) model. Our empirical studies of the daily INE returns from March 2018 to April 2020 demonstrate the usefulness of the TZD-GARCH model in understanding the empirical features and in precisely forecasting the volatility of INE. Robust checks based on BRE and WTI over various periods further lead to highly consistent results. Applications of news impact curves and Value-at-Risk (VaR) analyses indicate the usefulness of the proposed TZD-GARCH model in practice. Implications include more effectively hedging risks of crude oil futures for policymakers and market participants, as well as the potential market inefficiency of INE relative to WTI and BRE.

1. Introduction

Crude oil is an essential energy commodity. It is well known that crude oil storage of a country is strategical and can largely affect many critical daily necessities, such as fuel consumption. In addition, the demand for the so-called ‘black gold’ is relatively inelastic to the business cycle of the macro economy, and many traders purchase the oil futures to hedge against financial risks including the inflation risk (see, for example, Tang and Xiong [1] and Sadorsky [2]). Consequently, accurate understanding and precise forecasting of the crude oil volatility are important for both policy makers and financial market participants for various aims.
Despite being the second largest economy in the world, China did not have a crude oil future until 26 March 2018. The novel crude oil derivative was launched at the Shanghai International Energy Exchange (INE), a unit of the Shanghai International Futures Exchange. It is expected that INE will assist Chinese corporations and traders in hedging oil price volatility, establishing an Asian oil pricing system, and developing the Chinese yuan (CNY) internationalization. Compared to the two largest global crude oil futures, the Brent (BRE) and West Texas Intermediate (WTI) futures, there are some essential differences for INE. As concluded in Lv et al. [3], both BRE and WTI are based on light sweet low-sulfur crude oil, whereas the underlying asset of INE is medium sour higher-sulfur crude oil. In terms of the trading hours, the INE has formed a 24-h continuous global trading with a trading time zone of New York and London. Finally, the minimum trading margin of INE is 5% of contract value. Since its launch, INE has rapidly attracted a growing number of investors. By the time this paper was written, the trading volume of INE had exceeded the Oman contract. Although still notably behind BRE and WTI, INE has become the largest crude oil futures contract in the Asia-Pacific region. Nevertheless, INE is becoming an important complementary and alternative oil future of BRE and WTI.
Due to the recent impact of the novel coronavirus disease (COVID-19), the global financial markets experienced dramatically large volatility in early 2020. On 20 April 2020, the price of WTI crashed from $18 a barrel to – $38 in a matter of hours. Although a historically notable negative price did not occur for INE, its volatility has also considerably and quickly increased over the first four months of 2020. To study the volatility of INE, we use the Yang and Zhang [4] method, which employs the open, high, low and close prices and is independent of drift and opening gaps to calculate volatility for a financial asset. The derived daily volatility is plotted in Figure 1a, with a sample period of March 2018 to April 2020. With the observed explosive patterns, it is worth investigating further whether the volatility is non-stationary. According to the autoregressive functions (ACFs) shown in Figure 1b, non-geometrically reducing and significant ACFs at large lags are found for the volatility of INE. According to the augmented Dickey-Fuller (ADF) and Phillips-Perron (PP) tests, the p-values of the null hypothesis of the unit root are 0.495 and 0.446, respectively (Note that when the DF-GLS test is employed, the p-value is 0.338, still indicating the existence of unit root. In all cases, the employed unit root tests are just for motivation purposes. A more rigorous test is related to the stability, which is implemented in the empirical analysis of this paper). Such evidence provides strong motivation to consider the non-stationarity in the volatility modelling and forecasting of the novel INE crude oil future.
As for the econometric methodology, GARCH model, which is proposed in the seminal work of Bollerslev [5], has become a standard approach to studying the conditional heteroskedasticity feature of economic and financial time series. For its attractive characteristics such as the capability of modelling the volatility clustering and the flexibility to incorporate extensions, GARCH model and its extensions have been widely employed in oil volatility modelling and forecasting. Influential studies include Chang et al. [6], Hou and Suardi [7] and Basher and Sadorsky [8], and more recent research is conducted by Hou et al. [9], Lin et al. [10] and Marchese et al. [11], among others. However, despite the popularity, many extensions of GARCH are yet to develop well-established asymptotics for the estimators. This insufficiency seriously affects their reliability and/or efficiency in volatility modelling and forecasting, as the related statistical inferences are either questionable or inefficient. Even for the original GARCH model, no consistent estimator of the intercept of the conditional variance equation can be derived, when the underlying sequence is non-stationary [12]. Therefore, reliable forecasts cannot be produced even with the simplest GARCH(1,1) specification.
To resolve this issue, a recent study by Li et al. [13] proposes a Zero-drift GARCH (ZD-GARCH) model, by setting the intercept of the conditional variance to exactly 0. In particular, the ZD-GARCH process is always non-stationary and has well-established asymptotics of its quasi maximum likelihood estimator (QMLE). Moreover, to study the sample path of the heteroskedasticity, a new feature named stability (instability) is recommended to describe the patterns such that the unconditional variances will oscillate between 0 and infinity (converge to 0 or diverge to infinity). From a practical perspective, the new model nests the famous exponentially weighted moving average (EWMA) model developed by the RiskMetrics. Based on the EWMA model, companies like J.P. Morgan estimate and study the daily volatility of financial equities. Such desirable features make ZD-GARCH a suitable alternative to GARCH in modelling and forecasting the potentially non-stationarity of the INE’s future.
Apart from the non-stationarity, the recent oil price crash draws attention to the asymmetric effect of negative and positive news on the volatility of oil futures. The negative shock that took place on 20 April 2020 has apparently resulted in the largest daily volatility ever recorded for the WTI future. This is over twice as much as the daily volatility ever caused by a positive shock. Among the existing studies, mixed results are reported on the matter whether asymmetric volatility exists in crude oil markets or not (see, for example, Engle [14], Nomikos and Andriosopoulos [15], Žikeš and Baruník [16] and Lin et al. [10], among others). Nevertheless, the ZD-GARCH specification is unable to incorporate such a potential asymmetry. This will limit its appropriateness in modelling and forecasting the volatility of crude oil futures.
Inspired by the important works of Glosten et al. [17] and Zakoian [18], we propose a threshold ZD-GARCH (TZD-GARCH) model to study the INE volatility in this paper. Our baseline empirical results focus on the complete sample of INE returns ranging from its launch date (26 March 2018) to 30 April 2020, contrasting the GARCH, threshold GARCH (T-GARCH), ZD-GARCH and TZD-GARCH models. We provide strong evidence of the non-stationarity of the INE volatility that is preliminarily shown in Figure 1, according to the non-stationarity test [12] based on the GARCH and T-GARCH models. Both ZD-GARCH and TZD-GARCH models further suggest significant stability of the non-stationary volatility. The influence of negative shocks is significant and around 40% larger than that of positive shocks, according to the TZD-GARCH model. Via an expanding window approach, we compare the out-of-sample forecasts of the four models. The superiority of our TZD-GARCH over GARCH, T-GARCH and ZD-GARCH is supported by various forecasting error measures, including the root of mean squared error (RMSE), the mean absolute error (MAE), mean absolute percentage error (MAPE), the mean absolute scaled error (MASE) and the Diebold and Mariano [19] (DM) test. To check the robustness of our baseline findings, we also model BRE and WTI returns over three different sample periods: 2018–2020 (the same as INE), 2016–2020 (1000 observations) and 2008–2020 (3000 observations). Highly consistent results still hold, including the non-stationarity, stability, significant asymmetric influence of negative shocks, and the more accurate forecasting performance of TZD-GARCH. Finally, we present the news impact curves and perform the Value-at-Risk (VaR) analyses over INE, BRE and WTI using the fitted TZD-GARCH models. The inspiring and promising results shed light on the usefulness of the proposed TZD-GARCH model in modelling and forecasting volatility of crude oil futures in practice.
The contributions and implications of this paper are twofold. First, both the in-sample volatility features and out-of-sample forecasting results of the INE, BRE and WTI are well examined. Those empirical results strongly support the superiority of TZD-GARCH over existing GARCH-type and ZD-GARCH models, as for the accuracy of volatility forecasting. Further, using both 95% and 99% VaRs, we demonstrate that the actual extreme negative risks of INE, BRE and WTI are in-line with the risk tolerance of those metrics derived from the TZD-GARCH model. Therefore, since oil future has become one of safe haven assets, which are used by various hedge funds and pension funds, the proposed TZD-GARCH model can assist users in constructing accurate risk appetite and portfolio management practices. Second, the evidenced asymmetric effect and estimated news impact curves have at least two implications. For one thing, negative news is more influential than positive news for the crude oil markets. Thus, policymakers and investors should allocate scarcer resources to negative shocks to achieve the best hedging/risk management results to manage risks of macro-economy and/or micro-portfolios. For another, we notice that different from BRE and WTI, the asymmetric effect of the negative shocks is much weaker for the novel INE product. This might be explained by its comparatively smaller trading volume and liquidity, which has limited the response of INE to the market news. Consequently, the market efficiency of INE might be relatively lower than that of WTI and BRE.
The rest of this paper is organized as follows. Section 2 explains the GARCH and ZD-GARCH models. Section 3 proposes the TZD-GARCH model. Empirical results are presented in Section 4. Section 5 briefly discusses some implications. Section 6 concludes the paper.

2. GARCH and Zero-Drift GARCH Models

The classical GARCH(1,1) model has the following specification:
y t = η t h t     and     h t = ω 0 + α 0 y t 1 2 + β 0 h t 1     for     t = 1 , 2 , , n
where y t is the modeled time series, h t is its conditional variance, { η t } is an independently and identically distributed (iid) innovation sequence, and n is the sample size. In the original GARCH(1,1) model developed in Bollerslev [5], { η t } is assumed to be Gaussian. To ensure the non-negativity and weak stationarity, the constraints ω 0 > 0 , 0 α 0 < 1 , 0 β 0 < 1 and 0 α 0 + β 0 < 1 are imposed. Further, with the assumption such that E η t = 0 and E η t 2 = 1 , it can be shown that the unconditional variance of y t is s t 2 = var ( y t ) = ω 0 / ( 1 α 0 β 0 ) , and α 0 + β 0 (or more formally defined as α 0 var ( η t ) + β 0 ) is known as the volatility persistence. In general, it measures how fast a shock to the conditional variance will die out.
As for out-of-sample forecasting, the N-step-ahead forecast of the above GARCH(1,1) model can be derived by
E n h n + N = ω 0 + α 0 E n y n + N 1 2 + β 0 E n h n + N 1
where E n y n + N 1 2 = y n and E n h n + N 1 = h n for N = 1 and E n y n + N 1 2 = E n h n + N 1 for N > 1 . In a stationary case, the long-run forecast will converge to s t 2 .
However, such a prediction is only valid when γ 0 < 0 , where γ 0 is the top Lyapunov exponent defined by
γ 0 = E log ( β 0 + α 0 η t 2 ) .
The reason is that Bougerol and Picard [20] showed that the GARCH(1,1) model is stationary if and only if γ 0 < 0 . Otherwise, y t defined in (1) is heteroskedastic with an exponentially explosive unconditional variance, since s t 2 = ω 0 var ( η t 2 ) + [ var ( η t 2 ) α 0 + β 0 ] s t 1 2 . In such a case, no consistent estimator of ω 0 will be available [12], and thus the out-of-sample forecasting with (2) is not viable.
To resolve this issue, Li et al. [13] propose a novel Zero-drift GARCH (ZD-GARCH) model, which is directly motivated by (1), except that ω 0 is set 0 as follows.
y t = η t h t     and     h t = α 0 y t 1 2 + β 0 h t 1     for     t = 1 , 2 , , n
This simple modification effectively avoids the dilemma of inconsistent estimator of ω 0 when γ 0 0 . Also, the iterative relationship of the unconditional variance of y t defined in (3) is now s t 2 = [ var ( η t 2 ) α 0 + β 0 ] s t 1 2 , such that s t 2 = [ var ( η t 2 ) α 0 + β 0 ] t 1 s 1 2 . Consequently, the ZD-GARCH model is able to incorporate both the unconditional and conditional heteroskedasticity simultaneously.
There are additional attractive merits of the ZD-GARCH model, from both practical and technical aspects. First, it can be straightforwardly shown that (3) nests the famous exponentially weighted moving average (EWMA) model, as proposed in RiskMetrics. The EWMA model is widely employed by financial practitioners. For instance, J.P. Morgan calculates the daily volatility of many assets by EWMA. Second, Li et al. [13] prove that except for a different scale, the sample path of ZD-GARCH has a similar shape as that of the GARCH model. Thus, (3) is an effective alternative of and more convenient than (1). Third, like GARCH, asymptotics of the quasi-maximum likelihood estimator (QMLE) of α 0 and β 0 exist for the ZD-GARCH model. This enables the related statistical inferences. Fourth, Li et al. [13] define the case γ 0 = 0 ( γ 0 0 ) as stable (unstable) and propose a stability test with good power. The concept of stability/instability comprehensively describes various cases of heteroskedasticity. More specifically, | y t | converges to zero or diverges to infinity almost surely (a.s.) at an exponential rate when γ 0 < 0 or γ 0 > 0 , respectively. In contrast, in the stable case, | y t | oscillates randomly between zero and infinity over time. Finally, an associated portmanteau test is developed for the model diagnostic purpose of the ZD-GARCH model. When testing the standardized residuals ( η t 2 ), this test is analogous to the famous Li and Mak [21] test to diagnose potential misspecification for the original GARCH model. Good power is also evidenced via simulations.

3. Threshold Zero-Drift GARCH Model

Despite the effectiveness of the ZD-GARCH model, it cannot accommodate the asymmetric effects of negative and positive shocks to the volatility, which are studied with mixed results for the crude oil futures [10,14,15,16]. In the case of the GARCH framework, influential early efforts to address this issue are explored in Glosten et al. [17] and Zakoian [18]. In this paper, we discuss the specification examined in Zakoian [18], namely the Threshold GARCH (T-GARCH) model with the (1,1) specification as follows:
y t = η t h t     and     h t = ω 0 + α 0 + ( y t 1 + ) 2 + α 0 ( y t 1 ) 2 + β 0 h t 1     for     t = 1 , 2 , , n
where y t 1 + = y t 1 ( y t 1 = y t 1 ) when y t 1 > 0 ( y t 1 < 0 ) and otherwise is equal to 0. Note that α 0 + and α 0 are the impacts of positive and negative shocks, respectively. When asymmetric impacts exist, we have that α 0 + α 0 .
When { η t } is assumed symmetric with unit variance, the volatility persistence is measured by 0.5 ( α 0 + + α 0 ) + β 0 . The Lyapunov exponent γ 0 changes to E log ( β 0 + α 0 + ( η t + ) 2 + α 0 ( η t ) 2 ) . To ensure the stationarity and non-negativity, we require that 0 0.5 ( α 0 + + α 0 ) + β 0 < 1 .
Similar to the GARCH model, T-GARCH shares those drawbacks discussed in Section 2, including the non-existence of the consistent estimator of ω 0 . A natural solution is to extend the idea of the ZD-GARCH to develop a threshold Zero-drift GARCH (TZD-GARCH) model by letting ω 0 = 0 . With the usual (1,1) order, the specification is described below
y t = η t h t     and     h t = α 0 + ( y t 1 + ) 2 + α 0 ( y t 1 ) 2 + β 0 h t 1     for     t = 1 , 2 , , n .
Let θ = def ( α 0 + , α 0 , β 0 ) Θ be the unknown parameter of TZD-GARCH(1,1) model described by (5), and Θ is the parametric space. Assume that { y t } with a sample size of n is an TZD-GARCH(1,1) process, the Gaussian QMLE is defined as follows
θ ^ n = arg min θ Θ t = 1 n l n ( θ ) , where     l n ( θ ) = log ( σ t 2 ( θ ) ) + y t 2 σ t 2 ( θ )
and σ t 2 ( θ ) = α 0 + ( y t 1 + ) 2 + α 0 ( y t 1 ) 2 + β 0 σ t 1 2 ( θ ) for all t = 1 , , n . The asymptotics of which would follow naturally from the discussions in Li et al. [13]. As for the initial values, without loss of generality, y 0 may be chosen as the first non-zero observation, whereas σ 0 2 ( θ ) = 0 .
As stated in Section 2, the Lyapunov exponent γ 0 is critical to determining the stationarity (stability) for GARCH-type (ZD-GARCH-type) models. For the GARCH scenario, Francq and Zakoïan [12] propose a stationarity test to examine H 0 : γ 0 0 (non-stationary) against H 1 : γ 0 < 0 (stationary). As for the ZD-GARCH framework, which only considers the non-stationary process, Li et al. [13] develop a stability test to investigate H 0 : γ 0 = 0 (stable) against H 1 : γ 0 0 . For both tests, a natural plug-in estimator of γ 0 is defined below
γ ^ n = 1 n t = 1 n log ( β ^ n + α ^ n η ^ t 2 )
where α ^ n and β ^ n are the QMLE of α 0 and β 0 , respectively.
Analogous to this specification, we derive the following estimator of γ 0 , which will be used in both the T-GARCH and TZD-GARCH models
γ ^ n = 1 n t = 1 n log ( β ^ n + α ^ n + ( η ^ t + ) 2 + α ^ n ( η ^ t ) 2 ) .
Its asymptotics in the case of T-GARCH model may be derived following the proof of Theorem 3.1 in Francq and Zakoïan [12]. The asymptotics of TZD-GARCH can be shown using the results of Li et al. [13].
To implement the stability test, a t-type test statistic can be constructed to detect H 0 : γ 0 = 0 :
T n = n γ ^ n σ ^ η where     σ ^ η 2 = 1 n t = 1 n [ log ( β ^ n + α ^ n + ( η ^ t + ) 2 + α ^ n ( η ^ t ) 2 ) ] 2 γ ^ n 2 .
Since T n defined above asymptotically follows N ( 0 , 1 ) , we will then reject H 0 as for a usual t-type test. For instance, at the 5% level, we will reject H 0 and argue the instability when | T n | > 1.96 .
Note that in the case of the non-stationarity test for GARCH-type models, we will perform a one-way test. Therefore, although T n will be constructed with analogous formulas to those of ZD-GARCH-type models under H 0 : γ 0 0 , we will only claim stationarity at 5% when T n < 1.645 .
Finally, we now check the adequacy of the proposed TZD-GARCH model with a portmanteau test. We firstly follow Li et al. [13] and define the lag-k autocorrelation function (ACF) of the r-th power of the absolute residuals { | η ^ t | r } as
ρ ^ r ( k ) = t = k + 1 n ( | η ^ t | r a ^ r ) ( | η ^ t k | r a ^ r ) t = k + 1 n ( | η ^ t | r a ^ r ) 2 ,
where r > 0 , k is a positive integer, and a ^ r = n 1 t = 1 n | η ^ t | r . The limiting distribution of our portmanteau test is discussed in the following theorem.
To perform the proposed portmanteau test, firstly note that Ω ^ 2 ( m ) defined in Theorem 4 is just its sample counterpart using η ^ t (We do not explicitly discuss the case β 0 = 0 in this paper. For the interested readers, Ω ^ 2 ( m ) reduces to I m in such a scenario). The test statistic is then defined by
Q 2 ( m ) = n ( n + 2 ) ρ ^ 2 ( 1 ) n 1 , , ρ ^ 2 ( m ) n m [ Ω ^ 2 ( m ) ] 1 ρ ^ 2 ( 1 ) n 1 , , ρ ^ 2 ( m ) n m
where m is then the tested order of lags, the common choices of which are 6 and 12 in practice [13]. From Theorem 4, it is straightforward to see that Q 2 ( m ) asymptotically follows a χ m 2 distribution. The rejection of H 0 such that the fitted model is adequate (with no ARCH errors left for the standardized residuals) will be stated as a standard one-way χ 2 -test. For instance, when m = 6 , we will not reject H 0 when Q 2 ( 6 ) < 12.59 at 5% significant level.

4. Empirical Results

In this section, we study the novel crude oil future traded at the Shanghai International Energy Exchange (INE) using the four GARCH- and ZD-GARCH-type models discussed in this paper. We focus on the daily returns ranging from 26 March 2018 (the launch date of INE) to 30 April 2020. For comparison and robustness check purposes, we also discuss the modelling results of the two popular international oil futures: Brent crude oil (BRE) and West Texas Intermediate crude oil (WTI). The same range of 2018–2020 is used for comparison against the results of INE. In addition, we also consider larger sample sizes of BRE and WTI, the starting date goes back to mid-2008.
To be consistent with Figure 1 for the case of INE, we present the volatilities of BRE and WTI, both calculated via the Yang and Zhang [4] method, in Figure 2. For both contracts, their volatilities exhibit much similar temporal trend: oscillating between 0 and 1 from mid-2008 to early-2020, and growing rapidly up to 1.5 for BRE and 5 for WTI by the end of April 2020. This indicates the potential non-stationarity, as observed for the volatility of INE and described in Section 1. In Panel A of Table 1, we present descriptive statistics of the three daily returns in percentages (log-differences of daily closing price times 100) over 2018–2020 and 2008–2020 (for BRE and WTI only). All returns demonstrate fairly similar features. When contrasting INE, BRE and WTI over the same sample period of 2018–2020, it can be seen that returns of BRE and WTI are more volatile than those of INE. This may be explained by the differences in trading volumes and liquidities of the three products.

4.1. Volatility Modelling and Forecasting Results: 2018–2020

We now analyze our baseline results of the INE volatility. The daily returns of INE are firstly demeaned, and then fitted by each of the GARCH, T-GARCH, ZD-GARCH and TZD-GARCH models individually, all with the (1,1) specification described in Section 2. The fitted results are presented in Table 2.
All parameters of the GARCH model are significant at 5% level. In particular, the volatility persistence is over 0.95, indicating potential non-stationarity, which is consistent with our observation of Figure 1. When testing the non-stationarity ( γ 0 0 ) using the test developed in Francq and Zakoïan [12], we observe that the γ ^ n is close to 0, and T n = 1.31 > 1.65 , leading to non-rejection of the non-stationary hypothesis. Consequently, the estimate of ω ^ n is inconsistent, and forecasting with it may cause inaccuracy. The same conclusions mostly hold for the fitted T-GARCH model. It is notable that the estimate of α ^ n + is only 0.06 and insignificantly different from 0. In contrast, the estimate of α ^ n is quite large at 0.27 and significant at 5% level. This suggests that only negative shocks can significantly influence the conditional variance. Nevertheless, the Li and Mak [21] test statistics Q 2 ( 6 ) and Q 2 ( 12 ) suggest no model misspecification in both cases, whereas the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) prefer T-GARCH to GARCH.
Since all the unit root test (described in Section 1) and stationarity tests support the non-stationarity, it is more appropriate to employ the ZD-GARCH-type models. From Table 2, the estimated volatility persistences of both ZD-GARCH and TZD-GARCH are slightly over 1, suggesting that the unconditional variance s t 2 is increasing over time. The stability test, however, supports the null hypothesis of γ 0 = 0 . Thus, as expected for financial products including the oil futures, volatility of INE will not diverge to infinity in the long run. As for the asymmetric impacts, TZD-GARCH indicates that the negative shocks have greater influence on the conditional variance, comparing to the positive shocks. On average, with the same magnitude, the influence of a negative shock is around 40% larger than that of a positive shock. Finally, according to our proposed portmanteau test statistics, both ZD-GARCH and TZD-GARCH models are adequate, with the results of TZD-GARCH comparatively better. The same conclusion also holds when the AIC and BIC are employed. Overall, the TZD-GARCH model is the most preferred model among the four competing specifications, when the BIC is employed.
To compare with the results of INE, we now consider modelling BRE and WTI volatilities by all the four investigated models over the same period of 2018–2020. The results are reported in Table 3. It can be seen that all our previous conclusions are robust for both BRE and WTI volatilities. In short, non-stationarity is supported by GARCH-type models in both cases. T-GARCH model suggests that only negative shocks can significantly influence the conditional variance. Stability significantly holds, as argued by the ZD-GARCH-type models. The TZD-GARCH model produces preferable results to ZD-GARCH, in terms of the model misspecification test and information criteria. There are only one outstanding difference, comparing to the results of Table 2. With the same magnitude, a negative shock of BRE (WTI) is expected to be 120% (200%) more influential than a positive shock. This suggests that the asymmetric effects are much more significant for the BRE and WTI than for the INE.
As argued above, the two GARCH-type models may not produce accurate forecasts, since the INE exhibits non-stationarity. To examine this, we produce out-of-sample forecasting results via an expanding window approach. Altogether, there are around 500 observations of the INE daily returns. We use the first 400 as the starting training sample to fit each of the GARCH, T-GARCH, ZD-GARCH and TZD-GARCH models. Following the descriptions in Section 2 and Section 3, we then calculate the one-step-ahead out-of-sample forecast of the conditional variance. Next, we include the t = 401 observation in the training sample and produce another one-step-ahead out-of-sample forecast at time t = 402 . We continue this procedure until the 100 out-of-sample forecasts are all collected. Finally, assume that the true values of volatility are those estimated via the Yang and Zhang [4] method, we derive four forecasting error measures as follows:
RMSE = 1 100 N = 1 100 e ^ n + N 2 MAE = 1 100 N = 1 100 | e ^ n + N | MAPE = 1 100 N = 1 100 | e ^ n + N | A n + N RMSE = 1 100 N = 1 100 | e ^ n + N | / 1 n 1 t = 2 n | A t A t 1 |
where A t is the actual volatility at time t, e ^ n + N is the forecast error (actual value-forecast value) at time n + N , RMSE is the root of mean squared error, MAE is the mean absolute error, MAPE is the mean absolute percentage error, and MASE is the mean absolute scaled error. The results of the four measures are presented in Table 4.
From Table 4, we observe that TZD-GARCH is the best performing model out of the four competing specifications, as recognized by all error measures except MAPE. It is also worth noting that the ZD-GARCH-type models are preferred to the GARCH-type models in all cases, which is consistent with the tested non-stationarity in volatility. In particular, T-GARCH model leads to the worst performance, despite its better AIC and BIC than those of the GARCH and ZD-GARCH models. To statistically test the differences in out-of-sample forecasting, we employ the famous Diebold and Mariano [19] (DM) test. For its pairwise nature, we employ the DM test to evaluate the relative forecasting accuracy of TZD-GARCH against each of the GARCH, T-GARCH and ZD-GARCH models. According to the p-values presented in Table 4, the null hypothesis of identical performance is rejected in all cases, and our proposed TZD-GARCH model significantly beats the GARCH, T-GARCH and ZD-GARCH counterparties at 5% level.
Finally, we examine the out-of-sample forecast results of BRE and WTI using the same setting as INE. The four forecasting error measures are presented in Table 5. Our conclusions of Table 4 largely hold here. In almost all cases, the TZD-GARCH model beats the rest according to various forecasting error measures. In all scenarios, the pairwise DM test further supports that the forecasts of TZD-GARCH are significantly more accurate than those of the other competing models at 10% level.

4.2. Additional Robustness Check

We now consider additional robustness checks of the modelling results. Note that the period over 2018–2020 has around 500 observations, which may be treated as the small sample case. Thus, we further explore the scenarios with 1000 observations (2016–2020) and 3000 observations (2008–2020), the fitted results of which are presented in Table 6 and Table 7, respectively. Overall, the findings are largely robust against those described above in both cases, with several notable differences. First, the results of TZD-GARCH indicate that the asymmetric effects of negative shocks are stronger when the sample size increases to 1000 and/or 3000, compared to those observed in Table 3. Second, TZD-GARCH is the only model that can pass the model misspecification test at 5%, according to both the Q 2 ( 6 ) and Q 2 ( 12 ) , when the full sample of 2008–2020 for BRE is fitted. Third, despite the potentially unreliability of the estimate of ω ^ n , both GARCH and T-GARCH models suggest that the estimates reduce rapidly with the sample size. For instance, contrasting Table 3 and Table 7, the reduction of ω ^ n is nearly 90% and over 90% for BRE and WTI, respectively. This may further support the preference of the ZD-GARCH-type models to GARCH-type competitors, when the sample size of the crude oil futures increases. Last but not least, the standard errors of QMLE of all parameters in the TZD-GARCH model almost uniformly decline with the increase of sample coverage. This provides additional empirical evidence of the asymptotics of QMLE as argued in Section 3.
Finally, we examine the robustness of the out-of-sample forecast results. Similar to the in-sample modelling analyses, we consider another two different starting size of the training sample: 900 (2016–2019) and 2900 (2008–2019). In both cases, the test sample is the same (i.e., the last 100 observations, ranging from the end of November 2019 to the end of April 2020). The four forecasting error measures are presented in Table 8. Our conclusions of Table 4 and Table 5 largely hold here. Nevertheless, it is interesting to note that when more observations are included, the performances of GARCH and T-GARCH models almost uniformly improve. When the complete period (2008–2019) is considered, the measures of GARCH and T-GARCH models are not too much worse than those of ZD-GARCH and TZD-GARCH models. This is as expected, since the estimated ω ^ n of the GARCH-type models are close to 0 in this case, as observed in Table 7. However, comparing the results of Panels A, B and C of Table 8, ZD-GARCH-type models demonstrate relatively similar results. Hence, it may indicate that the proposed TZD-GARCH model is not ‘hungry for data’ in order to precisely forecast volatility of crude oil futures. Therefore, when a new product, such as INE, becomes available at the market, the corresponding sample size is limited. In such a case, the market participants are more motivated to use the TZD-GARCH model to obtain reliable volatility forecasts.
To sum up, with various sample sizes, the analyses of BRE and WTI volatilities lead to largely robust results against those of the INE volatility. The only outstanding difference is the much stronger additional influence of the negative shocks on the conditional variance of BRE and WTI returns than that of INE. The out-of-sampling forecasting results support the superiority of the proposed TZD-GARCH model over the other competitors. This is particularly important when market participants are facing a new oil product, and need to build up their strategies for hedging, speculating and portfolio management with limited data.

4.3. Other Practical Applications: News Impact Curves and Value-at-Risk Analyses

For its outstanding in-sample and out-of-sample performance, we now discuss other practical applications of the TZD-GARCH model when employed to study the oil volatilities. For the seek of the maximized data availability, we use the TZD-GARCH model fitted with the INE data spanning 2018–2020 (Table 2), and BRE and WTI data ranging over 2008–2020 (Table 7). We discuss two common practical applications in this section, including the news impact curves and Value-at-Risk (VaR) analyses.
The news impact curve is proposed in Engle and Ng [22] to study the (asymmetric) influences of news (standardized shocks) on the conditional volatility. For the case of the TZD-GARCH model, we investigate the case such that the conditional variance is evaluated at the magnitude of the sample variance. This is slightly different from the original news impact curve for the GARCH model, where the unconditional variance is assumed homoskedastic. For a given value of η t , the curve of the TZD-GARCH is calculated as follows
β ^ n h t + α ^ n + ( η ^ t + ) 2 + α ^ n ( η ^ t ) 2
where h t is set to the sample variance of the returns, and η t ranges from −3 to 3, which covers 99.7% of the Gaussian distributed shocks. The results of INE, BRE and WTI are plotted in Figure 3.
From Figure 3, we observe that when η t = 0 , the starting influence ranges from 2.1 (INE) to 2.2 (BRE) and 2.7 (WTI). With η t being more negative, the accelerations of the three curves are much similar. When η t reaches −3, the news impact curve increases to 2.4 for INE and BRE and 2.9 for WTI. As for the positive news, the increase in slopes is much faster for INE than for BRE and WTI. This is consistent with the observed smaller relative asymmetric effect of INE than those of BRE and WTI, as discussed above. When η t is 3, the news impact varies from around 2.35 for INE, 2.25 for BRE and 2.75 for WTI.
The second application considered here is the popular VaR analysis. We calculate both the 95% and 99% VaRs using the fitted conditional volatilities of TZD-GARCH for INE, BRE and WTI. More specifically, the 95% (99%) VaR at time t is the estimated σ ^ t times −1.96 (−2.33). We plot the results in Figure 4, together with the demeaned daily returns. Overall, the VaRs well capture the temporal patterns of volatilities in all cases. When counting the percentages of observations that exceed the 95% VaR and 99% VaRs, the results are 6.4% and 1.2% for INE, 4.7% and 1.8% for BRE and 4.9% and 1.7% for WTI. It is also worth mentioning that the negative daily return of INE does not exceed the estimated 95% VaR on 20 April 2020, the date of the crash in WTI. Consequently, even with a Gaussian distribution, the resulting VaRs of the TZD-GARCH are satisfactorily accurate for all the three oil future volatilities.

5. Discussion

Our empirical results, especially those in Section 4.3, demonstrate implications that are of key importance for both policymakers and investors. First, consistent with the estimated strong asymmetric effects, the news impact curve indicates the importance to monitor the negative news on the oil markets. This is especially critical for the BRE and WTI, since the asymmetric effect is stronger than that of INE. For policymakers to hedge the risks of oil future movements to the macro-economy, this suggests that an uneven allocation of scarce resource should be made for the positive and negative shocks. Specifically, comparatively more resource should be distributed to monitor the arrival of negative shocks to the market to achieve the best hedging effectiveness.
In addition, the relatively more important role of negative news for BRE and WTI may be explained by their much larger trading volumes and liquidities than those of the INE. Thus, when negative news of crude oil is received, prices of BRE and WTI will respond faster with more volatilities. For policymakers and market participants, this might indicate the less market efficiency of INE, compared to BRE and WTI for the crude oil future. A quantitative comparison of this will involve the price discovery and information share [23], which remains for future works. Nevertheless, relevant trading limitations/conditions of INE may need to be redesigned by the policymaker to improve its market efficiency to attract more investors.
Last but not least, our results suggest that performing the VaR analyses with TZD-GARCH can assist policymakers and market traders in constructing accurate risk measures and other risk/portfolio management applications. For instance, studies including Van Eyden et al. [24] argue that changes in conditional volatility of crude oil could indicate recession in the economic growth globally. Further, as suggested by research such as Bampinas and Panagiotidis [25], the share of crude oil future has steadily increased in various global hedge funds, pension funds, and insurance companies, as alternative assets of safe heaven. Understanding the tail risks of crude oil futures using VaR and the proposed TZD-GARCH model would therefore be critical to the effectiveness of portfolio management. This is particularly important in the response of accelerated uncertainty over recent periods, especially after the pandemic caused by COVID-19.

6. Conclusions

This paper proposes a threshold zero-drift GARCH (TZD-GARCH) model to study and forecast the potentially non-stationary volatility of oil futures. Nesting the recent ZD-GARCH model [13] as a special case, the new TZD-GARCH model can incorporate the asymmetric influence of positive and negative news on the conditional variance. The key conclusions of this paper are summarized below. First, for the three examined crude oil futures, the INE returns over 2018–2020, and the BRE and WTI returns over 2018–2020, 2016–2020 and 2008–2020, the significant non-stationarity (stability) is evidenced by GARCH and T-GARCH (ZD-GARCH and TZD-GARCH) models. This is consistent with the empirical observations, as all three oil futures have demonstrated explosive volatilities over early-2020, which, however, is not expected to last long from a practical perspective. Second, the TZD-GARCH model almost uniformly beats the GARCH, T-GARCH and ZD-GARCH models for volatility modelling and forecasting. This is robust across various forecasting performance valuation criteria/test, oil future products, model diagnostics and sample periods. Finally, practical applications such as the Value-at-Risk analyses can be performed with the TZD-GARCH model for both policy makers and oil market participants for market monitoring and risk management purposes.
The limitation of this research is in the employed univariate methodological framework. To address this, there are two potential future directions to extend this paper. First, as examined in Chang et al. [6] and Marchese et al. [11], multivariate GARCH-type models are employed to study multiple oil futures simultaneously. From a technical point of view, it is worth extending the current univariate ZD-GARCH-type framework to be multivariate, which enables the empirical analyses such as the existence of volatility spillovers. Second, as shown in Section 4, the role of negative news is much less important for INE than those for BRE and WTI. This might be due to the slower response speed of INE with respect to the market news. Future works may consider quantitative comparison of this speed, which could adopt the price discovery method and information share [23].

Author Contributions

All authors contributed equally to this work. All authors wrote, reviewed and commented on the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Tang, K.; Xiong, W. Index investment and the financialization of commodities. Financ. Anal. J. 2012, 68, 54–74. [Google Scholar] [CrossRef]
  2. Sadorsky, P. Modeling volatility and correlations between emerging market stock prices and the prices of copper, oil and wheat. Energy Econ. 2014, 43, 72–81. [Google Scholar] [CrossRef]
  3. Lv, F.; Yang, C.; Fang, L. Do the crude oil futures of the Shanghai International Energy Exchange improve asset allocation of Chinese petrochemical-related stocks? Int. Rev. Financ. Anal. 2020, 71, 101537. [Google Scholar] [CrossRef]
  4. Yang, D.; Zhang, Q. Drift-independent volatility estimation based on high, low, open, and close prices. J. Bus. 2000, 73, 477–492. [Google Scholar] [CrossRef] [Green Version]
  5. Bollerslev, T. Generalized autoregressive conditional heteroskedasticity. J. Econom. 1986, 31, 307–327. [Google Scholar] [CrossRef] [Green Version]
  6. Chang, C.L.; McAleer, M.; Tansuchat, R. Crude oil hedging strategies using dynamic multivariate GARCH. Energy Econ. 2011, 33, 912–923. [Google Scholar] [CrossRef] [Green Version]
  7. Hou, A.; Suardi, S. A nonparametric GARCH model of crude oil price return volatility. Energy Econ. 2012, 34, 618–626. [Google Scholar] [CrossRef]
  8. Basher, S.A.; Sadorsky, P. Hedging emerging market stock prices with oil, gold, VIX, and bonds: A comparison between DCC, ADCC and GO-GARCH. Energy Econ. 2016, 54, 235–247. [Google Scholar] [CrossRef] [Green Version]
  9. Hou, Y.; Li, S.; Wen, F. Time-varying volatility spillover between Chinese fuel oil and stock index futures markets based on a DCC-GARCH model with a semi-nonparametric approach. Energy Econ. 2019, 83, 119–143. [Google Scholar] [CrossRef]
  10. Lin, Y.; Xiao, Y.; Li, F. Forecasting crude oil price volatility via a HM-EGARCH model. Energy Econ. 2020, 87, 104693. [Google Scholar] [CrossRef]
  11. Marchese, M.; Kyriakou, I.; Tamvakis, M.; Di Iorio, F. Forecasting crude oil and refined products volatilities and correlations: New evidence from fractionally integrated multivariate GARCH models. Energy Econ. 2020, 88, 104757. [Google Scholar] [CrossRef]
  12. Francq, C.; Zakoïan, J.M. Strict stationarity testing and estimation of explosive and stationary generalized autoregressive conditional heteroscedasticity models. Econometrica 2012, 80, 821–861. [Google Scholar]
  13. Li, D.; Zhang, X.; Zhu, K.; Ling, S. The ZD-GARCH model: A new way to study heteroscedasticity. J. Econom. 2018, 202, 1–17. [Google Scholar] [CrossRef] [Green Version]
  14. Engle, R.F. Long-term skewness and systemic risk. J. Financ. Econom. 2011, 9, 437–468. [Google Scholar] [CrossRef]
  15. Nomikos, N.; Andriosopoulos, K. Modelling energy spot prices: Empirical evidence from NYMEX. Energy Econ. 2012, 34, 1153–1169. [Google Scholar] [CrossRef]
  16. Žikeš, F.; Baruník, J. Semi-parametric conditional quantile models for financial returns and realized volatility. J. Financ. Econom. 2015, 14, 185–226. [Google Scholar] [CrossRef] [Green Version]
  17. Glosten, L.R.; Jagannathan, R.; Runkle, D.E. On the relation between the expected value and the volatility of the nominal excess return on stocks. J. Financ. 1993, 48, 1779–1801. [Google Scholar] [CrossRef]
  18. Zakoian, J.M. Threshold heteroskedastic models. J. Econ. Dyn. Control 1994, 18, 931–955. [Google Scholar] [CrossRef]
  19. Diebold, F.X.; Mariano, R.S. Comparing predictive accuracy. J. Bus. Econ. Stat. 2002, 20, 134–144. [Google Scholar] [CrossRef]
  20. Bougerol, P.; Picard, N. Stationarity of GARCH processes and of some nonnegative time series. J. Econom. 1992, 52, 115–127. [Google Scholar] [CrossRef]
  21. Li, W.; Mak, T. On the squared residual autocorrelations in non-linear time series with conditional heteroskedasticity. J. Time Ser. Anal. 1994, 15, 627–636. [Google Scholar] [CrossRef]
  22. Engle, R.F.; Ng, V.K. Measuring and testing the impact of news on volatility. J. Financ. 1993, 48, 1749–1778. [Google Scholar] [CrossRef]
  23. Hasbrouck, J. One security, many markets: Determining the contributions to price discovery. J. Financ. 1995, 50, 1175–1199. [Google Scholar] [CrossRef]
  24. Van Eyden, R.; Difeto, M.; Gupta, R.; Wohar, M.E. Oil price volatility and economic growth: Evidence from advanced economies using more than a century’s data. Appl. Energy 2019, 233, 612–621. [Google Scholar] [CrossRef] [Green Version]
  25. Bampinas, G.; Panagiotidis, T. Oil and stock markets before and after financial crises: A local Gaussian correlation approach. J. Futur. Mark. 2017, 37, 1179–1204. [Google Scholar] [CrossRef]
Figure 1. Volatility of INE and associated ACFs. (a) Volatility. (b) ACFs. Note: this figure presents the estimated volatility of the INE returns from the period 2018–2020 via the Yang and Zhang [4] method. The associated ACFs of the volatility are also reported.
Figure 1. Volatility of INE and associated ACFs. (a) Volatility. (b) ACFs. Note: this figure presents the estimated volatility of the INE returns from the period 2018–2020 via the Yang and Zhang [4] method. The associated ACFs of the volatility are also reported.
Mathematics 10 02757 g001
Figure 2. Volatility of BRE and WTI: 2008–2020. (a) BRE. (b) WTI. Note: this figure presents the estimated volatility of the BRE and WTI returns over 2008–2020 via the Yang and Zhang [4] method.
Figure 2. Volatility of BRE and WTI: 2008–2020. (a) BRE. (b) WTI. Note: this figure presents the estimated volatility of the BRE and WTI returns over 2008–2020 via the Yang and Zhang [4] method.
Mathematics 10 02757 g002
Figure 3. News impact curves. (a) INE. (b) BRE. (c) WTI. Note: this figure presents the fitted news impact curves of INE, BRE and WTI via the TZD-GARCH model. The sample ranges from 2018 to 2020 for INE, and from 2008 to 2020 for BRE and WTI.
Figure 3. News impact curves. (a) INE. (b) BRE. (c) WTI. Note: this figure presents the fitted news impact curves of INE, BRE and WTI via the TZD-GARCH model. The sample ranges from 2018 to 2020 for INE, and from 2008 to 2020 for BRE and WTI.
Mathematics 10 02757 g003
Figure 4. Value-at-Risk analyses. (a) INE. (b) BRE. (c) WTI. Note: this figure presents the Value-at-Risk results of INE, BRE and WTI via the TZD-GARCH model. The sample ranges from 2018 to 2020 for INE, and from 2008 to 2020 for BRE and WTI. Dashed red and dashed green curves are the fitted 95% and 99% Value-at-Risk, respectively.
Figure 4. Value-at-Risk analyses. (a) INE. (b) BRE. (c) WTI. Note: this figure presents the Value-at-Risk results of INE, BRE and WTI via the TZD-GARCH model. The sample ranges from 2018 to 2020 for INE, and from 2008 to 2020 for BRE and WTI. Dashed red and dashed green curves are the fitted 95% and 99% Value-at-Risk, respectively.
Mathematics 10 02757 g004
Table 1. Descriptive statistics.
Table 1. Descriptive statistics.
MeanStd. Dev.Median Q 0.05 Q 0.25 Q 0.75 Q 0.95
Panel A: 2018–2020
INE−0.10882.21260.0239−3.9135−1.03911.05902.8180
BRE−0.21743.12990.0146−4.4388−1.23530.97723.1626
WTI−0.23544.24350.0156−5.0563−1.19941.09933.5455
Panel B: 2008–2020
BRE−0.05172.25690.0000−3.5725−1.09050.99543.2039
WTI−0.06042.80700.0214−3.9148−1.17971.12543.6384
Note: this table presents the descriptive statistics of the INE, BRE and WTI daily returns over 2018–2020 and 2008–2020. Std. Dev. is the standard deviation. Q0.05, Q0.25, Q0.75 and Q0.95 are the 5th, 25th, 75th and 95th percentiles, respectively.
Table 2. Empirical results: INE.
Table 2. Empirical results: INE.
GARCHT-GARCHZD-GARCHTZD-GARCH
ω ^ n 0.2475 *0.3649 *
(0.1129)(0.1223)
α ^ n or α ^ n + 0.1667 *0.06060.1319 *0.1088 *
(0.0454)(0.0354)(0.0358)(0.0382)
β ^ n 0.7843 *0.7491 *0.8902 *0.8930 *
(0.0582)(0.0598)(0.0268)(0.0290)
α ^ n 0.2748 * 0.1491 *
(0.0752) (0.0132)
v ^ n 0.95110.91161.02171.0204
γ ^ n −0.0131−0.01640.00270.0026
(0.2224)(0.2601)(0.1759)(0.1742)
T n −1.3135−1.41080.34320.3337
log lik.−1060−1054−1063−1055
AIC2127211521302116
BIC2140213221382129
Q 2 ( 6 ) 3.505.635.332.78
Q 2 ( 12 ) 8.479.148.268.10
Note: this table presents empirical results of the fitted GARCH, T-GARCH, ZD-GARCH and TZD-GARCH models of INE daily returns over 2018–2020. ω ^ n , α ^ n , α ^ n + , α ^ n and β ^ n are the model parameters. v ^ n is the sample volatility persistence. γ ^ n is the Lyapunov exponent. Values in the parentheses are the corresponding standard errors. Tn is the stationarity (GARCH and T-GARCH) and stability (ZD-GARCH and TZD-GARCH) test statistic. log lik. is the log likelihood. AIC and BIC are the Akaike Information Criterion and Bayesian Information Criterion, respectively. Q2(6) and Q2(12) are the portmanteau test statistics at the 6th and 12th lags of standardized residuals, respectively. * denotes that the corresponding test is significant at 5% level.
Table 3. Empirical results: BRE and WTI (2018–2020).
Table 3. Empirical results: BRE and WTI (2018–2020).
BREWTI
GARCHT-GARCHZD-GARCHTZD-GARCHGARCHT-GARCHZD-GARCHTZD-GARCH
ω ^ n 0.2382 *0.5935 * 0.3151 *0.4291 *
(0.1111)(0.2290) (0.1497)(0.1775)
α ^ n or α ^ n + 0.2081 *0.05560.1629 *0.0816 *0.2295 *0.03030.1796 *0.0686 *
(0.0474)(0.0352)(0.0307)(0.0324)(0.0475)(0.0353)(0.0308)(0.0300)
β ^ n 0.7947 *0.6653 *0.8826 *0.9058 *0.7743 *0.7693 *0.8718 *0.9017 *
(0.0472)(0.0912)(0.0187)(0.0190)(0.0506)(0.0660)(0.0186)(0.0188)
α ^ n 0.5406 * 0.1885 * 0.4038 * 0.2096 *
(0.1485) (0.0323) (0.0959) (0.0361)
v ^ n 1.00150.93341.04381.02871.00160.95871.04861.0295
γ ^ n −0.0110−0.01170.00440.0046−0.0063−0.00990.00560.0059
(0.2820)(0.2958)(0.2311)(0.2100)(0.2994)(0.3412)(0.2429)(0.2229)
T n −0.8769−0.92750.42580.4898−0.4723−0.65050.51550.5918
log lik.−1121−1107−1123−1110−1171−1159−1176−1163
AIC22482221225022262348232523562332
BIC22612238225822392360234223642344
Q 2 ( 6 ) 6.476.866.075.447.037.116.615.62
Q 2 ( 12 ) 8.598.698.337.298.838.698.887.62
Note: this table presents empirical results of the fitted GARCH, T-GARCH, ZD-GARCH and TZD-GARCH models of BRE and WTI daily returns over 2018–2020. ω ^ n , α ^ n , α ^ n + , α ^ n and β ^ n are the model parameters. v ^ n is the sample volatility persistence. γ ^ n is the Lyapunov exponent. Values in the parentheses are the corresponding standard errors. Tn is the stationarity (GARCH and T-GARCH) and stability (ZD-GARCH and TZD-GARCH) test statistic. log lik. is the log likelihood. AIC and BIC are the Akaike Information Criterion and Bayesian Information Criterion, respectively. Q2(6) and Q2(12) are the portmanteau test statistics at the 6th and 12th lags of standardized residuals, respectively. * denotes that the corresponding test is significant at 5% level.
Table 4. Out-of-sample forecasting results: INE.
Table 4. Out-of-sample forecasting results: INE.
GARCHT-GARCHZD-GARCHTZD-GARCH
RMSE0.40940.60320.30280.2849
MAE0.29610.41670.22170.2169
MAPE4.47054.80822.12032.2031
MASE1.89892.67271.42211.3915
DM0.0028 *0.0002 *0.0209 *-
Note: this table presents the out-of-sample forecasting results of INE volatility between the end of November 2019 and the end of April 2020. The training sample ranges from the end of March 2018 to the end of November 2019. The benchmark volatility is produced via the model described in Yang and Zhang [4]. RMSE is the root of mean squared error. MAE is the mean absolute error. MAPE is the mean absolute percentage error. MASE is the mean absolute scaled error. Bold numbers indicate the smallest forecasting errors for each criterion. DM is the p-value of the Diebold and Mariano [19] test, which is performed pair-wisely by contrasting forecasts of TZD-GARCH to those of one of the rest three models. In each case, the null (alternative) hypothesis is that the forecasts of TZD-GARCH are as accurate as (more accurate than) those of the tested model. * denotes that the corresponding test is significant at 5% level.
Table 5. Out-of-sample forecasting results (2018–2019): BRE and WTI.
Table 5. Out-of-sample forecasting results (2018–2019): BRE and WTI.
BRE WTI
GARCH T-GARCH ZD-GARCH TZD-GARCH GARCH T-GARCH ZD-GARCH TZD-GARCH
RMSE0.57790.78630.41030.37510.41960.46520.37830.3740
MAE0.35880.51520.27520.23500.24120.24590.25190.2511
MAPE0.36840.50270.27860.23290.46210.46660.44620.4096
MASE3.46824.97982.66032.27162.89252.90162.83182.7779
DM0.0029 *0.0003 *0.0014 *-0.0491 *0.0410 *0.0654-
Note: this table presents the out-of-sample forecasting results of BRE and WTI volatility between the end of November 2019 and the end of April 2020. The training sample ranges from the end of March 2018 to the end of November 2019. The benchmark volatility is produced via the model described in Yang and Zhang [4]. RMSE is the root of mean squared error. MAE is the mean absolute error. MAPE is the mean absolute percentage error. MASE is the mean absolute scaled error. Bold numbers indicate the smallest forecasting errors for each criterion. DM is the p-value of the Diebold and Mariano [19] test, which is performed pair-wisely by contrasting forecasts of TZD-GARCH to those of one of the rest three models. In each case, the null (alternative) hypothesis is that the forecasts of TZD-GARCH are as accurate as (more accurate than) those of the tested model. * denotes that the corresponding test is significant at 5% level.
Table 6. Empirical results: BRE and WTI (2016–2020).
Table 6. Empirical results: BRE and WTI (2016–2020).
BRE WTI
GARCHT-GARCHZD-GARCHTZD-GARCHGARCHT-GARCHZD-GARCHTZD-GARCH
ω ^ n 0.1765 *0.1909 * 0.2186 *0.1694 *
(0.0710)(0.0835) (0.0907)(0.0774)
α ^ n or α ^ n + 0.1422 *0.00910.1075 *0.01350.1575 *0.00510.1160 *0.0225
(0.0289)(0.0149)(0.0159)(0.0128)(0.0299)(0.0174)(0.0170)(0.0158)
β ^ n 0.8331 *0.8513 *0.9131 *0.9478 *0.8192 *0.8736 *0.9074 *0.9392 *
(0.0376)(0.0482)(0.0113)(0.0092)(0.0399)(0.0400)(0.0120)(0.0114)
α ^ n 0.2205 * 0.1116 * 0.2058 * 0.1253 *
(0.0567) (0.0157) (0.0442) (0.0179)
v ^ n 0.97580.95791.02091.00830.97600.96641.02271.0087
γ ^ n −0.0054−0.00640.00250.0027−0.0057−0.00460.00320.0033
(0.2083)(0.2227)(0.1643)(0.1386)(0.2203)(0.2121)(0.1703)(0.1517)
T n −0.8248−0.90850.48110.6162−0.8187−0.69190.59420.6881
log lik.−2089−2068−2094−2071−2171−2153−2178−2157
AIC41854143419241484347431343614320
BIC41994163420241624362433343714333
Q 2 ( 6 ) 11.4811.809.699.589.619.559.579.37
Q 2 ( 12 ) 12.3915.1810.9711.3011.3611.4311.7711.31
Note: this table presents empirical results of the fitted GARCH, T-GARCH, ZD-GARCH and TZD-GARCH models of BRE and WTI daily returns over 2016–2020. ω ^ n , α ^ n , α ^ n + , α ^ n and β ^ n are the model parameters. v ^ n is the sample volatility persistence. γ ^ n is the Lyapunov exponent. Values in the parentheses are the corresponding standard errors. Tn is the stationarity (GARCH and T-GARCH) and stability (ZD-GARCH and TZD-GARCH) test statistic. log lik. is the log likelihood. AIC and BIC are the Akaike Information Criterion and Bayesian Information Criterion, respectively. Q2(6) and Q2(12) are the portmanteau test statistics at the 6th and 12th lags of standardized residuals, respectively. * denotes that the corresponding test is significant at 5% level.
Table 7. Empirical results: BRE and WTI (2008–2020).
Table 7. Empirical results: BRE and WTI (2008–2020).
BRE WTI
GARCHT-GARCHZD-GARCHTZD-GARCHGARCHT-GARCHZD-GARCHTZD-GARCH
ω ^ n 0.0316 *0.0346 * 0.1277 *0.1224 *
(0.0111)(0.0113) (0.0276)(0.0228)
α ^ n or α ^ n + 0.0831 *0.0172 *0.0749 *0.0184 *0.1226 *0.0154 *0.1142 *0.0342 *
(0.0095)(0.0070)(0.0078)(0.0053)(0.0143)(0.0097)(0.0125)(0.0119)
β ^ n 0.9158 *0.9281 *0.9344 *0.9503 *0.8651 *0.8872 *0.9069 *0.9318 *
(0.0102)(0.0106)(0.0061)(0.0043)(0.0152)(0.0141)(0.0090)(0.0096)
α ^ n 0.1114 * 0.1061 * 0.1735 * 0.1259 *
(0.0147) (0.0090) (0.0206) (0.0132)
v ^ n 0.99890.99431.00941.00550.98770.98061.02111.0117
γ ^ n −0.0011−0.00130.00090.0009−0.0035−0.00340.00120.0012
(0.1272)(0.1309)(0.1166)(0.1105)(0.1791)(0.1891)(0.1673)(0.1539)
T n −0.4834−0.53240.42270.4463−1.0551−0.99730.39280.4269
log lik.−6117−6078−6123−6082−6550−6505−6560−6508
AIC12,23912,16312,25012,16913,10613,01813,12413,022
BIC12,25712,18712,26212,18713,12413,04213,13613,040
Q 2 ( 6 ) 15.09 *15.42 *13.69 *10.982.963.042.581.99
Q 2 ( 12 ) 18.3218.2816.8414.907.287.956.106.14
Note: this table presents empirical results of the fitted GARCH, T-GARCH, ZD-GARCH and TZD-GARCH models of BRE and WTI daily returns over 2008–2020. ω ^ n , α ^ n , α ^ n + , α ^ n and β ^ n are the model parameters. v ^ n is the sample volatility persistence. γ ^ n is the Lyapunov exponent. Values in the parentheses are the corresponding standard errors. Tn is the stationarity (GARCH and T-GARCH) and stability (ZD-GARCH and TZD-GARCH) test statistic. log lik. is the log likelihood. AIC and BIC are the Akaike Information Criterion and Bayesian Information Criterion, respectively. Q2(6) and Q2(12) are the portmanteau test statistics at the 6th and 12th lags of standardized residuals, respectively. * denotes that the corresponding test is significant at 5% level.
Table 8. Out-of-sample forecasting results (2016–2019 and 2008–2019): BRE and WTI.
Table 8. Out-of-sample forecasting results (2016–2019 and 2008–2019): BRE and WTI.
BRE WTI
GARCHT-GARCHZD-GARCHTZD-GARCH GARCHT-GARCHZD-GARCHTZD-GARCH
Panel A: 2016–2019 (900 observations)
RMSE0.48950.56670.39060.3767 0.38920.40310.38220.3722
MAE0.31650.36410.26420.2501 0.26640.27130.23020.2118
MAPE0.29590.32680.25140.2333 0.49420.50690.42940.3641
MASE3.05933.51932.55382.4172 3.06783.12412.65142.4394
DM0.0006 *0.0001 *0.0831- 0.0473 *0.0150 *0.0513-
Panel B: 2008–2019 (2900 observations)
RMSE0.41550.48280.38810.3719 0.38090.39010.37910.3707
MAE0.26330.30100.25840.2443 0.26620.26910.24160.2411
MAPE0.23920.26550.24230.2256 0.49040.49800.43760.4205
MASE2.54472.90912.49742.3608 3.06623.09972.78312.7767
DM0.0024 *0.0002 *0.0389 *- 0.0391 *0.0272 *0.0429 *-
Note: this table presents the out-of-sample forecasting results of BRE and WTI volatility between the end of November 2019 and the end of April 2020. The training samples are classified in to two cases: 2016–2019 (900 observations) and 2008–2019 (2900 observations). The benchmark volatility is produced via the model described in Yang and Zhang [4]. RMSE is the root of mean squared error. MAE is the mean absolute error. MAPE is the mean absolute percentage error. MASE is the mean absolute scaled error. Bold numbers indicate the smallest forecasting errors for each criterion. DM is the p-value of the Diebold and Mariano [19] test, which is performed pair-wisely by contrasting forecasts of TZD-GARCH to those of one of the rest three models. In each case, the null (alternative) hypothesis is that the forecasts of TZD-GARCH are as accurate as (more accurate than) those of the tested model. * denotes that the corresponding test is significant at 5% level.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Liu, T.; Shi, Y. Forecasting Crude Oil Future Volatilities with a Threshold Zero-Drift GARCH Model. Mathematics 2022, 10, 2757. https://0-doi-org.brum.beds.ac.uk/10.3390/math10152757

AMA Style

Liu T, Shi Y. Forecasting Crude Oil Future Volatilities with a Threshold Zero-Drift GARCH Model. Mathematics. 2022; 10(15):2757. https://0-doi-org.brum.beds.ac.uk/10.3390/math10152757

Chicago/Turabian Style

Liu, Tong, and Yanlin Shi. 2022. "Forecasting Crude Oil Future Volatilities with a Threshold Zero-Drift GARCH Model" Mathematics 10, no. 15: 2757. https://0-doi-org.brum.beds.ac.uk/10.3390/math10152757

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop