Predicting Extreme Daily Regime Shifts in Financial Time Series Exchange/Johannesburg Stock Exchange—All Share Index

Makatjane, Katleho; Moroke, Ntebogang

doi:10.3390/ijfs9020018

Open AccessArticle

Predicting Extreme Daily Regime Shifts in Financial Time Series Exchange/Johannesburg Stock Exchange—All Share Index

by

Katleho Makatjane

^*

and

Ntebogang Moroke

Faculty of Economic and Management Sciences, Department of Statistics and Operations Research, North-West University, Mmabatho 2745, South Africa

^*

Author to whom correspondence should be addressed.

Int. J. Financial Stud. 2021, 9(2), 18; https://0-doi-org.brum.beds.ac.uk/10.3390/ijfs9020018

Submission received: 4 January 2021 / Revised: 4 March 2021 / Accepted: 16 March 2021 / Published: 25 March 2021

(This article belongs to the Special Issue Alternative Models and Methods in Financial Economics)

Download

Browse Figures

Versions Notes

Abstract

:

During the past decades, seasonal autoregressive integrated moving average (SARIMA) had become one of a prevalent linear models in time series and forecasting. Empirical research advocated that forecasting with non-linear models can be an encouraging alternative to traditional linear models. Linear models are often compared to non-linear models with mixed conclusions in terms of superiority in forecasting performance. Therefore, the aim of this study is to build an early warning system (EWS) model for extreme daily losses for financial stock markets. A logistic model tree (LMT) is used in collaboration with a seasonal autoregressive integrated moving average-Markov-Switching exponential generalised autoregressive conditional heteroscedasticity-generalised extreme value distribution (SARIMA-MS-EGARCH-GEVD) estimates. A time series of the study is a five-day financial time series exchange/Johannesburg stock exchange-all share index (FTSE/JSE-ALSI) for the period of 4 January 2010 to 31 July 2020. The study is set into a two-stage framework. Firstly, SARIMA model is fitted to stock returns in order to obtain independently and identically distributed (i.i.d) residuals and fit the MS(k)-EGARCH(p,q)-GEVD to i.i.d residuals; while, in the second stage, we set-up an EWS model. The results of the estimated MS(2)-EGARCH(1,1) -GEVD revealed that the conditional distribution of returns is highly volatile giving the expected duration to approximately 36 months and 4 days in regime one and 58 months and 2 days in regime two. We further found that any degree losses above 25% implies that there will be no further losses. Using the seven statistical loss functions, the estimated

S A R I M A (2, 1, 0) \times {(2, 1, 0)}_{240} - M S (2) - E G A R C H (1, 1) - G E V D

proved to be the most appropriate model for predicting extreme regimes losses as it was ranked at 71%. Finally, the results of EWS model exhibit reasonably an overall performance of 98%, sensitivity of 79.89% and specificity of 98.40% respectively. The model further indicated a success classification rate of 89% and a prediction rate of 95%. This is a promising technique for EWS. The findings also confirmed 63% and 51% of extreme losses for both training sample and validation sample to be correctly classified. The findings of this study are useful for decision makers and financial sector for future use and planning. Furthermore, a base for future researchers for conducting studies on emerging markets, have been contributed. These results are also important to risk managers and and investors.

Keywords:

Bayesian; block minima; extreme value theory; generalised extreme value distribution; Markov-Chain-Monte-Carlo; Markov-Switching models

1. Introduction

The last decade saw a large number of financial crises in emerging market economies (EMEs) with often devastating economic, social, and political consequences. These financial crises were in many cases not confined to individual economies, but also spread contagiously to other markets. As a result, international financial institutions have developed early warning system (EWS) models, with the aim of identifying economic weaknesses and vulnerabilities among emerging markets and ultimately anticipating such events. As a result, international and private sector institutions have begun to develop EWS models with the aim of anticipating whether and when individual countries may be affected by a financial crisis. The international monetary fund (IMF) has taken a lead in putting significant effort into developing EWS models for EMEs, which has resulted in influential papers by Kaminsky et al. (1998). However, many central banks, such as the United States (US) Federal Reserve and the Bundes bank, academics, and various private sector institutions have also developed models in recent years. Early warning system models can have substantial value to policy-makers by allowing them to detect underlying economic weaknesses and vulnerabilities, and possibly taking pre-emptive steps to reduce the risks of experiencing a crisis. However, the central concern is that these models have been shown to only perform modestly well in predicting crises.

The aim of this paper is to develop a new EWS model for extreme daily losses that significantly improves upon existing models in three ways. First, and most importantly, the paper argues that a key weakness of existing EWS models is that they are subject to what we call a post-crisis bias. This bias implies that these models fail to distinguish between tranquil periods, when the economic fundamentals are largely sound and sustainable, and post-crisis/recovery periods, when economic variables go through an adjustment process before reaching a more sustainable level or growth path. We show that making this distinction using a logistic model tree (LMT) model with two regimes (a pre-crisis regime and post-crisis/recovery regime) constitutes a substantial improvement in the forecasting ability of EWS models.

Second, many financial crises since the 1990’s have been contagious in spreading across the markets. Therefore, another aim of this paper is to apply to EWS models contagion indicators, to measure real contagion channels through trade linkages (direct and indirect) and financial contagion channels via equity market interdependence. We find that, in particular, the financial contagion channel has been an important factor in explaining and anticipating market crises. Third, we uses a data sample of South African stock market with five day frequency, for the period 2010–2020 as the basis for the EWS model estimations. Because the aim of an EWS model is to develop a framework that allows for predicting crises in relatively open economies in the future, it is imperative to use for the in-sample estimation only those crises and country observations that are similar to those that are likely to occur in the future. Therefore, we have started our sample only in 2010, excluding the 1980’s and early 1990’s, during which capital markets were not yet integrated and capital accounts often still closed to foreign investors, also because many countries still experienced hyperinflation in the early 1990’s. Likewise, we have excluded the years immediately following the transition to a free market in Eastern European countries.

To achieve our objective, we propose a hybrid approach to time series forecasting using SARIMA, MS-EGARCH and GEVD. It is often difficult in practice to determine whether a series under study is generated from a linear or non-linear underlying process or whether one particular method is more effective than the other in out-of-sample forecasting. Thus, it is difficult for forecasters to choose the right technique for their unique situations Zhang (2003). Typically, a number of different models are estimated and the one with the most accurate result is selected. However, the final selected model is not necessarily the best for future use due to some potential influencing factors such as sampling variation, model uncertainty, and change in structure. By combining different methods, the problem of model selection can be eased with little extra effort. Second, real-world time series are rarely pure linear or non-linear. They often contain both linear and non-linear patterns. If this is the case, neither SARIMA, MS-EGARCH nor GEVD can be adequate in modelling and forecasting time series since the SARIMA model cannot deal with non-linear relationships while the MS-EGARCH and GEVD alone are not able to handle both linear and non-linear patterns equally well. Hence, by combining SARIMA with MS-EGARCH and GEVD models, will lead us to accurately model complex time series structures. Third, it is almost universally agreed in the forecasting literature that no single method is best in every situation. This is largely due to the fact that a real-world problem is often complex in nature and any single model may not be able to capture different patterns equally well.

lSigauke et al. (2012) used some ARIMA (herein reference autoregressive integrated moving average) model as a benchmark to test the effectiveness of a generalised extreme value distribution with mixed results in order to obtain i.i.d residuals. Many empirical studies including several large-scale competitions suggest that by combining various models, forecasting accuracy can often be improved over individual model without the need to find a ’true’ or ’best’ model Zhang (2003). In addition, the developed hybrid model is more robust with regard to the possible structure change in the data and a basic idea of a hybrid model in forecasting is to use each model’s unique feature to capture various patterns and both theoretical and empirical findings suggest that combining various methods can be an effective and efficient way to improve forecasts.

With the proposed hybrid model, we amalgamate

S A R I M A (p, d, q) \times {(P, D, Q)}_{S}

to obtain i.i.d residuals while at the same time model seasonally linear structures in a time series. Using the residual of SARIMA, we fit MS-EGARCH-GEVD to model volatility, regime dependence and extreme tail losses. Specifically, we consider a conditional GEVD with a specification that the extreme value sequence comes from an exponential GARCH-type process in the conditional variance structure. The dependence is captured by an appropriate temporal trend in the location and scale parameters of the GEVD. The advantage of a proposed hybrid lies in its ability to capture conditional heteroscedasticity, structural breaks, asymmetric behaviour through an exponential switching GARCH framework, and further models fat-tail and extreme tail behaviour by the use of GEVD. The SARIMA-MS-EGARCH-GEVD modelling approach is believed to perform better in forecasting and it is suited to explain extremes better than the classical MS-EGARCH and SARIMA alone, which cannot capture the tail behaviour adequately with neither normally distributed nor even fatter tailed distributed (e.g., t) innovations as suggested by Calabrese and Giudici (2015). Following Makatjane et al. (2018b), we denote regime classification of SARIMA-MS-EGARCH-GEVD by the following interval [0,1] for low and high regimes in order to develop a dummy variable for the LMT as this model serves as a warning sign model. This study is the first empirical analysis that employs SARIMA, MS-EGARCH in conjunction with the GEVD and LMT models to quantify the likelihood of future extreme daily losses.

Literature Review

The financial chaos that hit developing markets in the latest decades has initiated the need for precise country hazard assessment Fuertes and Kalotychou (2007). In order to explain and predict the crisis of a country, including the currency crises, several models have been developed by a number of studies worldwide. In empirically observing the crisis, it is significant to be indistinct in how a crisis is defined. The models used for determining the early warning signals of extreme daily losses are discussed in this section together with those used for obtaining regime switches of extreme daily losses. Two non-linear models, such as Markov-Switching autoregressive (MS-AR) and Logistic Regression model, have been considered by Cruz and Mapa (2013) with the aim of developing an early warning system model for inflation in the Philippines. The forecasts were combined by the regime switching of inflation and the likelihood of the occurrence of the inflation crisis. However, the results of these authors showed that an outcome of the regime classification appeared to be erratic with regime lasting for a month. By using penalised maximum likelihood methodology, Arias and Erlandsson (2004), in their study of regime switching as an alternative of early warning system of currency crises, found that the method allowed for them to extract smoother transition probabilities than in the standard case, reflecting the need of policy makers to have advance warning in the medium to long term, rather than the short term. See also Abiad (2003).

Two primary issues have been faced by past analysts. Firstly, considerable research has been developed around the significance of precision in deciding the timing and duration of crisis periods. Furthermore, there has been a significant debate by researchers endeavouring to decide the most ideal approach to analyse correlation dynamics before, during, and after these crisis stages Troug and Murray (2020). Scholars, like Forbes and Rigbon (2002), had a problem in accurately determining the crisis. These authors utilised different methods, such as an exogenous and endogenous approach, but all found different results. Moysiadis and Fokianos (2014) noted that a Markov-chain (MC) algorithm has given future states and it is significance while the categorical response variable is lagged. Two reasons arise for the problem caused by a clear categorical time series when it is modelled by Markovian methods. (1) There is a positive non-linear relationship between the order of MC and free parameters, i.e., as the order of the MC accumulates, so does the free parameters. Nonetheless, these free parameters increase exponentially. However, we incorporate the Bayesian approach to restrict these free parameters to increase exponentially, but remain constant over time with non-constant regime switching probabilities (2). The response variable and the covariates that are observed jointly must be a joint flow between them. Of course, this type of determination might be impossible in the stochastic processes of higher time series frequencies. Hence, the proposed Bayesian approach in this study.

Nevertheless, the model that is known as an EWS according to Edison (2002) is engaged for the prediction of crises mainly the financial crises. There are various types of crises, which included the 2008 US financial crises, including currency crises that were studied by Jeanne and Masson (2000), banking crises by Borio and Drehmann (2009), sovereign debt crises and private sector debt crises by Schimmelpfennig et al. (2003), and equity market crises by Bekaert et al. (2014). Therefore, the study extends the current focus of crises to the prediction of extreme daily losses. In developing an EWS for markets crisis, there are three methodologies that are emphasized. These are the bottom-up methodology, the aggregate methodology and the macroeconomic methodology. The odds of extreme market crisis are addressed and the systemic volatility is being activated and signed if the odds become significant. For the second method, the model is applied to data other than individual banking data. On the third method, the focal point is centred in building a relationship between economic variables with the reason that various macroeconomic variables are required to affect the financial system and reflect their own condition.

Davis and Karim (2008) used a multivariate logit model in their comparison study of an early warning system with the aim of relating the likelihood of occurrence or non-occurrence of a crisis to a vector of n explanatory variables. The probability that a dummy variable takes a value of one (crisis occurs) at a point in time was given by the value of a logistic cumulative distribution that was evaluated for the data and parameters at that point in time. Their results showed that the logit model they estimated might be the best model for globally detecting banking crises. Because of small samples and the need to keep the degrees of freedom, Kolari et al. (2002) added to the work of EWS by estimating a stepwise logistic regression in order to identify the subset of covariates that are needed in the model through their power to discriminate. The predefined significance level was set at 10% and the impact of this was that few variables were chosen in the model, hence the need to increase the significance level to 30%, which was used as a threshold to add variables in the model. The main problem that caused the lack of significance of the variables in entering the model is due to the fact that the error term in the regression model followed a cumulative distribution that does not accurately estimate a logit function.

2. Materials and Methods

Let

r_{t}

be stock returns at time t, Chinhamu et al. (2015) and Bee (2018) showed that the returns can be modelled by

r_{t} = μ_{t} + ε_{t},

(1)

where

μ_{t}

is a time-varying mean and

ε_{t}

is the error term that can be modelled by

ε_{t} = ν_{t} σ_{t} .

(2)

σ_{t}

is the time-varying dynamics, while

ν_{t}

is an i.i.d process. The distribution of

ν_{t}

, specifically its tails, is our focus in this study. We apply block minima (BM) to tails regions of the innovation distribution of Equation (2) and its associated extreme quantiles.

2.1. SARIMA-(p,d,q)(P,D,Q)-MS(K)-EGARCH(p,q)-GEVD

Box et al. (2015) invented two models which are known as SARIMA intervention and SARIMA respectively. Additionally, SARIMA model is proposed in this study to serve as a predecessor that is used to filter a time series in order to obtain i.i.d residuals. According to Makatjane et al. (2018a), a multiplicative SARIMA model denoted by

S A R I M A (p, d, q) \times {(P, D, Q)}_{S}

follows this mathematical form

Φ (β) Φ_{S} (β^{S}) {(- β)}^{D} r_{t} = Θ (β) Θ_{S} (β^{S}) ε_{t}

(3)

where

ε_{t} \sim i . i . d (0, 1)

and S is the seasonal length while

β

is the lag operator. Tsay (2014), emphasized that there should be no common factors between the polynomials of seasonal autoregressive (SAR) and seasonal moving average (SMA); if not the order of the model must be in a reduced form. Moreover, SAR polynomials should acquaint with the characteristic equation of SARMA because that is the duty of SAR model Moroke (2014).

Using the residuals of model (3), we fit the MS-EGARCH-GEVD subject to two regimes. The observations fitted on this model comes from the GEVD following an exponential GARCH process that describes a conditional variance of extremes. To account for the leverage effects, we model Equation (2) with respect to a vector

θ_{k}

because past negative values have larger stimulus on the conditional volatility than the past positive values of the same magnitude Ardia et al. (2018). Therefore, covariance-stationarity in each regime is accomplished by setting

k > 1

as Ardia et al. (2016) has declared. Hence the MS(k)-EGARCH(p,q)1 is given by

ln (σ_{t}^{2}) = ω + \sum_{i = 1}^{p} \{α_{i} * (\frac{∣ ε_{t - i} ∣}{σ_{t - i}} - E [\frac{∣ ε_{t - i} ∣}{σ_{t - i}}]) + γ_{i} \frac{ε_{t - i}}{σ_{t - i}}\} + \sum_{j}^{q} ln (σ_{t - j}^{2}) .

(4)

Let

S_{t}

to be an ergodic homogeneous MC on a finite set, Bauwens et al. (2014) disclosed that

S_{t} = \{1, \dots K\}

and Ardia et al. (2018) defined a transition matrix as

P \equiv {\{p_{i j}\}}_{i, j = 1}^{K}

where

p_{i, j} \equiv P [S_{t} = j ∣ S_{t - 1} = i]

. We initiate the chain at

t = 0

so that

{\{S_{t}\}}_{\geq 1}

is independent from

{\{u_{t}\}}_{\geq 1}

. Finally, a time-dependent generalised extreme value distribution is fitted and according to Gagaza et al. (2019) and Coles (2001) this distribution is given by

G_{ξ} (x) = \exp \{- {[1 + ξ (\frac{x_{i} - μ}{σ})]}^{- \frac{1}{ξ}}\} .

(5)

Nonetheless, Equation (5) is valid for

\{x : μ - \frac{σ}{ξ} < x < \infty\}

, where

μ

is the location parameter that ranges from

- \infty < μ < \infty

and

σ

is a scale parameter and it must be greater than zero; though

ξ

is a shape parameter that ranges from

- \infty < ξ < \infty

. When

ξ > 0

, a distribution defined in Equation (5) reduces to Fréchet tails, but when

ξ < 0

it corresponds to the Weibull tail type Coles (2001). In order to correct undefined results of distribution (5) when

ξ = 0

, we take

lim_{ξ \to \infty}

and this results in Gumbel tails.

Once modelling extremes with BM approach, the block size is usually set to one year Nemukula (2018). In this way, the tactic will give only 12 annual maximum or minimum points hereafter, the authors fitted the GEVD to a 30-day (one month) block maxima. But in this study, we use a five-day (five days) blocking to fit the GEVD over a block minima. This is because the stock market data is only observed in a five day period. From each block, the minimas say,

X_{i}, i = 1, \dots, m

are selected and this forms a series of m five day minimas to which the GEVD is fitted to. If

X_{1}, \dots, X_{n}

constitute five day maximum losses that are distributed with the GEVD in model (5), Maposa et al. (2016) showed that in period t,

r_{t}

follows

G E V D (μ (t), ξ, σ)

and Bee (2012) emphasised that

μ (t) = μ_{0} + μ_{1}^{t}

for a linear variation in location with an intercept parameter

μ_{0}

and a slope parameter

μ_{1}

, that expresses the rate of change in daily losses. We finally express our proposed hybrid for extreme return losses as

r_{t} \sim S A R I M A (p, d, q) \times (P, D, Q) - M S (k) - E G A R C H (p, q) - G E V D (x i, μ_{t}, σ_{t})

.

2.2. Logistic Model Tree

The logistic model tree (LMT) is engaged as part of this study to determine an early warning system (EWS) for the extreme daily losses. This method is utilised as a development from the

M S (k) - E G A R C H (p, q) - G E V D

. The regimes are regarded as the binary response variable in an LMT model. The practicality of crises of extreme losses is being assessed through the probabilities that are extracted from the LMT. The logistic model tree is a classification model, which combines decision tree learning methods and logistic regression (LR) Bui et al. (2016). Following Chen et al. (2017), we make use of the LogitBoost algorithm to produce an LR model at every node in the tree, and the tree is pruned using a classification and regression tree (CART) algorithm. The LMT uses cross-validation to find a number of LogitBoost iterations to prevent the over-fitting of training data. The LogitBoost algorithm uses additive least-squares fits of the logistic regression for each

M_{i}

class according to Doetsch et al. (2009); it is as follows

L_{M} (X) = \sum_{i = 1}^{n} β_{i} X_{i} + β_{0}

(6)

where

β_{i}

is the coefficient of the

i^{t h}

component of vector x,whereas n is the number of factors. Furthermore, just as in Chen et al. (2017)’s work, we make use of a linear logistic regression procedure to compute the posterior probabilities of leaf nodes in the LMT model and, according to Agresti (2018) and Stokes et al. (2012), the posteriors are computed by

P (M ∣ X) = \frac{\exp (L_{M} (X))}{\sum_{M^{'} = 1}^{D} \exp (L_{M^{'}} (X))}

(7)

where D is the number of classes. For more readings on LogitBoost algorithm, see, for instance, (Pham and Prakash (2019), Pourghasemi et al. (2018), and Kamarudin et al. (2017)), respectively.

2.3. Bayesian Markov-chain-Monte-Carlo Framework

This subsection presents a theoretical overview of the Bayesian Markov-chain-Monte-Carlo (MCMC) on a five day loss frequency modelling. A five day minima

- X_{1}, \dots, - X_{n}

are i.i.d residuals from the estimated MS (k)-EGARCH (p,q) model for reasonably large n; and, the marginal distribution of each minima is said to be

x_{t} \sim G E V D (x i, μ_{t}, σ_{t})

. For the parametric inference using MCMC, the posterior distribution is computed using the following Bayes’ Theorem 1.

Theorem 1.

π (θ ∣ x) = \frac{π (x ∣ θ) π (θ)}{\int_{Θ} π (x ∣ θ) π (θ) d θ}

which is usually written as

π (θ ∣ x) \propto π (x ∣ θ) π (θ) .

Here,xis a vector of observations,

θ

is a parameter vector, while

π (θ)

is the prior and

π (θ ∣ x)

is the posterior distribution. Finally,

π (x ∣ θ)

is the likelihood function with the following

π (x)

normalisation constant and

Θ

is the space parameter.

According to Maposa et al. (2016), a

100 (1 - α) %

Bayesian credible set C (or, in particular, credible interval) is a subset of the space parameter

Θ

, such that

\int_{C} π (θ ∣ x) d θ = 1 - α .

(8)

If the space parameter

Θ

is discrete, the integral in model (8) is replaced by the summation

(\sum)

. The quantile-based credible interval is such that if

θ_{L}^{*}

and

θ_{U}^{*}

are posterior quantiles for

θ

, where the former is computed by

\frac{α}{2}

and the latter by

1 - \frac{α}{2}

; then,

(θ_{L}^{*}, θ_{U}^{*})

is a

100 (1 - α) %

credible interval for

θ

; hence, the likelihood function for

G E V D (x i, μ_{t}, σ_{t})

is given by

π (x ∣ θ) = \prod_{i = 1}^{k} \frac{1}{σ} {[1 + ξ (\frac{x_{i} - μ}{σ})]}^{- \frac{1}{ξ} - 1} \exp \{- {[1 + ξ (\frac{x_{i} - μ}{σ})]}^{- \frac{1}{ξ}}\} .

(9)

The joint posterior density is

\begin{matrix} π (θ ∣ x) \propto π (θ) π (x ∣ θ) \\ π (θ ∣ x) \propto \frac{1}{σ} \exp \{- \frac{1}{2} {(θ^{'} - ϑ)}^{T} \sum^{- 1} (θ^{'} - ϑ)\} \times \prod_{i = 1}^{K} \frac{1}{σ} {[1 + ξ (\frac{x_{I} - μ}{σ})]}^{- \frac{1}{ξ} - 1} \exp \{- {[1 + ξ (\frac{x_{i} - μ}{σ})]}^{- \frac{1}{ξ}}\} \\ π (θ ∣ x) \propto \frac{1}{σ^{k + 1}} \exp \{- \frac{1}{2} {(θ^{'} - ϑ)}^{T} \sum^{- 1} (θ^{'} - ϑ)\} \times \exp \{\sum_{i = 1}^{k} {[1 + ξ (\frac{x_{i} - μ}{σ})]}^{- \frac{1}{ξ}}\} \times \prod_{i = 1}^{k} \frac{1}{σ} {[1 + ξ (\frac{x_{i} - μ}{σ})]}^{- \frac{1}{ξ} - 1} \\ π (θ ∣ x) \propto \frac{1}{σ^{k + 1}} \exp \{- \frac{1}{2} {(θ^{'} - ϑ)}^{T} \sum^{- 1} (θ^{'} - ϑ) - \sum_{i = 1}^{k} {[1 + ξ (\frac{x_{i} - μ}{σ})]}^{- \frac{1}{ξ}}\} \times \prod_{i = 1}^{k} {[1 + ξ (\frac{x_{i} - μ}{σ})]}^{- \frac{1}{ξ} - 1} . \end{matrix}

Having the above joint posterior density, a posterior predictive density is used to predict observations of future posterior tail probabilities,

X_{0}

, as follows

Pr (X_{0} > x_{0} ∣ x_{1}, \dots, x_{n}) = \int_{0} Pr (X_{0} > x_{0} ∣ θ) π (θ ∣ x) d θ .

(10)

According to Billio et al. (2018) and Stephenson and Ribatet (2015), Equation (10) becomes

Pr (X_{0} > x_{0} ∣ x_{1}, \dots, x_{n}) \approx \frac{1}{k - α + 1} \sum_{i = 1}^{k} Pr (X_{0} > x_{0} ∣ θ),

(11)

where

Pr (X_{0} > x_{0} ∣ θ)

is the GEVD that is evaluated at

t_{0}

and

α

is the burn-in period. Moreover, if

X_{N}

is an annual daily maximum losses over some future period of N years, Maposa et al. (2014) showed that a posterior predictive distribution is given by

Pr (X_{0} > x_{0} ∣ x_{1}, \dots, x_{n}) \approx \frac{1}{k - α + 1} \sum_{i = 1}^{k} Pr {(X_{0} > x_{0} ∣ θ_{i})}^{N} .

(12)

2.4. Forecasting Performance and Predictive Accuracy

The forecasting exercise is performed in pseudo-real-time, i.e., the information which is not accessible, is never utilised at the time a forecast is made. For all models, Carriero et al. (2015) utilized the recursive estimation window and assessed their outcomes with root mean squared forecast error (RMSFE). In any case, the present study tails these methods of Carriero et al. (2015) to evaluate the forecasting performance of the proposed models. The root mean square error (RMSE), mean absolute percentage error (MAPE), mean absolute error (MAE), Diebold-Mariano (D-M) and Theil Inequality Coefficient (TIC) are used.

Given the time series

Y_{t}

and the estimated

{\hat{Y}}_{t}

, Spyros (1993) defined MAE and MAPE as

M A E = \frac{1}{n} \sum_{i = 1}^{n} [Y_{t} - {\hat{Y}}_{t}]

(13)

MAPE = \frac{1}{n} \sum_{t = 1}^{n} |Y_{t} - {\hat{Y}}_{t}| * 100 .

(14)

Carriero et al. (2015) specified RMSE by

R M S E = \sqrt{\frac{1}{n} \sum_{1}^{n} {[Y_{t} - {\hat{Y}}_{t}]}^{2}}

(15)

Theil Inequality Coefficient (TIC) has been derived by Theil (1971) as follows

U_{2} = \frac{\sqrt{\sum {(p_{i} - A_{i})}^{2}}}{\sum A_{i}^{2}},

(16)

where, the P’s and A’s are defined as changes in predictive and actual values, respectively and

0 < U_{2} < 1

. By letting

Y_{t + h| t}^{1}

and

Y_{t + h| t}^{2}

, Diebold (2015) tested the null hypothesis that two or more forecasts have the same accuracy by utilising the following statistic

D M = \frac{\bar{d}}{\frac{\sqrt{2 π {\hat{f}}_{d} (0)}}{T}}

(17)

where

{\hat{f}}_{d} (0)

is a consistent estimate of

f_{d} (0)

which is

{\hat{f}}_{d} (0) = \frac{1}{2 π} \sum_{k = - (T - 1)}^{T - 1} Ω (\frac{k}{h - 1}) {\hat{γ}}_{d} (k) .

(18)

Note that

{\hat{γ}}_{d} (k) = \frac{1}{T} \sum_{t}^{T} = |k| + (d_{t} - \bar{d}) (d_{t -} |k| - \bar{d})

and

Ω (\frac{k}{h - 1}) = \{\begin{matrix} 1, f o r |\frac{k}{h - 1}| \leq 1 \\ 0, O t h e r w i s e \end{matrix}

. Perceiving that

{\hat{γ}}_{d} (- k) = (d_{t} - \bar{d}) (d_{t - ∣ k ∣} - \bar{d})

and

Ω (\frac{k}{h - 1}) = 0

for

∣ k ∣ > h - 1

, Diebold (2015) revealed that

{\hat{f}}_{d} (0) = \frac{1}{2} π ({\hat{γ}}_{d} (0) + 2 \sum_{k = 1}^{h - 1} {\hat{γ}}_{d} (k)) .

(19)

If

h \geq 1

then;

D M = \frac{\bar{d}}{\frac{\sqrt{{\hat{γ}}_{d} (0)} + 2 \sum_{k = 1}^{h - 1} {\hat{γ}}_{d} (k)}{T}} .

(20)

Under the null hypothesis, the test statistic DM is asymptotically normally distributed with mean zero and unit variance. The null hypothesis of no difference will be rejected if the computed DM statistic falls outside the range of

- Z_{α} / 2

to

Z_{α} / 2

.

3. Results

In this study, we fit

S A R I M A (p, d, q) \times {(P, D, Q)}_{S} - M S (K) - E G A R C H (p, q) - G E V D

to a five-day FTSE/JSE-ALSI. This is a high frequency time series obtained from the Johannesburg stock exchange for the period of 4 January 2010 to 31 July 2020. This resulted into 2644 observations2. To achieve this task, we use the Bayesian approach to fit a stationary distribution, denoted by

G E V D (Φ)

. This procedure is executed using various R packages, such as evdbayes of Stephenson and Ribatet (2015), MSGARCH of Ardia et al. (2018), and ismev of Heffernan et al. (2018), among others. As evidenced in Table 1, the distribution of return losses is asymmetric(i.e., negatively skewed); while, on the other hand, kurtosis is above three designating Fat-tailed distribution. A visual inspection of Figure 1 also shows that the largest losses are experienced between the year 2010 and 2018. Nevertheless, Wang et al. (2020) declared that these large losses are caused by a contractionary monetary policy that was implemented by the South African Reserve bank (SARB). In that context, Acharya et al. (2009) also pointed out six events that are linked to these losses. This includes in them and also as commented by the Global Economist

Repeal of Glass-Steagall act by the Clinton administration. According to Wallison (2011), Glass-Steagall act is one of the biggest post-depression piece of legislations, separating commercial and investment banks. Although this “deregulation” did not repeal the most important section distinguishing the role of commercial and investment banks.
The surge in a number of sub-prime mortgages as a response to high level of housing speculations and building up of the bubble.
The creation of new financial instruments which was risky, hard to assess, and shifted the accountability between agents.
The fall of real interest rate, combined with the Federal Reserves’ expansionary monetary policy.
Global financial imbalances.

3.1. SARIMA-MS-EGARCH-GEVD Model

To begin our analysis, we first train

S A R I M A (p, d, q) \times {(P, D, Q)}_{S}

with the ratio of 80% training and 20% validation sets. The aim here is to filter the returns so to obtain i.i.d residuals. Figure 2 shows the plot of residuals demand, the Q-Q plot and their probability density (pd). First and foremost, we applied an augmented Dickey Fuller (ADF) test to a time series with the aim to accommodate the Box-Jenkins methodology. The results of the ADF test provided a sufficient evidence that the FTSE/JSE-ALSI contains unit root with both seasonal and non-seasonal differencing of order one. i.e.,

d = 1

. The ADF model with intercept plus trend is the one used as its

Pr (τ) = 0.001

. In this case, a stationary time series is achieved. In model selection, Akaike information criteria (AIC) and Schwartz Bayesian criteria (SBC) congruently advocate that the SARIMA model is of the form

A R I M A (2, 1, 0) \times {(2, 1, 0)}_{240}

which is formulated below as

r_{t} = [1 - 0.498 Φ * * * (1) (1 + 0.916 Φ * * * (2)) (1 + 0.65 Φ * * * (240) (1 + 0.673 Φ * * * (241)))] ε_{t} .

When diagnosing the estimated SARIMA, all parameter estimates are found significant at 5% level of significance. Additionally, the estimated Q-statistics also gave vast significant evidence that the estimated model is a white noise process with

Pr (Q s t a t i s t i c = 0.1732)

and

ε_{t} \sim i . i . d (0, 0.02978)

. Makatjane et al. (2018a) have suggested that model parameter estimates must be less than one as to deem them to be sufficient and significant.

Using the residuals of the estimated

S A R I M A (2, 1, 0) \times (2, 1, 0) 240

we fit MS-EGARCH-GEVD subject to two regimes. The results of fitting the MS(2)-EGARCH(1,1) model to the residuals of the SARIMA model using a training data are presented in Table 2. Setting the number of burns to 10,000 L and the number of MCMC to 510,000 L, we obtained the acceptance rate MCMC sampler of 54.5%. The estimate of a time-invariant mean parameter denoted by

\hat{μ}

is statistically significant. Both the ARCH

\hat{α}

and GARCH estimates

\hat{β}

are significant, indicating the presence of conditional heteroscedasticity effects in each regime. According to Sigauke et al. (2014), this implies that volatility shocks are also persistent. In addition, the estimates of a skewed student-t

\hat{δ}

parameter is positive and statistically significant, signifying non-normal distribution as reported in Table 1; while the estimates of a shape parameter

\hat{ξ}

are significant implying dependent volatility in each regime. Finally, the gamma estimate

\hat{γ}

is statistically significant, meaning that the effect of negative returns’ shocks on the conditional variance is higher returns.

Each of the two regimes identified has a clear economic interpretation. The variance in regime two is higher than that of regime one by 1.42%. This is a clear indication that the conditional distribution is very volatile, and it is subject to regime-shifts with a daily value of 7.36%. Nevertheless, when JSE-ALSI follows the second regime, on daily average, it falls to 9.67%. This means that when the series is in the second regime, its probability to switch to regime one is

P (S_{t} = 2 ∣ S_{t} = 1) = 0.0164

. The average duration of each regime also supports this behaviour. Based on the expected duration, regime one has approximately 36 months and four days in regime one and 58 months and two days in regime two. Therefore, we conclude that there is a significant regime shifts in JSE-ALI and it can be shown using the filtered and smoothed probabilities in Figure 3.

3.2. Generalised Extreme Value Distribution Framework

Finally, we establish a generalised extreme value distribution. Reported in Table 3 are the MCMC results of the fitted GEVD. As emphasised by Stephenson and Ribatet (2015) and Maposa (2016), näive standard errors are given by dividing the actual standard deviation by the number of iterations. This is due to the fact that Bayesian approach allows for an additional source of variation; probability distributions with hyper parameters and small standard errors. Also, Droumaguet (2012) emphasised that Bayesian methods have densities that solves the problem of confidence interval and allow for model estimation with higher dimensions because some models have complex likelihood function and in practice it becomes difficult to estimate with classical algorithms. Moreover, lSigauke et al. (2012) declared that ambiguity about the parameters is very minimal when using Bayesian approach to model estimation.

The estimate of a shape parameter

\hat{ξ}

, as shown in Table 3, is negative indicating that FTSE/JSE-ALSI returns conforms to a Weibull class of distribution. The same negative value of the shape parameter is also found in the study of Gagaza et al. (2019) and Sigauke et al. (2014), who found Weibull class in their respective studies. The contrast is in the empirical analysis of Chan (2016) and Coles (2001), where the authors found positive shape parameters in their studies. In-addition, the 95% confidence interval for

\hat{ξ}

which is estimated using

\hat{ξ} \pm z_{α / 2} \times (s t a n d a r d e r r o r)

has negative interval limits that enclose the estimated shape parameter, hence confirming that the Weibull is an appropriate distribution for FTSE/JSE-ALSI. Gagaza et al. (2019) in their study of modelling non-stationary temperature extremes in KwaZulu-Natal, using the generalised extreme value distribution, found the same negative limits. Furthermore, the right end point is

μ - \frac{σ}{ξ} = 1.377113

, which implies that, with any degree losses above 25%, there is unlikely to be any further decrease in JSE-ALSI losses.

3.3. Comparative Analysis

The purpose of this section is to determine the model which best mimics the data and produces fewer forecasts. We therefore, use Akaike information criteria (AIC), Schwartz Bayesian criteria (SBC), mean square error (MSE), mean absolute error (MAE) log-likelihood (LL), mean absolute percentage error (MAPE) and Theil inequality (TIC) which are defined as statistical loss function. The well-known information criterion that are usually used for model selection remain as the criterion used for prediction ability of the estimated models, that is, they are used here to select the model that best predicts the extreme losses. While on the other hand error metrics are used for the forecasting performance. For instance, the log-likelihood selected the best model as SARIMA-MS-EGARCH-GEVD while looking at AIC; the best performing model was also selected as SARIMA-MS-EGARCH-GEVD while MAE and RMSE selected GEVD model as the best model. Therefore, Raihan (2017) in his study, ranked the models according to their statistical loss functions to overcome contradicting results. The author used 10 statistical loss functions. Following this author, we only used seven statistical loss functions and the models were ranked from 1 to 4. On that note, rank 1 denoted the best model while 3 denoted the poorest model. Table 4 gave much evidence that SARIMA model has the poorest performance, as the frequency of rank 4 is higher than the other three models by looking at their statistical loss functions. Nevertheless, the SARIMA-MS-EGARCH-GEVD model outperformed all the models, as the frequency of rank 1 was higher than the other three models as it recorded 5 out of 7 statistical loss functions.

3.4. Evaluation of the Performance of the Logit-Based EWS Model

Several measures are used to evaluate the outcomes of classification algorithms relying on the classified instances Ghatasheh (2014). After processing the input instances, each one is classified into one of four the possibilities that are visually presented in a matrix, by confronting the actual and predicted instances. Table 5 represents the matrix that is called the “Confusion or Contingency Matrix” to perform this task.

In Table 5, event A epitomizes the occasion when the model indicates a crisis when a high inflation event indeed occurs. Event B refers to an event when a signal that is issued by the model is not followed by the occurrence of high inflation, i.e., wrong signal. It is also possible that the model does not signal a crisis (low estimated probability), but a crisis, in fact, occurs, i.e., missing signal, event C. Finally, Event D indicates a situation in which the model does not predict a crisis and no crisis occurs. In this paper, a threshold value of 0.5 is used to indicate whether the probabilities can already be interpreted as crisis signals. To evaluate the EWS model performance, we make use of the following performance criteria that were recommended by Kaminsky et al. (1998).

Probability of extremes correctly called (PECC): $\frac{A}{B + C}$
Probability of non-extremes correctly called (PNECC): $\frac{D}{B + D}$
Probability of observations correctly called (POCC): $\frac{A + D}{A + B + C + D}$
Adjusted noise-to-signal ratio (ANSR): $\frac{B}{B + D} / \frac{A}{A + C}$
Probability of an event of high extremes given a signal (PRGS): $\frac{A}{A + B}$
Probability of an event of high extremes given no signal (PRGNS): $\frac{C}{C + D}$
Probability of false extremes to total extremes (PFE): $\frac{B}{A + B}$

The performance assessment results of the Logit based EWS model are given as a summary in Table 6 and Table 7. We further adopt the following scenarios, as presented by Kaminsky et al. (1998).

The results in Table 6 infer that the model has some EWS potential. In view of the training set estimates, the model has a capacity to accurately predict 63% of high extreme losses and 21% of low extreme losses. As it is, the proportion of high extreme losses, given a signal, is relatively high at 43% in the training sample while the validation sample is at 57%. The proportion of false extremes to total extremes shown in Table 7 is relatively low, at 21% for the training data and 14% for validation data.

The results in Table 7 indicate that there is 51% chance of high extreme losses in the next five years and this probability is about 12% lower than the current situation. The implication here is that the countries’ financial markets could expect investment losses in the future. This inference is made based on the presented validation data estimates. In addition, the proportion of expected false alarm (PFE) to the total alarm is premeditated as 21% which is higher than that of the validation data by 7%. The validation data percent of the non-extremes correctly called is 98%, which 13% higher than that of the training sample. One could conclude that, after all, extreme losses could be low in future.

3.5. Model Performance Evaluation

Ideally, the model-calculated-probability-scores of all actual positives (also known as Ones) should be greater than the model-calculated-probability-scores of all the negatives (also known as Zeroes). Such a model is said to be perfectly concordant and a highly reliable one. This phenomenon can be measured by concordance and discordance. In simpler terms, of all combinations of 1–0 pairs (actuals), concordance is the percentage of pairs, whose scores of actual positives are greater than the scores of actual negatives. For a perfect model, this will be 100%. Accordingly, the higher the concordance, the better the quality of model. In this case, our EWS model is found to be a good model for predicting extreme losses in FTSE/JSE-ALSI with a concordance of 89.15% and the area under curve (AUC) being 88.78%. All of the numbers in Table 8 are computed using validation data that were not used in the training model Hence, a truth detection rate of 79% on validation data is good, hence our model is a good EWS model for extreme losses.

4. Conclusions and Recommendations

This paper makes use of a stochastic econometric model and extreme value theory (EVT) procedures to establish an early warning system for extreme daily losses stock markets. The statistical input of this work lies in establishing a hybrid model MS(2)-EGARCH(1,1)-GEVD for predicting the extreme losses and to implement the estimated regimes in LMT. To achieve our objective, we set-up a two stage procedure. In the first stage, we train the MS(k)-EGARCH(p,q)-GEVD with 80% training set and 20% validation set. Therefore, we finally use the regime switching of this model to establish the logistic model tree in order to set-up an early warning system. Robust parameter estimates were achieved by using Bayesian MCMC procedure and setting number of burns (nburn) and number of MCMC replicate to

n b u r n

= 100,000 L and

n m c m c

= 510,000 L, respectively. This gave us an overall acceptance rate sampler of 55%; an acceptance rate for the location parameter to be 46.3% and, for the shape parameter, we had a 51.61% acceptance rate.

This study is innovative in a sense that no similar study has used the

S A R I M A (p, d, q) \times (P, D, Q) M S (k) - E G A R C H (p, q) - G E V D

together with the LMT in predicting the possibility of extreme daily losses; and to our knowledge thus far, this study is the first to use the proposed models. Estimation of these models has delivered enhanced understanding of prediction and classification of extreme daily losses. In particular, the study is unique in terms of uniting univariate methods in predicting periods of high and low extreme daily losses. Jointly, the results can be useful in guiding policy-makers in identifying episodes of high extreme losses and safeguard the extreme crisis well ahead of time. In order to optimally classify extreme crises, the LMT is estimated. The two regime probabilities from the

S A R I M A (p, d, q) \times (P, D, Q) M S (k) - E G A R C H (p, q) - G E V D

model are incorporated in the logistic model tree as the binary dependent variable. The low extreme regime was denoted as zero and high extreme regime denoted as one. The study here addresses the events of high and low extreme losses accordingly and found that for the training set, the model has indicated that probability of high extreme losses is 63% while using a validation set was found to be 51% for the next five years.

Stock market participants can use these results to quantify daily operational losses and further predict the probability of default in stock markets. It would be interesting to see what sort of results we get if more sophisticated machine learning methods are used to filter the series and quantify the possibility extreme losses and undertake a comparative analysis with the hybrid MS(k)-EGARCH(p,q)-GPD through the use of MCMC and bootstrapping of the credible confidence interval for returns losses. A probabilistic description and modelling of extreme peak loads using Poison point process is another area that requires future research. This approach helps in estimating the frequency of occurrence of peak losses. A sensitivity analysis with respect to daily losses performed and the development of a two-stage stochastic integer recourse models with the objective of optimising returns’ distribution is an interesting future research direction with stock market data. This will be studied elsewhere.

Author Contributions

All authors contributed equally to the conception and design of this empirical analysis. All authors have read and agreed to the published version of the manuscript.

Funding

This study received no specific financial support.

Acknowledgments

The authors are grateful to Johannesburg stock exchange (JSE) for providing them with the high frequency five-day data.

Conflicts of Interest

The authors declare that they have no competing interests.

Abbreviations

The following abbreviations are used in this manuscript:

ANSR	Adjusted Noise-to-Signal Ratio
AUC	Area Under Curve
CART	Classification and Regression Tree
EWS	Early Warning System
EVT	Extreme Value Theory
FTSE/JSE-ALSI	Financial Time Series exchange/Johannesburg Stock Exchange- All Share Index
GARCH	Generalised Autoregressive Conditional Heteroscedasticity
GEVD	Generalised Extreme Value Distribution
GPD	Generalised Pareto Distribution
i.i.d	Independent and identically distributed
JSE-ALSI	Johannesburg Stock Exchange-All share index
LMT	Logistic Model Tree
MC	Markov-chain
MCMC	Markov-chain-Monte-Carlo
MS-EGARCH-GEVD	Markov-Switching Exponential Generalised Autoregressive Conditional Heteroscedasticity-Generalised Extreme Value Distribution
MS-EGARCH-GEVD	Markov-Switching Exponential Generalised Autoregressive Conditional Heteroscedasticity-Generalised Pareto Distribution
MS-GARCH	Markov-Switching Generalised Autoregressive Conditional Heteroscedasticity
PD	Probability Density
PECC	Probability of Extremes Correctly Called
PFE	Probability of False Extremes to Total Extremes
PNECC	Probabilty of Non-Extremes Correctly Called
POCC	Probability of Observations Correctly Called
PRGS	Probability of an Extreme Event Given Signal
PRSNS	Probability of Event of High Extremes Given no Signal
SARIMA-MS-EGARCH-GEVD	Seasonal Autoregressive Integrated Moving Average-Markov-switching Exponential Generalised Autoregressive Conditional Heteroscedasticity-Generalised Extreme Value Distribution
Q-Q	Quantile-Quantile

References

Abiad, Abdul. 2003. Early Warning Systems: A Survey and a Regime-Switching Approach (EPub). Financial Markets, Institutions and Instruments 18: 89–137. [Google Scholar] [CrossRef]
Acharya, Viral, Thomas Philippon, Matthew Richardson, and Nouriel Roubini. 2009. The Financial Crisis of 2007–2009: Causes and Remedies. Financial Markets, Institutions and Instruments 18: 89–137. [Google Scholar] [CrossRef]
Agresti, Alan. 2018. An Introduction to Categorical Data Analysis. Hoboken: John Wiley and Sons. [Google Scholar]
Ardia, David, Keven Bluteau, Kris Boudt, and Leopoldo Catania. 2018. Forecasting Risk with Markov-Switching GARCH Models: A large-scale Performance Study. International Journal of Forecasting 34: 733–47. [Google Scholar] [CrossRef]
Ardia, David, Keven Bluteau, Kris Boudt, Leopoldo Catania, and Denis-Alexandre Trottier. 2016. Markov-Switching GARCH Models in R: The MSGARCH Package. Journal of Statistical Software. [Google Scholar] [CrossRef] [Green Version]
Arias, Guillaume, and Ulf Erlandsson. 2004. Regime Switching as an Alternative Early Warning System of Currency Crises—An application to South-East Asia. Working Papers, Department of Economics, School of Economics and Management, Lund University, Lund, Sweden. Available online: https://lup.lub.lu.se/record/1387367 (accessed on 8 March 2021).
Bauwens, Luc, Bruno De Backer, and Arnaud Dufays. 2014. Bayesian Method of Change Point Estimation with Recurrent Regimes: Application to GARCH Models. Journal of Empirical Finance 29: 207–29. [Google Scholar] [CrossRef]
Bee, Macro. 2012. Dynamic Value-at-Risk Models and the Peaks-Over-Threshold Method for Market Risk Measurement: An Empirical Investigation During a Financial Crisis. The Journal of Risk Model Validation 6: 2. [Google Scholar] [CrossRef]
Bee, Marco, and Luca Trapin. 2018. Estimating and Forecasting Conditional Risk Measures with Extreme Value Theory: A Review. Risks 6: 45. [Google Scholar] [CrossRef] [Green Version]
Bekaert, Geert, Michael Ehrmann, Marcel Fratzscher, and Arnaud Mehl. 2014. The Global Crises and Equity Market Contagion. The Journal of Finance 69: 2597–649. [Google Scholar] [CrossRef] [Green Version]
Billio, Monica, Roberto Casarin, and Anthony Osuntuyi. 2018. Markov-Switching GARCH Models for Bayesian Hedging on Energy Futures Markets. Energy Economics 70: 545–62. [Google Scholar] [CrossRef] [Green Version]
Borio, Claudio E. V., and Mathias Drehmann. 2009. Assessing the Risk of Banking Crises—Revisited. BIS Quarterly Review 29: 29–46. [Google Scholar]
Box, George E. P., Gwilym M. Jenkins, Gregory C. Reinsel, and Greta M. Ljung. 2015. Time Series Analysis: Forecasting and Control. Hoboken: John Wiley and Sons. [Google Scholar]
Bui, Dieu Tien, Tran Anh Tuan, Harald Klempe, Biswajeet Pradhan, and Inge Revhaug. 2016. Spatial Prediction Models for Shallow Landslide Hazards: A Comparative Assessment of the Efficacy of Support Vector Machines, Artificial Neural Networks, Kernel Logistic Regression, and Logistic Model Tree. Landslides 13: 361–78. [Google Scholar] [CrossRef]
Chan, K. S. 2016. Statistical Modelling of Extreme Values for Dependent Variables. Master’s thesis, South Korea University, Seoul, Korea. [Google Scholar]
Calabrese, Raffaella, and Paolo Giudici. 2015. Estimating Bank Default with Generalised Extreme Value Regression Model. Journal of the Operational Research Society 66: 1783–92. [Google Scholar] [CrossRef] [Green Version]
Carriero, Andrea, Todd E. Clark, and Massimiliano Marcellino. 2015. Bayesian VARS: Specification Choices and Forecast Accuracy. Journal of Applied Econometrics 30: 46–73. [Google Scholar] [CrossRef]
Chen, Wei, Xiaoshen Xie, Jiale Wang, Biswajeet Pradhan, Haoyuan Hong, Dieu Tien Bui, Zhao Duan, and Jianquan Ma. 2017. A Comparative Study of Logistic Model Tree, Random Forest, and Classification and Regression Tree Models for Spatial Prediction of Landslide Susceptibility. Catena 51: 147–60. [Google Scholar] [CrossRef] [Green Version]
Chinhamu, Knowledge, Chun-Kai Huang, Chun-Sung Huang, and Delson Chikobvu. 2015. Extreme Risk, Value-at-Risk and Expected Shortfall in the Gold Market. The International Business and Economics Research Journal (IBER) 14: 107–22. [Google Scholar] [CrossRef]
Cruz, Christopher John, and Dennis Mapa. 2013. An Early Warning System for Inflation in the Philippines Using Markov-Switching and Logistic Regression Models. Theoretical and Practical Research in Economic Fields (TPREF) 8: 137–52. [Google Scholar] [CrossRef]
Coles, Stuart. 2001. An Introduction to Statistical Modelling of Extreme Values. London: Springer. [Google Scholar] [CrossRef]
Davis, Philip, and Dilruba Karim. 2008. Comparing Early Warning Systems for Banking Crises. Journal of Financial Stability 4: 89–120. [Google Scholar] [CrossRef] [Green Version]
Diebold, Francis X. 2015. Perspective on the Use and Abuse of Diebold Mariano Tests. Journal of Business and Economic Statistics 33: 1–18. [Google Scholar] [CrossRef] [Green Version]
Doetsch, Patrick, Christian Buck, Pavlo Golik, Niklas Hoppe, Michael Kramp, Johannes Laudenberg, Christian Oberdörfer, Pascal Steingrube, Jens Forster, and Arne Mauser. 2009. Logistic Model Trees with AUC Split Criterion for the KDD Cup 2009 Small Challenge. In KDD-Cup 2009 Competition. Aachen: Human Language Technology and Pattern Recognition, vol. 7, pp. 77–88. [Google Scholar]
Droumaguet, Matthieu. 2012. Markov-Switching Vector Autoregressive Models: Monte Carlo Experiment, Impulse Response Analysis, and Granger-Causal. Ph.D. thesis, Department of Economics, European University Institute, San Domenico di Fiesole, Italy. [Google Scholar]
Edison, Hali. 2002. Do Indicators of Financial Crises Work? An Evaluation of an Early Warning System. An Evaluation of an Early Warning System. International Journal of Finance and Economics 8: 11–53. [Google Scholar] [CrossRef] [Green Version]
Forbes, Kristin J., and Roberto Rigobon. 2002. No contagion, only interdependence: Measuring stock market co-movements. The Journal of Finance 57: 2223–61. [Google Scholar] [CrossRef]
Fuertes, Ana-Maria, and Elena Kalotychou. 2007. Optimal Design of Early Warning Systems for Sovereign Debt Crises. International Journal of Forecasting 23: 85–100. [Google Scholar] [CrossRef]
Gagaza, Nceba, Murendeni Maurel Nemukula, Retius Chifurira, and Danielle Jade Roberts. 2019. Modelling Non-stationary Temperature Extremes in KwaZulu-Natal using the Generalised Extreme Value Distribution. In Annual Proceedings of the South African Statistical Association Conference. Port Elizabeth: Nelson Mandela Metropolitan University, pp. 1–8. [Google Scholar]
Ghatasheh, Nzaeeh. 2014. Business Analytics using Random Forest Trees for Credit Risk Prediction: A Comparison Study. International Journal of Advanced Science and Technology 72: 19–30. [Google Scholar] [CrossRef]
Heffernan, Janet E., Alec G. Stephenson, and Eric Gilleland. 2018. Ismev: An Introduction to Statistical Modelling of Extreme Values, R Package Version 1. Journal of Statistical Software. Available online: http://cran.rediris.es/web/packages/ismev/ismev.pdf (accessed on 8 March 2021).
Jeanne, Olivier, and Paul Masson. 2000. Currency Crises, Sunspots and Markov-Switching Regimes. Journal of International Economics 50: 327–50. [Google Scholar] [CrossRef] [Green Version]
Kamarudin, Muhammad Hilmi, Carsten Maple, Tim Watson, and Nader Sohrabi Safa. 2017. A Logitboost-Based Algorithm for Detecting Known and Unknown Web Attacks. IEEE 5: 26190–200. [Google Scholar] [CrossRef]
Kaminsky, Graciela, Saul Lizondo, and Carmen M. Reinhart. 1998. Leading Indicators of Currency Crises. Staff Papers 45: 1–48. [Google Scholar] [CrossRef] [Green Version]
Kolari, James, Dennis Glennon, Hwan Shin, and Michele Caputo. 2002. Predicting Large US Commercial Bank Failures. Journal of Economics and Business 54: 361–87. [Google Scholar] [CrossRef]
Makatjane, Katleho Daniel, Edward Kagiso Molefe, and Roscoe Bertrum Van Wyk. 2018a. The Analysis of the 2008 US Financial Crisis: An Intervention Approach. Journal of Economics and Behavioural Studies 10: 59–68. [Google Scholar] [CrossRef]
Makatjane, Katleho, Ntebgoang Maroke, and Diteboho Xaba. 2018b. On the Prediction of Inflation Crises of South Africa using Markov-Switching Bayesian Vector Autoregressive and Logistic Regression Models. Journal of Social Economics Research 5: 10–28. [Google Scholar] [CrossRef] [Green Version]
Maposa, Daniel. 2016. Statistics of Extremes with Applications to Extreme Flood Heights in the Lower Limpopo River Basin of Mozambique. Ph.D. thesis, Department of Mathematics and Statistical Science, School of Mathematical Sciences, University of Limpopo, Turfloop, South Africa. [Google Scholar]
Maposa, Daniel, Maseka Lesaoana, and James J. Cochran. 2016. Modelling Non-stationary Annual Maximum Flood Heights in the Lower Limpopo River Basin of Mozambique. Jàmbá: Journal of Disaster Risk Studies 8: 1–9. [Google Scholar] [CrossRef]
Maposa, Daniel, James Cochran, Maseka Lesaoana, and Caston Sigauke. 2014. Estimating High Quantiles of Extreme Flood Heights in the Lower Limpopo River Basin of Mozambique using Model based Bayesian Approach. Natural Hazards and Earth System Sciences Discussions 2: 5401–25. [Google Scholar] [CrossRef] [Green Version]
Moroke, Ntebogang Dinah. 2014. The Robustness and Accuracy of Box-Jenkins ARIMA in Modelling and Forecasting Household Debt in South Africa. Journal of Economics and Behavioural Studies 6: 748–59. [Google Scholar] [CrossRef] [Green Version]
Moysiadis, Theodoros, and Konstantnos Fokianos. 2014. On binary and Categorical Time Series Models with Feedback. Journal of Multivariate Analysis 131: 209–28. [Google Scholar] [CrossRef]
Nemukula, Maurel Murendeni. 2018. Modelling Temperature in South Africa Using Extreme Value Theory. Ph.D. thesis, School of Statistics and Actuarial Science, University of the Witwatersrand, Johannesburg, South Africa. [Google Scholar]
Pham, Binh Thai, and Indra Prakash. 2019. Evaluation and Comparison of Logitboost Ensemble, Fisher’s Linear Discriminant Analysis, Logistic Regression and Support Vector Machines Methods for Landslide Susceptibility Mapping. Geocarto International 34: 316–33. [Google Scholar] [CrossRef]
Pourghasemi, Hamid Reza, Amiya Gayen, Sungjae Park, Chang-Wook Lee, and Saro Lee. 2018. Assessment of Landslide-prone Areas and their Zonation using Logistic Regression, Logitboost, and Naïve Bayes Machine-Learning Algorithms. Sustainability 10: 3697. [Google Scholar] [CrossRef] [Green Version]
Raihan, Tasneem. 2017. Perormance of Markov-Switching GARCH Model Forecasting Inflation Uncertainty. In Res-Report. Munich: University Library of Munich. [Google Scholar]
Schimmelpfennig, Axel, Nouriel Roubini, and Paolo Manasse. 2003. Predicting Sovereign Debt Crises. Washington, DC: International Monetary Fund. [Google Scholar]
Sigaukea, Caston, Rhoda M. Makhwiting, and Maseka Lesaoana. 2014. Modelling Conditional Heteroscedasticity in JSE Stock Returns using the Generalised Pareto Distribution. African Review of Economics and Finance 6: 41–55. [Google Scholar]
Sigauke, Caston, André Verster, and Delson Chikobvu. 2012. Tail Quantile Estimation of Heteroscedastic Intraday Increases in Peak Electricity Demand. Open Journal of Statistics. 2: 435–42. [Google Scholar] [CrossRef] [Green Version]
Spyros, Makridakis. 1993. Accuracy Measures: Theoretical and Practical Concerns. International Journal of Forecasting 9: 527–29. [Google Scholar] [CrossRef]
Stephenson, Alec, and Mathieu Ribatet. 2015. evdbayes: Bayesian Analysis in Extreme Value Theory. Available online: https://cran.r-project.org/web/packages/evdbayes/evdbayes.pdf (accessed on 8 March 2021).
Stokes, Maura E., Charles S. Davis, and Gary G. Koch. 2012. Categorical Data Analysis Using SAS. Cary: SAS Institute. [Google Scholar]
Theil, Henri. 1971. An Economic Theory of the Second Moments of Disturbance of Behavioural Equations. The American Economic Review 61: 190–94. [Google Scholar]
Troug, Hayatem, and Matt Murray. 2020. Crisis Determination and Financial Contagion: An Analysis of the Hong Kong and Tokyo Stock Markets using an MSBVAR Approach. Journal of Economic Studies. [Google Scholar] [CrossRef]
Tsay, Ruey. 2014. An Introduction to the Analysis of Financial Data with R. Hoboken: John Wiley and Sons. [Google Scholar]
Wallison, Peter. 2011. Financial Market Regulation. In Did the Repeal of Glass-Steagall Have any Role in the Financial Crisis? Not Guilty. Not Even Close. New York: Springer, pp. 19–29. [Google Scholar] [CrossRef] [Green Version]
Wang, Lu, Feng Ma, Tianjiao Niu, and Chengting He. 2020. Crude Oil and BRICS Stock Markets under Extreme Shocks: New Evidence. Economic Modelling 86: 54–68. [Google Scholar] [CrossRef]
Zhang, Peter. 2010. Time Series Forecasting Using a Hybrid ARIMA and Neural Network Model. Neurocomputing 50: 159–75. [Google Scholar] [CrossRef]

1	Note that “k” is the number of regime switch, “p” is the order of the GARCH process and “q” is the order of ARCH process.
2	The index used has been kept in its original currency to avoid exchange rates fluctuations. The data has been collected from South African Stock Exchange and the index used is FTSE/JSE-ALSI.

Figure 1. Intraday Loses on JSE-ALSI.

Figure 2. Residuals; probability density and the Q-Q plot.

Figure 3. Smoothed, Filtered and Predicted Probabilities.

Table 1. Descriptive Statistics for intra-day loses.

	Skewness	Kurtosis	Frosini	Hegazy1	Anderson-Darling
JSE-ALSI	−4.072	5.930	1.357 (0.0002)	0.104 (0.0002)	28.066 (0.0002)

Key: values in () are probability values of Frosini, hegazy1 and Anderson-Darling.

Table 2. Naive Bayesian Estimates of MS(2)-EGARCH(1,1).

		Regime 1
Coefficient	Estimate	Std. Error	t-value	p-value
$\hat{γ}$	−0.675	0.003	61.215	0.000
$\hat{ξ}$	0.039	0.062	91.515	0.000
$\hat{α}$	−0.148	0.088	147.00	0.000
$\hat{β}$	0.873	0.119	107.29	0.000
$\hat{δ}$	1.042	0.198	747.29	0.000
$\hat{μ}$	56.10	0.004	37.811	0.000
$\hat{ν}$	0.878	0.045	13.025	0.000
		Regime 2
$\hat{γ}$	−0.415	0.004	4.506	0.000
$\hat{ξ}$	0.132	0.006	6.170	0.000
$\hat{α}$	−0.165	0.005	30.034	0.000
$\hat{β}$	0.905	0.001	67.067	0.000
$\hat{δ}$	6.234	0.002	87.985	0.000
$\hat{μ}$	86.24	0.065	60.098	0.000
$\hat{ν}$	0.882	0.006	14.056	0.000
$\hat{ν}$	0.882	0.006	14.056	0.000
		Transition Probabilities
	Regime 1		Regime 2
	0.9836		0.0164
	0.0306		0.9694

Table 3. Naive Bayesian Estimates for Generalised Extreme Value Distribution (GEVD).

	Block	Minima	$\hat{ξ}$	$se (\hat{ξ})$	$\hat{σ}$	$se (\hat{σ})$	${\hat{μ}}_{0}$	$se ({\hat{μ}}_{0})$	95% CI for $\hat{ξ}$
JSE-ALSI	2400	482	−0.411	0.024	0.053	0.018	0.071	0.026	( −0.459, −0.363 )

Table 4. Comparative Analysis.

Test	SARIMA	MS-EGARCH	GEVD	SARIMA-MS-EGARCH-GEVD
LL	708	742	781	622
Rank	2	3	4	1
AIC	−1312	−1455	−1535	−1571
Rank	4	3	2	1
BIC	−1794	−1573	−1653	−1889
Rank	2	4	3	1
MAE	0.9023	0.7988	0.7777	0.7930
Rank	4	3	1	2
MAPE	1.247	1.004	1.002	0.879
Rank	4	3	2	1
RMSE	1.377	1.0016	0.9908	0.9947
Rank	4	3	1	2
TIC	0.6755	0.4155	0.4363	0.4145
Rank	4	2	3	1

Table 5. Probabilities of correct and incorrect predicted Extreme daily.

	High Extreme Losses	Low Extreme Losses
Signal Issued	$P (1, 1)$ Correct call of Losses [A]	$P (1, 2)$ Type II Error or Wrong Signal [B]
No Signal Issued	$P (2, 1)$ Type I error or Missing Signal [C]	$P (2, 2)$ Correct call for non-event [D]

Table 6. Probabilities of correct and incorrect predicted Extreme daily.

Training Data	High Extreme Losses			Low Extreme Losses		Total
Predicted	High Extremes	1200	(63% )	398		1598
	Low Extremes	703		106	(21%)	809
Total		1903		504		2407
Validation data	High Extreme Losses			Low Extreme Losses		Total
Predicted	High Extreme Losses	210	(51%)	99		309
	Low extreme Losses	204		16	(14%)	220
Total		414		115		529

The first set of numbers represents counts while figures in parentheses represent percentages of correctly predicted extreme losses.

Table 7. Performance of the EWS.

	Training Data	Validation Data
PECC	63%	51%
PNECC	85%	98%
POCC	67%	86%
ANSR	78%	99%
PRGS	43 %	57%
PRGNS	29%	48%
PFE	21%	14%

Table 8. Sensitivity Performance of EWS.

	Validation Data
Concordance	89.15%
AUC	88.78%
Sensitivity	79.89%
Specificity	98.40%

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Makatjane, K.; Moroke, N. Predicting Extreme Daily Regime Shifts in Financial Time Series Exchange/Johannesburg Stock Exchange—All Share Index. Int. J. Financial Stud. 2021, 9, 18. https://0-doi-org.brum.beds.ac.uk/10.3390/ijfs9020018

AMA Style

Makatjane K, Moroke N. Predicting Extreme Daily Regime Shifts in Financial Time Series Exchange/Johannesburg Stock Exchange—All Share Index. International Journal of Financial Studies. 2021; 9(2):18. https://0-doi-org.brum.beds.ac.uk/10.3390/ijfs9020018

Chicago/Turabian Style

Makatjane, Katleho, and Ntebogang Moroke. 2021. "Predicting Extreme Daily Regime Shifts in Financial Time Series Exchange/Johannesburg Stock Exchange—All Share Index" International Journal of Financial Studies 9, no. 2: 18. https://0-doi-org.brum.beds.ac.uk/10.3390/ijfs9020018

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Predicting Extreme Daily Regime Shifts in Financial Time Series Exchange/Johannesburg Stock Exchange—All Share Index

Abstract

1. Introduction

Literature Review

2. Materials and Methods

2.1. SARIMA-(p,d,q)(P,D,Q)-MS(K)-EGARCH(p,q)-GEVD

2.2. Logistic Model Tree

2.3. Bayesian Markov-chain-Monte-Carlo Framework

2.4. Forecasting Performance and Predictive Accuracy

3. Results

3.1. SARIMA-MS-EGARCH-GEVD Model

3.2. Generalised Extreme Value Distribution Framework

3.3. Comparative Analysis

3.4. Evaluation of the Performance of the Logit-Based EWS Model

3.5. Model Performance Evaluation

4. Conclusions and Recommendations

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI