Next Article in Journal
Acknowledgement to Reviewers of Risks in 2017
Previous Article in Journal
Company Value with Ruin Constraint in a Discrete Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Simple Traffic Light Approach to Backtesting Expected Shortfall

by
Nick Costanzino
1,2,* and
Michael Curran
3,*
1
Barclays Capital, 745 7th Ave, New York, NY 10019, USA
2
Department of Finance and Risk Engineering, NYU Tandon School of Engineering, New York, NY 11201, USA
3
Riskcare, New York, NY 10018, USA
*
Authors to whom correspondence should be addressed.
Submission received: 11 September 2017 / Revised: 3 January 2018 / Accepted: 4 January 2018 / Published: 9 January 2018

Abstract

:
We propose a Traffic Light approach to backtesting Expected Shortfall which is completely consistent with, and analogous to, the Traffic Light approach to backtesting VaR (Value at Risk) initially proposed by the Basel Committee on Banking Supervision in their 1996 consultative document Basle Committee on Banking Supervision (1996). The approach relies on the generalized coverage test for Expected Shortfall developed in Costanzino and Curran (2015).

1. Introduction

Even before the initial Basel Committee consultative document (Bank for International Settlements 2012) there had been a push by both risk managers and academics to replace VaR (Value at Risk) with another risk measure that addresses VaR’s deficiencies. In particular, coherent risk measures (Acerbi and Taschea 2002a, 2002b; Hall 2007) satisfy the basic desired properties required by a risk measure as outlined in Artzner et al. (1999). Expected Shortfall (ES) is the natural choice among all coherent risk measures, and therefore there is no surprise that it has been chosen by the Basel Committee as the risk measure to replace VaR. However, unlike the case of VaR, there is no well-established backtesting framework for Expected Shortfall. Indeed, the current Basel proposal to backtest ES at the 97.5 quantile is to backtest the related VaR estimate at the 97.5 and 99 quantiles, which is a grossly insufficient test. Nevertheless, some recent backtesting methods have been proposed including, but not limited to, Acerbi and Szekeley (2014); Costanzino and Curran (2015); Du and Escanciano (2017); Fissler et al. (2015); Gordy et al. (2017); Kratz et al. (2016).
The main result of this note is the development of a Traffic Light backtest for Expected Shortfall which extends the Traffic Light backtest for VaR. The test relies on the computation of critical values derived from the finite-sample distribution of the ES test statistic (9) first introduced in Costanzino and Curran (2015).
The note is organized as follows. In Section 2 we briefly review the VaR Traffic Light to provide context for our corresponding test for ES. In Section 3 we define the Traffic Light test for ES and compute the distribution of the finite-sample statistic from which we calculate the critical values using a numerical root-finding algorithm. Finally, in Section 4 we discuss the test and some implications.

2. Review of the VaR Traffic Light Test

Let { t i } i = 0 N be a sequence of historical trading days and { L i } i = 1 N the corresponding realized trading losses. The most basic approach to assessing the accuracy of a VaR forecast calculation for those trading days is to backtest using the VaR Coverage Test which essentially counts the number of VaR breaches. This leads to the Traffic Light approach to backtesting VaR originally proposed by the Basel Committee for Banking Supervision in Basle Committee on Banking Supervision (1996), which we describe below.
For each i = 1 , , N , let VaR i ( α ) denote the forecast VaR at level α defined by
VaR ( α ) : = inf { z R : F L ( z ) α }
where F L is the cumulative distribution of the random loss variable L. For each trading day i we define the VaR breach indicator X VaR ( i ) : [ 0 , 1 ] { 0 , 1 } as
X VaR ( i ) ( α ) : = 1 { L i V a R i ( α ) } = 0 if L i > V a R i ( α ) 1 if L i V a R i ( α ) .
That is, X VaR ( i ) keeps track of whether a breach occurred for trading day i. Then, the total number of breaches over all N trading days, denoted by X VaR N : [ 0 , 1 ] { 0 , 1 , 2 , , N } , is
X VaR N ( α ) : = i = 1 N 1 { L i VaR i ( α ) }
Under the null hypothesis that the VaR model is correct, E [ X VaR N ( α ) ] = N α . Thus, for the Basel parameters α = 1 % and N = 250 , we expect 2.5 breaches. Of course, in any backtest it is very rare that one observes exactly 2.5 breaches (in fact impossible since X VaR N must be an integer), and thus we appeal to statistical analysis to understand the probability of obtaining significantly fewer or more breaches than would be expected if we had a correct model. For fixed N and level α we define the cumulative probability Ψ VaR α , N of obtaining x or fewer breaches as
Ψ VaR α , N ( x ) : = P [ X VaR N ( α ) x ]
The Basel Committee on Banking Supervision proposed a Traffic Light approach to statistical significance of VaR breaches in their 1996 document Basle Committee on Banking Supervision (1996). Therein the Basel Committee defines three color zones through cumulative probabilities of the number of realized VaR breaches. The Green Zone is defined as the number of breaches under the null hypothesis whereby the cumulative probability of obtaining that many breaches or fewer is less than 95 % . The Yellow Zone is defined as the number of breaches whereby the cumulative probability of obtaining that many breaches or fewer is greater than 95 % but less than 99 % . Finally, the Red Zone is defined by a cumulative probability of 99.99 % or more. Thus, the boundary between the Green and Yellow zones is defined as the largest integer x such that Ψ VaR α , N ( x ) < 0.95 and the boundary between the Yellow and Red zones is similarly defined as the largest integer x such that Ψ VaR α , N ( x ) < 0.9999 .
Table 1 gives the resulting color zones for different breaches values under the VaR Basel parameters α = 1 % and N = 250 observations. The true Binomial Null Distribution is used to compute the Cumulative Probabilities rather than the asymptotic Normal distribution.

3. Derivation of the Expected Shortfall Traffic Light Test

We now define a Traffic Light approach to backtesting Expected Shortfall based on the Coverage Test in Costanzino and Curran (2015). The test relies on an appropriate extension of the VaR breach indicator (2) to the case of ES. The resulting new breach indicator (6) takes into account the severity of the breach (i.e., losses beyond the VaR level) and is a continuous variable rather than discrete.
We begin the derivation by defining Expected Shortfall as
ES ( α ) : = 1 α 0 α VaR ( p ) d p
In analogy to X VaR ( i ) (2), we define the ES generalized breach indicator X ES ( i ) : [ 0 , 1 ] [ 0 , 1 ] by
X ES ( i ) ( α ) : = 1 α 0 α 1 { L i VaR i ( p } d p
= 1 F L ( L i ) α severity of the breach 1 { L i VaR i ( α ) }
= θ ( i ) ( α ) · X VaR ( i )
where have used (2) and have set θ ( i ) ( α ) = 1 F L ( L i ) / α , where F L is the cumulative distribution implicitly defined in (1). We note that compared to X VaR ( i ) (2), X ES ( i ) (6) has an extra term F ( L i ) / α which determines the severity of the breach. That is, suppose L i = VaR i . Then F ( L i ) / α = 1 so X ES ( i ) = 0 whereas X VaR ( i ) = 1 . On the other hand suppose L i is very negative. Then, F ( L i ) = 0 so that X ES ( i ) = 1 and similarly X VaR ( i ) = 1 . Thus, X ES ( i ) keeps track of whether a breach happened on trading day i as well as the severity. Then, the total severity of breaches over all N trading days, denoted by X ES N : [ 0 , 1 ] [ 0 , N ] , is
X ES N ( α ) : = i = 1 N 1 α 0 α 1 { L i VaR i ( p ) } d p = i = 1 N 1 F ( L i ) α 1 { L i VaR i ( α ) } = i = 1 N θ ( i ) ( α ) X VaR ( i ) ( α )
For fixed N and level α we define the cumulative probability Ψ ES α , N of obtaining x or fewer breaches as
Ψ ES α , N ( x ) : = P [ X ES N ( α ) x ]
Therefore, for any quantile q, we can compute the corresponding Generalized Breach Value x by inverting the equation
sup x [ 0 , ) Ψ ES α , N ( x ) < q
Note that in the case of the VaR Traffic Light Test (see Table 1), it makes sense to compute the quantiles for different breach values (i.e., 1 , 2 , , 10 ). For Expected Shortfall, the breach indicator is a continuous variable and it no longer makes sense to choose the breach value and compute an associated quantile. Rather, we choose the quantile and then invert to obtain the corresponding breach value. In particular, we borrow the color zone boundaries from the VaR Traffic Light Test, which yield a Green Zone if q < 0.95 , Yellow Zone if 0.95 q < 0.9999 , and Red Zone if 0.9999 q ; i.e.,
Boundary G Y : = sup x [ 0 , ) Ψ ES α , N ( x ) < 0.95
and the boundary between the Yellow and Red zones is given by
Boundary Y R : = sup x [ 0 , ) Ψ ES α , N ( x ) < 0.9999
To compute these boundaries, and other values of x one needs to compute the distribution of the test statistic X ES N ( α ) under the null-hypothesis H 0 given by
H 0 : { X ES ( i ) } i = 1 N i . i . d . i j , and P [ L i VaR i ( p ) ] = p p [ 0 , α ] .
A similar argument as in Costanzino and Curran (2015) shows that for any α ( 0 , 1 ) ,
N X ES N ( α ) μ ES N [ ] D N ( 0 , σ ES 2 )
where
μ ES = 1 2 α N
σ ES 2 = α 4 3 α 12
and hence
lim N X ES N ( α ) N ( μ ES , σ ES 2 )
Hence, as a crude approximation, we can compute (12) and (13) using the asymptotic test distribution (18) to obtain Boundary G Y approx = 5.4768 and Boundary Y R approx = 9.2229 . These values are approximate since they use the asymptotic distribution Ψ ES α , ( x ) of the test statistic rather than the finite-sample one Ψ ES α , N ( x ) . We now derive the finite-sample distribution Ψ ES α , N ( x ) and use a numerical root-finding procedure to accurately estimate the critical values.
The derivation of the ES Traffic Light test relies on the computation of the finite-sample cumulative distribution of the test statistic X E S N ( α ) (9). A key observation in the derivation is that under the null-hypothesis, the distribution of X ES ( i ) ( α ) conditional on a breach having occurred is uniform in the α -tail, and thus using the law of total probability we have
P [ X E S N ( α ) x ] = n = 1 P X ES N ( α ) x X VaR N ( α ) = n · P [ X VaR N ( α ) = n ] = n = 1 I n ( x ) · B ( n , N , α )
where I n ( x ) is the Irwin–Hall distribution (c.f. Hall 1927; Irwin 1927; Marengo et al. 2017) defined by
I n ( x ) = 1 2 ( n 1 ) ! k = 0 n ( 1 ) k n k ( x k ) n 1 sgn ( x k )
and B ( n , N , α ) binomial probability mass function
B ( n , N , α ) = n N α n ( 1 α ) N n
We then use this probability calculation (19) and a root-finding algorithm to solve the equation
P [ X ES N ( α ) Boundary ] = q
for Boundary where q is the appropriate quantile level. In particular assuming the Basel parameters α = 2.5 % and N = 250 , then for q = q GY = 0.05 and q = q YR = 0.0001 we obtain
Boundary G Y = 5.7049
Boundary Y R = 9.8833
Table 2 gives the resulting quantiles and color zones for different breach values under the ES Basel parameters α = 2.5 % and N = 250 observations where the cumulative probabilities were computed using (19). Of particular note is the breach values and cumulative probabilities for Expected Shortfall at the 97.5 quantile (i.e., α = 2.5 % ) are very similar to the VaR values at the 99 quantile (i.e., α = 1 % ). In addition, the finite-sample Breach Value at the 50th quantile (3.0276) is very similar to the asymptotic Breach Value at the 50th quantile ( α 2 N = 250 × 0.025 / 2 = 3.125 ) . Furthermore, note that in the case of Expected Shortfall, the breach values are continuous, and therefore infinitesimally small changes in breach value may result in a change in the color zone.

4. Discussion

First, the values and quantiles for VaR at α = 1 % are similar to the values and quantiles for ES at α = 2.5 % . This happens because there are more VaR breaches at α = 2.5 % than at α = 1 % , but the severity of the breach in ES is smaller than unity so these two mechanisms average each other out.
We also note that along with color zones, the Basel document (Basle Committee on Banking Supervision 1996) defines market risk capital multipliers based on the cumulative probability C VaR of the number of realized exceptions, X VaR . In particular, a multiplier k VaR ranging from 0 to 1 is given depending on the number of breaches; i.e., k VaR = f VaR ( C VaR ) for some function f VaR . The same can obviously be done for Expected Shortfall; i.e., k ES = f ES ( C ES ) for some function f ES . However, the continuous nature of the breach values from (9) leads to the need for k ES to be a continuous function so as to avoid the case where small changes in breach value give rise to large changes in multiplier.

5. Conclusions

By defining an appropriate breach statistic (6) that measures the severity of each breach and using the results in (Costanzino and Curran 2015; Clift et al. 2015), we propose a Traffic Light test for Expected Shortfall using the finite-sample distribution of the test statistic under the null hypothesis. The test itself as well as the actual values of the zone boundaries are analogous to the Basel Traffic Light test for VaR.

Acknowledgments

The authors would like to thank the three anonymous referees for helpful suggestions, as well as Marco Naldi (Barclays) for a careful reading of the manuscript.

Author Contributions

The two authors contribute equally to this article.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Acerbi, Carlo, and Balazs Szekely. 2014. Back-Testing Expected Shortfall. Risk Magazine, November. [Google Scholar]
  2. Acerbi, Carlo, and Dirk Tasche. 2002a. Expected Shortfall: A natural coherent alternative to Value at Risk. Economic Notes 31: 379–88. [Google Scholar] [CrossRef]
  3. Acerbi, Carlo, and Dirk Tasche. 2002b. On the Coherence of Expected Shortfall. Journal of Banking & Finance 26: 1487–503. [Google Scholar]
  4. Artzner, Philippe, Freddy Delbaen, Jean-Marc Eber, and David Heath. 1999. Coherent Measures of Risk. Mathematical Finance 9: 203–28. [Google Scholar] [CrossRef]
  5. Basle Committee on Banking Supervision. 1996. Supervisory Framework For The Use of Back-Testing in Conjunction With The Internal Models Approach to Market Risk Capital Requirements. Available online: www.bis.org/publ/bcbs22.htm (accessed on 7 January 2018).
  6. Costanzino, Nick, and Mike Curran. 2015. Backtesting General Spectral Risk Measures with Application to Expected Shortfall. Available online: https://ssrn.com/abstract=2514403 (accessed on 7 January 2018).
  7. Clift, Simon, Nick Costanzino, and Mike Curran. 2015. Comparison of Backtesting Methods for Expected Shortfall. Preprint. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2618345 (accessed on 7 January 2018).
  8. Du, Zaichao, and Juan Carlos Escanciano. 2017. Backtesting Expected Shortfall: Accounting for Tail Risk. Management Science 63: 940–58. [Google Scholar] [CrossRef]
  9. Fissler, Tobias, Johanna F. Ziegel, and Tilmann Gneiting. 2015. Expected Shortfall is Jointly Elicitable with Value at Risk—Implications for Backtesting. Risk Magazine, December 21. [Google Scholar]
  10. Bank for International Settlements. 2012. Fundamental Review of the Trading Book: A Revised Market Risk Framework. Consultative Document. Basel: Basel Committee on Banking Supervision. [Google Scholar]
  11. Gordy, Michael B., Hsiao Yen Lok, and Alexander J. McNeil. 2017. Spectral backtests of forecast distributions with application to risk management. arXiv Preprint, arXiv:1708.01489. [Google Scholar]
  12. Hall, Philip. 1927. The distribution of means for samples of size N drawn from a population in which the variate takes values between 0 and 1, all such values being equally probable. Biometrika 19: 240–45. [Google Scholar] [CrossRef]
  13. Hull, John. 2007. VaR versus Expected Shortfall. Risk Magazine, March 1. [Google Scholar]
  14. Irwin, Joseph Oscar. 1927. On the frequency distribution of the means of samples from a population having any law of frequency with finite moments, with special reference to Pearson’s Type II. Biometrika 19: 225–39. [Google Scholar] [CrossRef]
  15. Kratz, Marie, Yen H. Lok, and Alexander J. McNeil. 2016. Multinomial VaR Backtests: A simple implicit approach to backtesting expected shortfall. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2898688 (accessed on 7 January 2018).
  16. Marengo, James E., David L. Farnsworth, and Lucas Stefanic. 2017. A Geometric Derivation of the Irwin-Hall Distribution. International Journal of Mathematics and Mathematical Sciences 2017: 3571419. [Google Scholar] [CrossRef]
Table 1. Traffic Light zone boundaries are computed assuming α = 1 % and N = 250 observations.
Table 1. Traffic Light zone boundaries are computed assuming α = 1 % and N = 250 observations.
Basel Traffic Light Approach to VaR
ZoneBreach ValueCumulative Probability
Green08.11%
128.58%
254.32%
375.81%
489.22%
Yellow595.88%
698.63%
799.60%
899.89%
999.97%
Redmore than 1099.99%
Table 2. Expected Shortfall Traffic Light zone boundaries are computed assuming α = 2.5 % and N = 250 observations.
Table 2. Expected Shortfall Traffic Light zone boundaries are computed assuming α = 2.5 % and N = 250 observations.
Traffic Light Approach to Expected Shortfall
ZoneGeneralized Breach ValueCumulative Probability
Green00.18%
1.392910%
2.113125%
3.027650%
4.052075%
5.062290%
5.704995%
Yellow5.704995%
6.984499%
8.528599.9%
9.883399.99%
Redmore than 9.883399.99%

Share and Cite

MDPI and ACS Style

Costanzino, N.; Curran, M. A Simple Traffic Light Approach to Backtesting Expected Shortfall. Risks 2018, 6, 2. https://0-doi-org.brum.beds.ac.uk/10.3390/risks6010002

AMA Style

Costanzino N, Curran M. A Simple Traffic Light Approach to Backtesting Expected Shortfall. Risks. 2018; 6(1):2. https://0-doi-org.brum.beds.ac.uk/10.3390/risks6010002

Chicago/Turabian Style

Costanzino, Nick, and Michael Curran. 2018. "A Simple Traffic Light Approach to Backtesting Expected Shortfall" Risks 6, no. 1: 2. https://0-doi-org.brum.beds.ac.uk/10.3390/risks6010002

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop