Next Article in Journal
Information-Theoretic Measures and Modeling Stock Market Volatility: A Comparative Approach
Previous Article in Journal
Risk Management and Financial Stability in the Polish Public Hospitals: The Moderating Effect of the Stakeholders’ Engagement in the Decision-Making
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Empirical Evidences on the Interconnectedness between Sampling and Asset Returns’ Distributions

1
Department of Economics and Finance, Università degli Studi di Bari Aldo Moro, Via C. Rosalba 53, 70124 Bari, Italy
2
Department of Methods and Models for Economics, Università degli Studi di Roma “La Sapienza”, Territory and Finance, Via del Castro Laurenziano 9, 00185 Roma, Italy
*
Author to whom correspondence should be addressed.
Submission received: 9 March 2021 / Revised: 20 April 2021 / Accepted: 23 April 2021 / Published: 8 May 2021

Abstract

:
The aim of this work was to test how returns are distributed across multiple asset classes, markets and sampling frequency. We examine returns of swaps, equity and bond indices as well as the rescaling by their volatilities over different horizons (since inception to Q2-2020). Contrarily to some literature, we find that the realized distributions of logarithmic returns, scaled or not by the standard deviations, are skewed and that they may be better fitted by t-skew distributions. Our finding holds true across asset classes, maturity and developed and developing markets. This may explain why models based on dynamic conditional score (DCS) have superior performance when the underlying distribution belongs to the t-skew family. Finally, we show how sampling and distribution of returns are strictly connected. This is of great importance as, for example, extrapolating yearly scenarios from daily performances may prove not to be correct.
JEL Classification:
G10; C10; C20; C16

1. Introduction

The aim of this article was to investigate the interconnectedness between sampling and asset returns’ distributions. To this end, we empirically perform a number of analyses across asset classes, markets and for several sampling frequencies. The topic is quite relevant as both vendors and financial institutions may rely on scenarios generated under the assumption that financial series returns are normally distributed. There are some works which claim that standardized daily returns “are approximately unconditionally normally distributed” Andersen et al. (2001) or that “are IID Gaussian, with variance equal to 1” Rogers (2018).
A more realistic work hypothesis is that time series follow a t-skew distribution. The t-skew distribution can be seen as a mixture of skew-normal distributions Kim (2001) which generalize the normal distribution thanks to an extra parameter regulating the skewness. By construction, then, they can model heavy tails and skews that are common in financial markets. Thus, their adoption in finance is gaining momentum for modeling distributions Harvey (2013) and risk Gao and Zhou (2016). Further, t-skew has the power to link-up with observation-driven models such as the dynamic conditional score (DCS) Creal et al. (2013) or based on data partitioning Orlando et al. (2019, 2020). This paper tries to help in gaining insights on returns’ distributions and on the most suitable way of fitting them.
In particular, according to the tests carried out on our dataset, the distributions of log-returns do not seem to be normally distributed. The same applies on the returns standardized by the standard deviation. In a different context, Tiwari and Gupta (2019) found that the Jarque–Bera test strongly rejects the hypothesis of Gaussian distribution for all considered time series concerning G7 stock markets. Therefore, through the paper, we report a number of tests to decide the better distribution between Gaussian, t-skew, generalized hyperbolic, generalized Pareto and exponential Pareto. In terms of applications, being able to correctly identify the distribution is important for risk management as the tail conditional expectation provides information about the mean of the tail of the loss distribution, “while the tail variance (TV) measures the deviation of the loss from this mean along the tail of the distribution” Eini and Khaloozadeh (2020). Another application is in option pricing. For instance, Yeap et al. (2018) propose a t-skew model with “a fat-tailed, skewed distribution and infinite-activity (pure jump) stock dynamics, which is achieved through modeling the length of time intervals as stochastic”.
Having described the framework of our investigation, now we are in position to perform some tests on swaps, equities (for both developed and emerging markets) and corporate bonds (for both developed and emerging markets) sampled on weekly, monthly and yearly basis. Section 2 contains a literature review, Section 3 describes the dataset and the methods we intend to adopt for our analysis, Section 4 reports the results we obtained on the original data as well as on the time series of the rescaled returns, the last Section 5 draws the conclusions.

2. Literature Review

Distribution of returns is important because econometric models depend upon specific distributional assumptions, and in the case of implied volatilities, on further assumptions concerning the distributional and dynamic properties of stock market volatility. Wrong assumptions call into question the robustness of findings based on those models. One may always opt for an alternative approaches such as those based on squared returns over a given horizon, that provides model-free unbiased estimates of the ex-post realized volatility. Unfortunately, however, “squared returns are also a very noisy volatility indicator and hence do not allow for reliable inference regarding the true underlying latent volatility” Andersen et al. (2001). To overcome such limitations, Andersen et al. (2001) suggested a model free volatility estimate by summing squares and cross-products of intraday high-frequency returns. That approach, however, relies on a reliable high-frequency return observation which, often is not guaranteed. Moreover, it is not necessarily true that the characteristics of a time series are independent on the time horizon and the sampling frequency so that, for example, one may extrapolate seeminglessly from daily data monthly or yearly distributions. Furthermore, time horizon and sampling frequency not only may influence the moments of a given distribution of returns but, also, the way in which data are hierarchically and spatially organized Tumminello et al. (2007).
According to McNeil et al. (2015), a bivariate t-Student distribution can describe a pair of daily stock returns. In fact, multivariate stock returns could be modeled by the means of a t-Student copula Aas et al. (2009) and Nikoloulopoulos et al. (2012). However, t-Student imposes symmetric dependence on the joint upper and lower tails which contrasts with empirical studies, e.g., Ang and Chen (2002); Longin and Solnik (2001); Patton (2006).
Among alternatives, parametric approaches skew normal distributions, as introduced by Azzalini (1985) and Henze (1986), have became quite popular because they suit well in modeling skewed data defined as follows
f ( z ; θ ) = 2 φ ( z ) Φ ( z θ ) z R ,
where θ is the parameter controlling the skewness and φ and Φ denote the N ( 0 , 1 ) are the standard normal density and the normal cumulative distribution, respectively.
By further enhancing those distributions, Kim (2001) proposes a family of t-skew distributions in terms of a scale mixture of skew-normal distributions.
The random variable X is said to be t-skew distributed with parameter θ and ν if its probability density function is
f ( x ; θ , ν ) = 2 E [ z 1 / 2 φ ( z 1 / 2 x ) Φ ( z 1 / 2 θ x ) | z ] x R ,
where z Γ ( ν / 2 , 2 / ν ) , φ and Φ are the standard normal density and the normal cumulative distribution, respectively.
The salient features of the family of t-skew distributions are their mathematical tractability, the inclusion of the normal law and the shape parameter regulating the skewness, apart from the ability of fitting heavy-tailed and skewed data, scale mixtures of skew-normal densities come from a family of t-skew distributions.
Such an extension allows for a continuous variation from normality to non-normality and it has found a number of applications on fitting heavy-tailed and skewed data. Other applications of such distributions are related to copula modeling. Yoshiba (2018) found that the AC t-skew copula describes well the dependence structures of both “unfiltered daily returns and GARCH or EGARCH filtered daily returns of the three stock indices: the Nikkei225, S&P500 and DAX”. This is because financial time series are characterized by asymmetry. For instance, Patton (2006) reported evidence that “the mark–dollar and yen–dollar exchange rates are more correlated when they are depreciating against the dollar than when they are appreciating”. A drawback of t-skew models is that it is more difficult to handle compared to Gaussian distributions and that there is no closed form analytic formula for computing the elements of the expected information matrix. However, numerical methods are available, e.g., Martin et al. (2020).
Moreover, the multiple questions related to parametric models and model-free have led to further development of observation-driven models such as the so-called dynamic conditional score (DCS) Creal et al. (2013) where the updating of the score function is a mechanism that acts as a kind of partitioning of the dataset Lavielle and Teyssiere (2006); Orlando et al. (2020). DCS models, often based on t-skew distributions, have been conceived for describing the distribution of returns Harvey (2013) and they find a number applications in finance from forecasting Value at Risk (VaR) and expected shortfall (ES) Gao and Zhou (2016) to FX Ayala and Blazsek (2019), from commercial and residential mortgage defaults Babii et al. (2019) to hedging for crude oil future Gong et al. (2019). For a review, see Blazsek and Licht (2020).

3. Data and Methods

3.1. Data

In order to have a representative dataset, we diversified the investigation across asset classes (equity, bonds, swaps), maturity (from 1 month to 10 years), issuer (government, corporate), market (developed, emerging). In Table 1, we report the data as retrieved from Ice Data Indices and Bloomberg.

3.2. Volatility Rescaled Returns

Following Rogers (2018), we test if the rescaled log returns series of our dataset are or not normally distributed. In order to rescale the (standardized) returns X i , we let
X ˜ i + 1 = X i + 1 σ ^ i + 1
where
σ ^ i + 1 2 = β Y 2 + ( 1 β ) σ ^ i
with
Y = m a x ( K σ ^ i , m i n ( K σ ^ i , X i ) ) ,
and
σ ^ 0 2 = 1 n i = 1 n X i 2 .
Firstly, we set the parameters ( K , β ) equal to ( 4 , 0.025 ) , as suggested by the author. In a second moment, we compute ( K * , β * ) by solving the following optimization problem
( K * , β * ) = a r g min ( K , β ) i F i F ^ i ( K , β ) 2 ,
where F ^ is the empirical CDF of the series defined in (3) and F is the (standard) normal CDF (evaluated at the same points of F ^ ).

3.3. Methods

In this section, we describe the statistical procedure that will be carried out in previous Section 3.1 on different return time series. For each series, we analyze weekly, monthly and yearly. As yearly data may display high levels of autocorrelation that can alter model’s forecasts, we randomly shuffle those returns and we check their properties as well.
Among the analysis we perform, we mention the moments, the histograms and the so-called quantile-quantile (Q-Q) plot Wilk and Gnanadesikan (1968) where we consider the normal distribution versus the t-skew distribution, etc.

3.3.1. Analysis on the Normality of Returns

Kolmogorov–Smirnov Normality Test

To confirm evidence on the graphical analysis resulting from the (Q-Q) plot, we use the Kolmogorov–Smirnov normality (K-S) test Kolmogorov (1933); Stephens (1992). It is a nonparametric test of the equality of probability distributions that can be used to compare a sample with one reference probability distribution (one-sample K–S test). The Kolmogorov–Smirnov statistic is
D = sup x R | F ^ ( x ) F ( x ) | ,
where F ^ ( x ) , F ( x ) are the empirical distribution function and the theoretical distribution chosen as benchmark, respectively. For large n (being n the sample size), the null hypothesis H 0 (i.e., the sample is drawn from the reference distribution) is rejected at level α if
D > ln ( α / 2 ) n .
Notice that in our tests we use α = 0.01 , as usual.

Dvoretzky–Kiefer–Wolfowitz Bounds

A second comparison between the empirical and the normal CDF for a given time series, is based on the Dvoretzky–Kiefer–Wolfowitz (DKW) inequality Dvoretzky et al. (1956). To assess how close the above-mentioned CDFs are, given ε > 0 , one has to solve the following one-sided estimate
P sup x R F ^ ( x ) F ( x ) > ε e 2 n ε 2 ε ln 2 2 n ,
which also implies a two-sided estimate
P sup x R | F ^ ( x ) F ( x ) | > ε 2 e 2 n ε 2 ε > 0 .
This strengthens the Glivenko–Cantelli Theorem Tucker (1959) by quantifying the rate of convergence as n goes to infinity; it also estimates the tail probability of the Kolmogorov–Smirnov statistic D.
The interval that contains the true CDF F ( x ) , with probability 1 α is often specified as [ F ^ ( x ) ε , F ^ ( x ) + ε ] , where
ε = ln ( 2 / α ) 2 n .
Finally we introduce the variable, named "DKW exceeds", which enumerates the percentage of points of F that exceed the DKW upper and lower bounds.

3.3.2. Comparison with Other Distributions

In order to enforce our thesis, we compare the t-skew distribution with the following distributions.
  • Generalized hyperbolic (GH) distribution,
    f ( x ; λ , α , β , δ , μ ) = γ λ 2 π δ λ + 1 γ K λ · α λ + 1 / 2 δ 2 + ( x μ ) 2 K λ 1 / 2 ( δ 2 + ( x μ ) 2 ) 1 / 4 λ / 2 ( x R ) ,
    where β R is the asymmetry parameter, δ R is the scale parameter, μ R is the location, γ = α 2 β 2 , α R , ( α 2 > β 2 ) and K λ ( λ R ) denotes the modified Bessel function of the second kind.
  • Generalized Pareto (GP) distribution,
    ( 1 + ξ x μ σ ) ( 1 / ξ + 1 ) σ ( x > μ ) ,
    where ξ R is the shape parameter, σ R + is the scale parameter and μ R is the location.
  • Exponential distribution, obtained by the GP distribution (6) when ξ = μ = 0 .

3.3.3. Analysis on the Autocorrelation

Ljung-Box Q-Test

Generally, one can assess the presence of autocorrelation at a given lag by the sample autocorrelation function (ACF) and by the partial autocorrelation function (PACF). Among the more qualitative tests used to detect the autocorrelation, we adopt the Ljung-Box Q-test Ljung and Box (1978) and the ARCH test Engle (1982). The null hypothesis of the Ljung-Box Q-test is that the first m autocorrelations are jointly zero, i.e.,
H 0 : ρ 1 = ρ 2 = . . . = ρ m = 0 .
Hassani and Silva (2015); Hassani and Yeganegi (2019) warn on the sensitivity of the test to large values of m. In our case we avoid the problem by setting m = ln ( n ) , where n is the sample size. The Ljung-Box test statistics are given by
Q ( m ) = n ( n + 2 ) h = 1 m ρ h 2 n h ;
it follows a χ m 2 distribution.

ARCH Test

An uncorrelated time series can still be seriously dependent because of the dynamic conditional variance process. A time series whose squared residuals exhibit conditional heteroscedasticity or autocorrelation is said to have autoregressive conditional heteroscedastic (ARCH) effect. The ARCH test is a Lagrange multiplier test to assess the significance of ARCH effects. Under the assumption that the squared residuals e t 2 follow an AR(m) process, i.e.,
e t 2 = a 0 + h = 1 n a h e t h 2 + ε t ,
being ε t a white noise, the ARCH test null hypothesis becomes
H 0 : a 0 = a 1 = . . . = a m = 0 .
One way to choose m is to compare log-likelihood values for different choices of m, e.g., the likelihood ratio test or AIC-BIC information criteria.

3.3.4. Analysis on the Stationarity

KPSS Test

In order to understand whether returns follow a t-skew distribution, we investigate about the presence of unitary roots, i.e., about the absence of stationarity, and consequently the persistence of fat tail, that is in contrast with the normal distribution. Among all, we choose the Kwiatkowski, Phillips, Schmidt and Shin (KPSS) test, which assesses the null hypothesis that a (univariate) time series x t is trend stationary against the alternative that it is a nonstationary unit root process. This test uses the stochastic model
x t = y t + θ t + ε 1 , t y t = y t 1 + ε 2 , t ,
where θ is the trend coefficient, ε 1 , t is a stationary process and ε 2 , t is an independent and identically distributed process with mean 0 and variance σ 2 .
The null hypothesis is σ 2 = 0 , meaning that y t is a constant random walk. The alternative hypothesis is σ 2 > 0 that introduces the unit root in the random walk. The test statistic is
t = 1 n S t 2 ( τ n ) 2 ,
where S t is the partial sum of the (absolute) errors coming from the regression on x t , n is the sample size and τ 2 is the Newey–West estimate of the long-run variance.

Hassani Test

The Hassani’s-1/2 Theorem in Hassani (2009); Hassani and Yeganegi (2019) states that the sum of sample autocorrelation function for any stationary time series with arbitrary length n 2 and lag h 1 is
S A C F = h = 1 n 1 ρ h = 1 2 .
Note that ρ ( h ) for large h tends to one. In fact, if h = n 1 , there is only one sample. Therefore, a rule of thumb is not to evaluate ρ ( h ) for h > n / 3 .

4. Empirical Results

In this section, we report the statistical properties of the considered times series and then we perform some additional analysis ad detailed in Sec. on the original, averaged and volatility rescaled returns on the USBAAC, respectively. Further analyses for the remaining time series are reported in the Appendix A.

4.1. Statistical Properties and Analysis on Original Log Returns

With regard to statistical properties of the considered time series, Table 2 summarizes the moments. As expected, log returns are not normally distributed and they are heavily skewed. Moreover, Table 3 shows that in most cases there is statistical evidence to reject the null hypothesis of trend stationarity. Finally, for all time series considered S A C F = 0.5 .

USD Basis Swap 1Mv3M (USBAAC) Index: Original Data

In this section, we pick one of the first time series listed in Table 1 and we report the results we obtained from it. For sake of readability, the analyses on the remaining time series are available in Appendix A.
To understand the shape of the distribution of log returns, let us start with a visual inspection. Figure 1 shows how sampling may influence dramatically the resulting distribution.
As sampling is critical, for an investor with a month or a year horizon a consistent time horizon would be suitable. The next question, then, is to check between the normal distribution and t-skew which one fits better the data. Figure 2 and Figure 3 show the Q-Q plots on the monthly and yearly log-returns, respectively. In both cases, the t-skew performs better than the Gaussian with the exception of some outliers.
The last check on the normality is performed by comparing the empirical CDF versus standard normal CDF. Once more, Figure 4 shows that weekly, monthly and yearly log returns do not seem to be normally distributed. The alternative t-skew, instead, is more likely as shown in Table 4 across all considered distributions.
We earlier discussed the interconnectedness between sampling and distributions. We also mentioned that it would be worth to align the time horizon of an investment with sampling as those returns (and their distributions) are those relevant. The problem with yearly returns is that they may be affected by autocorrelation. To this end, one may resolve to the randomly shuffled. Figure 5 shows that, by shuffling the returns, the autocorrelation of yearly data looks similar to the one of weekly and monthly performance. Finally, Table 5 confirms the finding. As mentioned, in Section 3.3.3 the Ljung-Box test could be sensitive to large values of m. This is depicted in Table 5 where the response changes for weekly, monthly and yearly shuffled returns. Anyway, as we set m = ln ( n ) , we do not have this problem.
With regard to the selection of a specific distribution, Table 6 shows that the most indicated are the t-skew and the generalized hyperbolic while the Gaussian, the generalized Pareto and the exponential distributions do not seem to fit.

4.2. Statistical Properties and Analysis on Averaged Log Returns

In this section, instead of considering the punctual values as with the previous section, we carry out our analysis on monthly and yearly averages to see whether this changes the findings. To this end, the monthly returns are obtained by averaging 4 non overlapping weekly returns, and the yearly returns are obtained by averaging 12 non overlapping monthly returns. This should reduce the ARCH effect, the autocorrelation and should give better results in favour of normality tests. However, the results confirm a persistency of autocorrelation, well, as a better fit through the t-skew distribution; while the gain is limited only to the ARCH test.

USD Basis Swap 1Mv3M (USBAAC) Index: Averaged Log Returns

Table 7 tests the autocorrelation for the USBAAC averaged returns while Table 8 checks the best fit across the considered distributions. As the results do not differ substantially from those reported in Section 4.1, we do not repeat this analysis over the remaining time series.

4.3. Statistical Properties and Analysis on Volatility Rescaled Log Returns

The last check we performed is on time series rescaled by their volatilities. As mentioned, there are studies which claim that log returns are normally distributed when rescaled Andersen et al. (2001); Rogers (2018). In Table 9 and Table 10, we report the K-S test on the rescaled time series. As shown, there is little evidence supporting Gaussian distribution.

5. Conclusions

According to the tests carried out on our dataset, the distributions of log-returns do not seem to be normally distributed. The same applies on the returns standardized by the standard deviation. In a different context, Tiwari and Gupta (2019) found that the Jarque–Bera test strongly rejects the hypothesis of Gaussian distribution for all considered time series concerning G7 stock markets.
A more realistic work hypothesis is that time series follow a t-skew distribution. The t-skew distribution can be seen as a mixture of skew-normal distributions Kim (2001) which generalize the normal distribution thanks to an extra parameter regulating the skewness. By construction, then, they can model heavy tails and skews that are common in financial markets. Thus, their adoption in finance is gaining momentum for modeling distributions Harvey (2013) and risk Gao and Zhou (2016). Further, t-skew has the power to link-up with observation-driven models such as the dynamic conditional score (DCS) Creal et al. (2013) or based on data partitioning Orlando et al. (2019); Orlandoet al. (2020). This paper tries to help in gaining insights on returns’ distributions and on the most suitable way of fitting them. According to the empirical results we reported, the distributions that fit better the data are the t-skew and the hyperbolic Pareto. As the latter is more difficult to handle, this research suggests that the t-skew represents a suitable alternative. That is relevant in terms of policy implications because risk management or option pricing Mininni et al. (2020) should rely on models able to describe fat tails, skewed distributions and jumps in assets’ dynamics Orlando et al. (2018) rather than on Gaussian distributions that may underestimate the extremes (and leave the investors exposed to unexpected losses). For those reasons, regulators and financial institutions should pay particular attention to model risk (i.e., risk resulting from using insufficiently accurate models) when they choose a particular distribution.
Last but not least, t-skew models could be used as linkages between financial markets. To this end, Yoshiba (2018) provides a solution for computing the MLE and keeping the correlation matrix positive semi-definite during the optimization process.

Author Contributions

Conceptualization, G.O.; methodology, G.O.; software, G.O. and M.B.; validation, G.O. and M.B., formal analysis, G.O. and M.B.; investigation, G.O. and M.B.; resources, G.O. and M.B.; data curation, G.O. and M.B.; writing—original draft preparation, G.O.; writing—review and editing, G.O. and M.B.; visualization, G.O. and M.B.; supervision, G.O.; project administration, G.O. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Restrictions apply to the availability of these data. Data was obtained from Ice Data Indices and Bloomberg are available from the authors with the permission of Ice Data Indices and Bloomberg.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

In the following, we report the analysis we have performed on the indices from (b) through (g) of Table 1. Figure A1, Figure A6, Figure A11, Figure A16, Figure A21 and Figure A26 display log-returns histograms. Figure A2, Figure A7, Figure A12, Figure A17, Figure A22 and Figure A27 show monthly and yearly log-returns Q-Q plots. Figure A3, Figure A8, Figure A13, Figure A18, Figure A23 and Figure A28 exhibit yearly windowed and yearly windowed shuffled log-returns Q-Q plots. Figure A4, Figure A9, Figure A14, Figure A19, Figure A24 and Figure A29 show the empirical CDF versus the standard normal CDF. Figure A5, Figure A10, Figure A15, Figure A20, Figure A25 and Figure A30 display log-returns autocorrelations. Table A1, Table A3, Table A5, Table A7, Table A9 and Table A11 report the K-S test to detect the original distribution. Finally, Table A2, Table A4, Table A6, Table A8, Table A10 and Table A12 exhibit the Ljung-Box Q-test and ARCH test to detect autocorrelation.

Appendix A.1. Analysis on S&P 500 Index

Figure A1. S&P 500 log-returns histograms.
Figure A1. S&P 500 log-returns histograms.
Risks 09 00088 g0a1
Figure A2. S&P 500 monthly and yearly log-returns Q-Q plots.
Figure A2. S&P 500 monthly and yearly log-returns Q-Q plots.
Risks 09 00088 g0a2
Figure A3. S&P 500 yearly windowed and yearly windowed shuffled log-returns Q-Q plots.
Figure A3. S&P 500 yearly windowed and yearly windowed shuffled log-returns Q-Q plots.
Risks 09 00088 g0a3
Figure A4. Empirical CDF versus standard normal CDF for S&P 500 returns. The dotted black lines represent the DKW upper and lower bounds.
Figure A4. Empirical CDF versus standard normal CDF for S&P 500 returns. The dotted black lines represent the DKW upper and lower bounds.
Risks 09 00088 g0a4
Figure A5. S&P 500 log-returns autocorrelations.
Figure A5. S&P 500 log-returns autocorrelations.
Risks 09 00088 g0a5
Table A1. K-S test to detect the original distribution. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Table A1. K-S test to detect the original distribution. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Normalt-skewGen. HyperbolicGen. ParetoExp. Pareto
resp.p-ValueDKW Exceedsresp.p-Valueresp.p-Valueresp.p-Valueresp.p-Value
Weekly11.6502 × 10 19 74.92%00.048200.72561010
Monthly18.3615 × 10 9 58.32%00.198500.97141010
Yearly12.7038 × 10 30 68.55%18.6098 × 10 7 101010
Yearly Shuffled12.7038 × 10 30 68.55%18.6098 × 10 7 101010
Table A2. Ljung-Box Q-test and ARCH test to detect autocorrelation. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Table A2. Ljung-Box Q-test and ARCH test to detect autocorrelation. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Ljung-Box Q-TestARCH Test
m = ln ( n ) m = ( n 1 )
resp.p-Valueresp.p-Valueresp.p-Value
Weekly11.9720 × 10 5 1010
Monthly10.00311010
Yearly00.72341000.2308
Yearly Shuffled1000.930610

Appendix A.2. Analysis on Bovespa Index

Figure A6. Bovespa log-returns histograms.
Figure A6. Bovespa log-returns histograms.
Risks 09 00088 g0a6
Figure A7. Bovespa monthly and yearly log-returns Q-Q plots.
Figure A7. Bovespa monthly and yearly log-returns Q-Q plots.
Risks 09 00088 g0a7
Figure A8. Bovespa yearly windowed and yearly windowed shuffled log-returns Q-Q plots.
Figure A8. Bovespa yearly windowed and yearly windowed shuffled log-returns Q-Q plots.
Risks 09 00088 g0a8
Figure A9. Empirical CDF versus standard normal CDF for Bovespa returns. The dotted black lines represent the DKW upper and lower bounds.
Figure A9. Empirical CDF versus standard normal CDF for Bovespa returns. The dotted black lines represent the DKW upper and lower bounds.
Risks 09 00088 g0a9
Figure A10. Bovespa log-returns autocorrelations.
Figure A10. Bovespa log-returns autocorrelations.
Risks 09 00088 g0a10
Table A3. K-S test to detect the original distribution. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Table A3. K-S test to detect the original distribution. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Normalt-skewGen. HyperbolicGen. ParetoExp. Pareto
resp.p-ValueDKW Exceedsresp.p-Valueresp.p-Valueresp.p-Valueresp.p-Value
Weekly14.0876 × 10 19 69.62%00.787000.98701010
Monthly18.4152 × 10 7 31.55%00.667900.99771010
Yearly16.7479 × 10 103 81.43%13.7439 × 10 12 101010
Yearly Shuffled16.7479 × 10 103 81.43%13.7439 × 10 12 101010
Table A4. Ljung-Box Q-test and ARCH test to detect autocorrelation. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Table A4. Ljung-Box Q-test and ARCH test to detect autocorrelation. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Ljung-Box Q-TestARCH Test
m = ln ( n ) m = ( n 1 )
resp.p-Valueresp.p-Valueresp.p-Value
Weekly101010
Monthly101011.3679 × 10 4
Yearly101010
Yearly Shuffled00.893300.514700.7809

Appendix A.3. Analysis on US Corporate Index

Figure A11. US Corp. log-returns histograms.
Figure A11. US Corp. log-returns histograms.
Risks 09 00088 g0a11
Figure A12. US Corp. monthly and yearly log-returns Q-Q plots.
Figure A12. US Corp. monthly and yearly log-returns Q-Q plots.
Risks 09 00088 g0a12
Figure A13. US Corp. yearly windowed and yearly windowed shuffled log-returns Q-Q plots.
Figure A13. US Corp. yearly windowed and yearly windowed shuffled log-returns Q-Q plots.
Risks 09 00088 g0a13
Figure A14. Empirical CDF versus standard normal CDF for US Corp. returns. The dotted black lines represent the DKW upper and lower bounds.
Figure A14. Empirical CDF versus standard normal CDF for US Corp. returns. The dotted black lines represent the DKW upper and lower bounds.
Risks 09 00088 g0a14
Figure A15. US Corp. log-returns autocorrelations.
Figure A15. US Corp. log-returns autocorrelations.
Risks 09 00088 g0a15
Table A5. K-S test to detect the original distribution. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Table A5. K-S test to detect the original distribution. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Normalt-skewGen. HyperbolicGen. ParetoExp. Pareto
resp.p-ValueDKW Exceedsresp.p-Valueresp.p-Valueresp.p-Valueresp.p-Value
Weekly10.00630.73%00.320100.97481010
Monthly00.18770%00.996500.99001010
Yearly00.07120%00.0698101010
Yearly Shuffled00.07120%00.0698101010
Table A6. Ljung-Box Q-test and ARCH test to detect autocorrelation. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Table A6. Ljung-Box Q-test and ARCH test to detect autocorrelation. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Ljung-Box Q-TestARCH Test
m = ln ( n ) m = ( n 1 )
resp.p-Valueresp.p-Valueresp.p-Value
Weekly12.4980 × 10 14 1016.0678 × 10 4
Monthly00.707500.261514.8873 × 10 5
Yearly101010
Yearly Shuffled00.232500.558100.7833

Appendix A.4. Analysis on Emerging Markets Corporate Plus Index

Figure A16. EmMkt Corp. log-returns histograms.
Figure A16. EmMkt Corp. log-returns histograms.
Risks 09 00088 g0a16
Figure A17. EmMkt Corp. monthly and yearly log-returns Q-Q plots.
Figure A17. EmMkt Corp. monthly and yearly log-returns Q-Q plots.
Risks 09 00088 g0a17
Figure A18. EmMkt Corp. yearly windowed and yearly windowed shuffled log-returns Q-Q plots.
Figure A18. EmMkt Corp. yearly windowed and yearly windowed shuffled log-returns Q-Q plots.
Risks 09 00088 g0a18
Figure A19. Empirical CDF versus standard normal CDF for EmMkt Corp. returns. The dotted black lines represent the DKW upper and lower bounds.
Figure A19. Empirical CDF versus standard normal CDF for EmMkt Corp. returns. The dotted black lines represent the DKW upper and lower bounds.
Risks 09 00088 g0a19
Figure A20. EmMkt Corp. log-returns autocorrelations.
Figure A20. EmMkt Corp. log-returns autocorrelations.
Risks 09 00088 g0a20
Table A7. K-S test to detect the original distribution. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Table A7. K-S test to detect the original distribution. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Normalt-skewGen. HyperbolicGen. ParetoExp. Pareto
resp.p-ValueDKW Exceedsresp.p-valueresp.p-Valueresp.p-Valueresp.p-Value
Weekly18.3270 × 10 8 55.68%00.881400.9971010
Monthly10.001211.68%00.992700.99991010
Yearly15.8695 × 10 4 8.54%00.2340101010
Yearly Shuffled00.01380%00.3056101010
Table A8. Ljung-Box Q-test and ARCH test to detect autocorrelation. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Table A8. Ljung-Box Q-test and ARCH test to detect autocorrelation. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Ljung-Box Q-TestARCH Test
m = ln ( n ) m = ( n 1 )
resp.p-Valueresp.p-Valueresp.p-Value
Weekly101010
Monthly00.765200.997610.0018
Yearly101010
Yearly Shuffled00.60171000.9606

Appendix A.5. Analysis on 3-Month Treasury Constant Maturity Rate

Figure A21. DGS3M log-returns histograms.
Figure A21. DGS3M log-returns histograms.
Risks 09 00088 g0a21
Figure A22. DGS3M monthly and yearly log-returns Q-Q plots.
Figure A22. DGS3M monthly and yearly log-returns Q-Q plots.
Risks 09 00088 g0a22
Figure A23. DGS3M yearly windowed and yearly windowed shuffled log-returns Q-Q plots.
Figure A23. DGS3M yearly windowed and yearly windowed shuffled log-returns Q-Q plots.
Risks 09 00088 g0a23
Figure A24. Empirical CDF versus standard normal CDF for DGS3M returns. The dotted black lines represent the DKW upper and lower bounds.
Figure A24. Empirical CDF versus standard normal CDF for DGS3M returns. The dotted black lines represent the DKW upper and lower bounds.
Risks 09 00088 g0a24
Figure A25. DGS3M log-returns autocorrelations.
Figure A25. DGS3M log-returns autocorrelations.
Risks 09 00088 g0a25
Table A9. K-S test to detect the original distribution. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Table A9. K-S test to detect the original distribution. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Normalt-skewGen. HyperbolicGen. ParetoExp. Pareto
resp.p-ValueDKW Exceedsresp.p-Valueresp.p-Valueresp.p-Valueresp.p-Value
Weekly11.6567 × 10 20 47.48%11.7558 × 10 20 11.46 × 10 20 1010
Monthly11.0664 × 10 71 75.69%13.6783 × 10 7 101010
Yearly12.0644 × 10 286 79.83%12.2533 × 10 26 101010
Yearly Shuffled13.0634 × 10 297 80.24%11.3943 × 10 28 101010
Table A10. Ljung-Box Q-test and ARCH test to detect autocorrelation. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Table A10. Ljung-Box Q-test and ARCH test to detect autocorrelation. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Ljung-Box Q-TestARCH Test
m = ln ( n ) m = ( n 1 )
resp.p-Valueresp.p-Valueresp.p-Value
Weekly101010
Monthly12.4962 × 10 4 1010.0283
Yearly101010
Yearly Shuffled13.0097 × 10 5 00.696610

Appendix A.6. Analysis on 10-Year Treasury Constant Maturity Rate

Figure A26. DGS10Y log-returns histograms.
Figure A26. DGS10Y log-returns histograms.
Risks 09 00088 g0a26
Figure A27. DGS10Y monthly and yearly log-returns Q-Q plots.
Figure A27. DGS10Y monthly and yearly log-returns Q-Q plots.
Risks 09 00088 g0a27
Figure A28. DGS10Y yearly windowed and yearly windowed shuffled log-returns Q-Q plots.
Figure A28. DGS10Y yearly windowed and yearly windowed shuffled log-returns Q-Q plots.
Risks 09 00088 g0a28
Figure A29. Empirical CDF versus standard normal CDF for DGS10Y returns. The dotted black lines represent the DKW upper and lower bounds.
Figure A29. Empirical CDF versus standard normal CDF for DGS10Y returns. The dotted black lines represent the DKW upper and lower bounds.
Risks 09 00088 g0a29
Figure A30. DGS10Y log-returns autocorrelations.
Figure A30. DGS10Y log-returns autocorrelations.
Risks 09 00088 g0a30
Table A11. K-S test to detect the original distribution. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Table A11. K-S test to detect the original distribution. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Normalt-skewGen. HyperbolicGen. ParetoExp. Pareto
resp.p-ValueDKW Exceedsresp.p-Valueresp.p-Valueresp.p-Valueresp.p-Value
Weekly11.2613 × 10 19 74.57%00.707500.94041010
Monthly12.3517 × 10 6 39.47%00.9336700.97751010
Yearly13.2079 × 10 7 41.77%10.0020101010
Yearly Shuffled13.2079 × 10 7 41.77%10.0020101010
Table A12. Ljung-Box Q-test and ARCH test to detect autocorrelation. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Table A12. Ljung-Box Q-test and ARCH test to detect autocorrelation. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Ljung-Box Q-TestARCH Test
m = ln ( n ) m = ( n 1 )
resp.p-Valueresp.p-Valueresp.p-Value
Weekly1000.987910
Monthly10.024800.998700.0721
Yearly101010
Yearly Shuffled00.22941000.5526

References

  1. Aas, Kjersti, Claudia Czado, Arnoldo Frigessi, and Henrik Bakken. 2009. Pair-copula constructions of multiple dependence. Insurance: Mathematics and Economics 44: 182–98. [Google Scholar] [CrossRef] [Green Version]
  2. Andersen, Torben G., Tim Bollerslev, Francis X Diebold, and Heiko Ebens. 2001. The distribution of realized stock return volatility. Journal of Financial Economics 61: 43–76. [Google Scholar] [CrossRef]
  3. Ang, Andrew, and Joseph Chen. 2002. Asymmetric correlations of equity portfolios. Journal of financial Economics 63: 443–94. [Google Scholar] [CrossRef]
  4. Ayala, Astrid, and Szabolcs Blazsek. 2019. Score-driven models of stochastic seasonality in location and scale: An application case study of the indian rupee to usd exchange rate. Applied Economics 51: 4083–103. [Google Scholar] [CrossRef]
  5. Azzalini, Adelchi. 1985. A class of distributions which includes the normal ones. Scandinavian Journal of Statistics 12: 171–8. [Google Scholar]
  6. Babii, Andrii, Xi Chen, and Eric Ghysels. 2019. Commercial and residential mortgage defaults: Spatial dependence with frailty. Journal of Econometrics 212: 47–77. [Google Scholar] [CrossRef]
  7. Blazsek, Szabolcs, and Adrian Licht. 2020. Dynamic conditional score models: A review of their applications. Applied Economics 52: 1181–99. [Google Scholar] [CrossRef]
  8. Creal, Drew, Siem Jan Koopman, and André Lucas. 2013. Generalized autoregressive score models with applications. Journal of Applied Econometrics 28: 777–95. [Google Scholar] [CrossRef] [Green Version]
  9. Dvoretzky, Aryeh, Jack Kiefer, and Jacob Wolfowitz. 1956. Asymptotic minimax character of the sample distribution function and of the classical multinomial estimator. The Annals of Mathematical Statistics 27: 642–69. [Google Scholar] [CrossRef]
  10. Eini, Esmat Jamshidi, and Hamid Khaloozadeh. 2020. Tail variance for generalized skew-elliptical distributions. Communications in Statistics-Theory and Methods, 1–18. [Google Scholar] [CrossRef]
  11. Engle, Robert F. 1982. Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation. Econometrica: Journal of the Econometric Society 50: 987–1007. [Google Scholar] [CrossRef]
  12. Gao, Chun-Ting, and Xiao-Hua Zhou. 2016. Forecasting VaR and ES using dynamic conditional score models and skew Student distribution. Economic Modelling 53: 216–23. [Google Scholar] [CrossRef]
  13. Gong, Xiao-Li, Xi-Hua Liu, and Xiong Xiong. 2019. Measuring tail risk with GAS time varying copula, fat tailed GARCH model and hedging for crude oil futures. Pacific-Basin Finance Journal 55: 95–109. [Google Scholar] [CrossRef]
  14. Harvey, Andrew C. 2013. Dynamic Models for Volatility and Heavy Tails: With Applications to Financial and Economic Time Series. Cambridge: Cambridge University Press, vol. 52. [Google Scholar]
  15. Hassani, Hossein. 2009. Sum of the sample autocorrelation function. Random Operators and Stochastic Equations 17: 125–30. [Google Scholar] [CrossRef]
  16. Hassani, Hossein, and Emmanuel Sirimal Silva. 2015. A kolmogorov-smirnov based test for comparing the predictive accuracy of two sets of forecasts. Econometrics 3: 590–609. [Google Scholar] [CrossRef] [Green Version]
  17. Hassani, Hossein, and Mohammad Reza Yeganegi. 2019. Sum of squared acf and the ljung–box statistics. Physica A: Statistical Mechanics and Its Applications 520: 81–86. [Google Scholar] [CrossRef]
  18. Henze, Norbert. 1986. A probabilistic representation of the ’skew-normal’ distribution. Scandinavian Journal of Statistics 13: 271–5. [Google Scholar]
  19. Kim, Hea Jung. 2001. On a skew-t distribution. CSAM (Communications for Statistical Applications and Methods) 8: 867–73. [Google Scholar]
  20. Kolmogorov, Andrey. 1933. Sulla determinazione empirica di una legge di distribuzione. Giornale Istituto Italiano Attuari 4: 83–91. [Google Scholar]
  21. Lavielle, Marc, and Gilles Teyssiere. 2006. Detection of multiple change-points in multivariate time series. Lithuanian Mathematical Journal 46: 287–306. [Google Scholar] [CrossRef] [Green Version]
  22. Ljung, Greta M, and George EP Box. 1978. On a measure of lack of fit in time series models. Biometrika 65: 297–303. [Google Scholar] [CrossRef]
  23. Longin, Francois, and Bruno Solnik. 2001. Extreme correlation of international equity markets. The Journal of Finance 56: 649–76. [Google Scholar] [CrossRef]
  24. Martin, R Douglas, Chindhanai Uthaisaad, and Daniel Z Xia. 2020. Skew-t expected information matrix evaluation and use for standard error calculations. The R Journal 12: 188–205. [Google Scholar] [CrossRef]
  25. McNeil, Alexander J, Rüdiger Frey, and Paul Embrechts. 2015. Quantitative Risk Management: Concepts, Techniques and Tools-Revised Edition. Princeton: Princeton University Press. [Google Scholar]
  26. Mininni, Michele, Giuseppe Orlando, and Giovanni Taglialatela. 2020. Challenges in approximating the Black and Scholes call formula with hyperbolic tangents. Decisions in Economics and Finance, 1–28. [Google Scholar] [CrossRef]
  27. Nikoloulopoulos, Aristidis K, Harry Joe, and Haijun Li. 2012. Vine copulas with asymmetric tail dependence and applications to financial return data. Computational Statistics & Data Analysis 56: 3659–73. [Google Scholar]
  28. Orlando, Giuseppe, Rosa Maria Mininni, and Michele Bufalo. 2019. Interest rates calibration with a CIR model. The Journal of Risk Finance 20: 370–87. [Google Scholar] [CrossRef]
  29. Orlando, Giuseppe, Rosa Maria Mininni, and Michele Bufalo. 2020. Forecasting interest rates through Vasicek and CIR models: A partitioning approach. Journal of Forecasting 39: 569–79. [Google Scholar] [CrossRef] [Green Version]
  30. Orlando, Giuseppe, Rosa Maria Mininni, and Michele Bufalo. 2018. A new approach to CIR short-term rates modelling. In New Methods in Fixed Income Modeling - Fixed Income Modeling. New York: Springer, pp. 35–44. [Google Scholar]
  31. Patton, Andrew J. 2006. Modelling asymmetric exchange rate dependence. International Economic Review 47: 527–56. [Google Scholar] [CrossRef]
  32. Rogers, L. C. G. 2018. Sense, nonsense and the S&P500. Decisions in Economics and Finance 41: 447–61. [Google Scholar]
  33. Stephens, Michael A. 1992. Introduction to Kolmogorov (1933) on the empirical determination of a distribution. In Breakthroughs in Statistics. New York: Springer, pp. 93–105. [Google Scholar]
  34. Tiwari, Aviral Kumar, and Rangan Gupta. 2019. Chaos in G7 stock markets using over one century of data: A note. Research in International Business and Finance 47: 304–10. [Google Scholar] [CrossRef]
  35. Tucker, Howard G. 1959. A generalization of the Glivenko-Cantelli theorem. The Annals of Mathematical Statistics 30: 828–30. [Google Scholar] [CrossRef]
  36. Tumminello, Michele, Tiziana Di Matteo, Tomaso Aste, and Rosario N Mantegna. 2007. Correlation based networks of equity returns sampled at different time horizons. The European Physical Journal B 55: 209–17. [Google Scholar] [CrossRef]
  37. Wilk, Martin B, and Ram Gnanadesikan. 1968. Probability plotting methods for the analysis for the analysis of data. Biometrika 55: 1–17. [Google Scholar] [CrossRef]
  38. Yeap, Claudia, S. T. Boris Choy, and S. Simon Kwok. 2018. The skew-t option pricing model. In International Econometric Conference of Vietnam. Cham: Springer, pp. 309–26. [Google Scholar]
  39. Yoshiba, Toshinao. 2018. Maximum likelihood estimation of skew-t copulas with its applications to stock returns. Journal of Statistical Computation and Simulation 88: 2489–506. [Google Scholar] [CrossRef]
Figure 1. USBAAC log-returns histograms. Weekly, monthly and yearly sampling generates different distributions. In terms of distributions, there is no difference between shuffled and not shuffled yearly log returns.
Figure 1. USBAAC log-returns histograms. Weekly, monthly and yearly sampling generates different distributions. In terms of distributions, there is no difference between shuffled and not shuffled yearly log returns.
Risks 09 00088 g001
Figure 2. USBAAC weekly and monthly log-returns Q-Q plots. This is the graphic representation of distribution quantiles comparing the CDF of the observed time series, which is unknown, a priori, with that of a specified distribution, chosen as benchmark. If the observed variable follows the theoretical distribution chosen, the Q-Q plot thickens across the line that connects the first and third quantiles of the data.
Figure 2. USBAAC weekly and monthly log-returns Q-Q plots. This is the graphic representation of distribution quantiles comparing the CDF of the observed time series, which is unknown, a priori, with that of a specified distribution, chosen as benchmark. If the observed variable follows the theoretical distribution chosen, the Q-Q plot thickens across the line that connects the first and third quantiles of the data.
Risks 09 00088 g002
Figure 3. USBAAC yearly and yearly randomly shuffled log-returns Q-Q plots. In both cases, distributions do not look Gaussian.
Figure 3. USBAAC yearly and yearly randomly shuffled log-returns Q-Q plots. In both cases, distributions do not look Gaussian.
Risks 09 00088 g003
Figure 4. Empirical CDF versus standard normal CDF for USBAAC returns. The dotted black lines represent the DKW upper and lower bounds.
Figure 4. Empirical CDF versus standard normal CDF for USBAAC returns. The dotted black lines represent the DKW upper and lower bounds.
Risks 09 00088 g004
Figure 5. USBAAC log-returns autocorrelations.
Figure 5. USBAAC log-returns autocorrelations.
Risks 09 00088 g005
Table 1. Dataset.
Table 1. Dataset.
IndexCodeDescriptionAsset ClassMarketTime Frame
aUSBAACUSD Basis Swap 1Mv3MSwapDeveloped12 February 2007–30 March 2020
bSPXS&P 500EquityDeveloped30 December 1927–29 May 2020
cIBOVBovespaEquityEmerging5 January 1927–29 May 2020
dBAMLCC0A2AATRIVAA US Corp.TRBond CorporateDeveloped23 December 1988–29 May 2020
eBAMLEM1BRRAAA2ACRPITRIVAAA-A Em. Mkt Corp TRBond CorporateEmerging8 January 1999–29 May 2020
fDGS3MO3-M Treasury Const. MtyBond GovernmentDeveloped11 January 1982–25 May 2020
gDGS1010-Y Treasury Const. MtyBond GovernmentDeveloped8 January 1962–25 May 2020
a: USD Basis Swap 1Mv3M (Bloomberg ticker USBAAC) returns which is a swapping 1 month (reference index US0001M) versus 3 months (reference index US0003M), taken from 12 February 2007 to 30 March 2020; b: S&P 500 index returns, taken from 30 December 1927 to 29 May 2020; c: Bovespa index returns, taken from 5 January 1990 to 29 May 2020; d: ICE BofA AA US Corporate Index Total Return Index Value [BAMLCC0A2AATRIV], taken from 23 December 1988 to 29 May 2020; e: ICE BofA AAA-A Emerging Markets Corporate Plus Index Total Return Index Value [BAMLEM1BRRAAA2ACRPITRIV], taken from 8 January 1999 to 29 May 2020; f: 3-Month Treasury Constant Maturity Rate [DGS3MO], taken from 11 January 1982 to 25 May 2020; g: 10-Year Treasury Constant Maturity Rate [DGS10], taken from 8 January 1962 to 25 May 2020.
Table 2. Statistics on returns.
Table 2. Statistics on returns.
Statistical Characteristics of Weekly Returns
Mom.\Des.USD Swap 1Mv3MS&P 500BovespaAA US Corp.TREm Mk3m Tbill10Y Tbond
St. Dev.0.04800.02500.06160.00540.00510.00010.0262
Mean−7.0055 × 10 5 0.00110.00920.00120.00120.0001−0.0006
Kurtosis27.48409.648919.757119.485123.26412.709926.0274
Skew0.3850−0.61351.5259−1.3492−2.28820.5302−1.3782
Statistical Characteristics of Monthly Returns
Mom.\Des.USD Swap 1Mv3MS&P 500BovespaAA US Corp.TREm Mk3m Tbill10Y Tbond
St. Dev.0.07370.05400.13020.01250.01443.4106 × 10−40.0591
Mean3.3382 × 10 4 0.00420.03550.00490.00474.2310 × 10−4−0.0023
Kurtosis8.304312.47368.75925.735439.03412.678825.9171
Skew−0.6069−0.44241.0511−0.3696−3.72330.52482−2.1663
Statistical Characteristics of Yearly Returns
Mom.\Des.USD Swap 1Mv3MS&P 500BovespaAA US Corp.TREm Mk3m Tbill10Y Tbond
St. Dev.0.23860.19540.98150.05040.05050.00400.2029
Mean−0.00460.05260.44550.06020.05880.0052−0.0170
Kurtosis6.38416.99688.35392.57085.25102.03597.7485
Skew0.8520−1.15892.4618−0.03470.10580.3025−1.0258
Table 3. KPSS test to assess if the time series are trend stationary against the alternative of a unit root. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Table 3. KPSS test to assess if the time series are trend stationary against the alternative of a unit root. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
USBAACS&P 500BovespaUS Corp.EmMkt Corp.DGS3MDGS10Y
resp.p-Valueresp.p-Valueresp.p-Valueresp.p-Valueresp.p-Valueresp.p-Valueresp.p-Value
Weekly10.0048101010101010
Monthly10.004610101000.04051010
Yearly00.666100.732100.076710101010
Yearly Shuffled10101010101010
Table 4. K-S test to detect the original distribution. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Table 4. K-S test to detect the original distribution. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Normalt-SkewGen. HyperbolicGen. ParetoExp. Pareto
resp.p-ValueDKW Exceedsresp.p-Valueresp.p-Valueresp.p-Valueresp.p-Value
Weekly11.8548 × 10 25 75.19%00.761400.71151010
Monthly16.0526 × 10 5 30.01%00.875000.86831010
Yearly17.1011 × 10 11 45.09%00.0376101010
Yearly Shuffled17.1011 × 10 11 45.09%00.0376101010
Table 5. Ljung-Box Q-test and ARCH test to detect autocorrelation. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Table 5. Ljung-Box Q-test and ARCH test to detect autocorrelation. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Ljung-Box Q-TestARCH Test
m = ln ( n ) m = ( n 1 )
resp.p-Valueresp.p-Valueresp.p-Value
Weekly11.3188 × 10 8 00.928211.5504 × 10 12
Monthly10.012700.969315.0823 × 10 6
Yearly101010
Yearly Shuffled10.014900.990000.6897
Table 6. K-S test to detect the original distribution. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Table 6. K-S test to detect the original distribution. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Normalt-skewGen. HyperbolicGen. ParetoExp. Pareto
resp.p-ValueDKW Exceedsresp.p-Valueresp.p-Valueresp.p-Valueresp.p-Value
Weekly11.6502 × 10 19 74.92%00.048200.72561010
Monthly18.3615 × 10 9 58.32%00.198500.97141010
Yearly12.7038 × 10 30 68.55%18.6098 × 10 7 101010
Yearly Shuffled12.7038 × 10 30 68.55%18.6099 × 10 7 101010
Table 7. Ljung-Box Q-test and ARCH test to detect autocorrelation for the average series. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Table 7. Ljung-Box Q-test and ARCH test to detect autocorrelation for the average series. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Ljung-Box Q-TestARCH Test
m = ln ( n ) m = ( n 1 )
resp.p-Valueresp.p-Valueresp.p-Value
Monthly11.3188 × 10 8 00.965400.4820
Yearly10.007100.718400.6229
Table 8. K-S test to detect the original distribution for the average series. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Table 8. K-S test to detect the original distribution for the average series. The response is a boolean where 0 indicates that there is no evidence to reject the null hypothesis, and the value 1 is the opposite case.
Normalt-skewGen. HyperbolicGen. ParetoExp. Pareto
resp.p-ValueDKW Exceedsresp.p-Valueresp.p-Valueresp.p-Valueresp.p-Value
Av. Monthly10.000375.19%00.870600.04141010
Av. Yearly00.538224.43%00.996100.07521010
Table 9. K-S test to detect the normality of the rescaled returns when ( K , β ) = ( 4 , 0.025 ) (see Rogers (2018)).
Table 9. K-S test to detect the normality of the rescaled returns when ( K , β ) = ( 4 , 0.025 ) (see Rogers (2018)).
Normal
IndexSamplingresp.p-ValueDKW Exceeds
USBAACWeekly15.0443 × 10 15 67.16%
Monthly15.3363 × 10 4 25.74%
Yearly11.0622 × 10 7 28.05%
Yearly Shuffled12.2856 × 10 11 54.27%
S&P 500Weekly12.8341 × 10 9 57.24%
Monthly11.3090 × 10 5 46.72%
Yearly13.3137 × 10 40 70.12%
Yearly Shuffled15.0050 × 10 30 69.52%
BovespaWeekly15.7367 × 10 4 16.51%
Monthly00.02290%
Yearly15.2297 × 10 7 33.15%
Yearly Shuffled17.6801 × 10 104 80.69%
US Corp.Weekly00.01990%
Monthly00.12360%
Yearly18.4622 × 10 7 34.55%
Yearly Shuffled00.11650%
EmMkt Corp.Weekly11.2991 × 10 5 42.61%
Monthly18.7935 × 10 4 8.27%
Yearly13.2051 × 10 7 32.73%
Yearly Shuffled00.08880%
DGS3MWeekly15.5608 × 10 10 51.97%
Monthly11.0300 × 10 119 86.22%
Yearly11.1038 × 10 62 75.85%
Yearly Shuffled18.6683 × 10 122 89.42%
DGS10YWeekly00.01010%
Monthly10.00381.19%
Yearly11.0023 × 10 6 44.23%
Yearly Shuffled12.0652 × 10 7 53.54%
Table 10. K-S test to detect the normality of the rescaled returns with ( K * , β * ) (see Equation (4)).
Table 10. K-S test to detect the normality of the rescaled returns with ( K * , β * ) (see Equation (4)).
Normal
IndexSamplingresp.p-ValueDKW Exceeds K * β *
USBAACWeekly11.5841 × 10 5 33.76%2.01660.3653
Monthly10.00286.62%12.98210.3374
Yearly10.00631.22%6.70900.1032
Yearly Shuffled12.1026 × 10 10 40.24%7.67920.0053
S&P 500Weekly11.0061 × 10 7 44.27%8.32210.1905
Monthly12.0385 × 10 5 32.34%9.52460.1070
Yearly11.2624 × 10 24 57.09%5.51530.0050
Yearly Shuffled11.0678 × 10 27 73.56%8.59800.0393
BovespaWeekly10.00298.89%6.79900.1540
Monthly00.02600%1.13960.1014
Yearly00.53260%5.26120.0116
Yearly Shuffled11.3433 × 10 98 80.01%3.93410.0293
US Corp.Weekly00.06260%9.63000.1515
Monthly00.21020%7.89130.0015
Yearly00.42530%5.57810.0040
Yearly Shuffled00.08860%2.38420.0948
EmMkt Corp.Weekly10.00165.95%11.71890.2936
Monthly00.01090%4.60680.3794
Yearly11.3835 × 10 4 22.16%5.99000.0044
Yearly Shuffled00.02340%4.53040.0279
DGS3MWeekly12.3686 × 10 10 49.58%4.28620.0244
Monthly11.1081 × 10 29 64.83%5.04890.0161
Yearly11.4193 × 10 67 70.98%33.70530.0185
Yearly Shuffled12.9770 × 10 48 73.56%62.43700.1358
DGS10YWeekly00.26690%7.47240.1314
Monthly10.00600.26%7.67850.1516
Yearly10.00242.54%5.32050.0047
Yearly Shuffled14.2387 × 10 7 40.33%10.03820.0009
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Orlando, G.; Bufalo, M. Empirical Evidences on the Interconnectedness between Sampling and Asset Returns’ Distributions. Risks 2021, 9, 88. https://0-doi-org.brum.beds.ac.uk/10.3390/risks9050088

AMA Style

Orlando G, Bufalo M. Empirical Evidences on the Interconnectedness between Sampling and Asset Returns’ Distributions. Risks. 2021; 9(5):88. https://0-doi-org.brum.beds.ac.uk/10.3390/risks9050088

Chicago/Turabian Style

Orlando, Giuseppe, and Michele Bufalo. 2021. "Empirical Evidences on the Interconnectedness between Sampling and Asset Returns’ Distributions" Risks 9, no. 5: 88. https://0-doi-org.brum.beds.ac.uk/10.3390/risks9050088

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop