Next Article in Journal
Applying Fixed Point Techniques to Stability Problems in Intuitionistic Fuzzy Banach Spaces
Next Article in Special Issue
A Novel Methodology to Calculate the Probability of Volatility Clusters in Financial Series: An Application to Cryptocurrency Markets
Previous Article in Journal
Performance Evaluation of Deep Learning-Based Gated Recurrent Units (GRUs) and Tree-Based Models for Estimating ETo by Using Limited Meteorological Variables
Previous Article in Special Issue
Detection of Near-Nulticollinearity through Centered and Noncentered Regression
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Proposal to Fix the Number of Factors on Modeling the Dynamics of Futures Contracts on Commodity Prices  †

by
Andrés García-Mirantes
1,
Beatriz Larraz
2,* and
Javier Población
3
1
IES Juan del Enzina, 24001 Leon, Spain
2
Statistics Department/Faculty of Law and Social Sciences, Universidad de Castilla-La Mancha, 45071 Toledo, Spain
3
Banco de España, 28014 Madrid, Spain
*
Author to whom correspondence should be addressed.
This paper should not be reported as representing the views of the Banco de España (BdE) or European Central Bank (ECB). The views expressed herein are those of the authors and should not be attributed to the BdE or ECB.
Submission received: 8 May 2020 / Revised: 10 June 2020 / Accepted: 11 June 2020 / Published: 14 June 2020
(This article belongs to the Special Issue Quantitative Methods for Economics and Finance)

Abstract

:
In the literature on modeling commodity futures prices, we find that the stochastic behavior of the spot price is a response to between one and four factors, including both short- and long-term components. The more factors considered in modeling a spot price process, the better the fit to observed futures prices—but the more complex the procedure can be. With a view to contributing to the knowledge of how many factors should be considered, this study presents a new way of computing the best number of factors to be accounted for when modeling risk-management of energy derivatives. The new method identifies the number of factors one should consider in the model and the type of stochastic process to be followed. This study aims to add value to previous studies which consider principal components by assuming that the spot price can be modeled as a sum of several factors. When applied to four different commodities (weekly observations corresponding to futures prices traded at the NYMEX for WTI light sweet crude oil, heating oil, unleaded gasoline and Henry Hub natural gas) we find that, while crude oil and heating oil are satisfactorily well-modeled with two factors, unleaded gasoline and natural gas need a third factor to capture seasonality.

1. Introduction

Forecasting is not a highly regarded activity for economists and financiers. For some, it evokes images of speculators, chart analysts and questionable investor newsletters. For others, there are memories of the grandiose econometric forecasting failures of the 1970’s. Nevertheless, there is a need for forecasting in risk management. A prudent corporate treasurer or fund manager must have some way of measuring the risk of earnings, cash flows or returns. Any measure of risk must incorporate some estimate of the probability distribution of the futures asset prices on which financial performance depends. Consequently, forecasting is an indispensable element of prudent financial management.
When a company is planning to develop a crude oil or natural gas field, the investment is significant, and production usually lasts many years. However, there must be an initial investment for there to be any return (see, for example, [1,2], among others). Assuming that futures values are not known after a certain date because there is no trade, it makes it difficult to measure the risk of these projects. Since commodities (crude oil, gas, gasoline, etc.) are physical assets, their price dynamic is much more complex than financial assets because their prices are affected by storage and transportation cost (cost of carry). Due to such complexity, in order to model this price dynamic we need factor models such as in [3,4,5,6,7,8,9]. In addition, in the transport sector [10] and [11] use different factor models for modeling bulk shipping prices and freight prices.
In order to measure exposure to price risk due to a single underlying asset, it is necessary to know the dynamics of the term structure of asset prices. Specifically, the value-at-risk (VaR, [12]) of the underlying asset price, the most widely known measure of market risk [13], is characterized by knowing the stochastic dynamic of the price, the volatility of the price and the correlation of different prices at different times. For these reasons, to date, the behavior of commodity prices has been modeled under the assumption that the spot price and/or the convenience yield of the commodity follow a stochastic process.
In the literature we find that the spot price is considered as the sum of both short-term and long-term components (see, for example, [14,15]). Short-term factors account for the mean reverting components in commodity prices, while long-term factors account for the long-term dynamics of commodity prices, assuming they follow a random walk. Sometimes a deterministic seasonal component needs to be added [16].
Following this approach, some multifactor models have been proposed in the literature. Focusing on the number of factors initially considered, [17] developed a two-factor model to value oil-linked assets. Later, [14] planned a one-factor model, two-factor model and a three-factor model, adding stochastic interest rates to the previous factors. This was superseded by a new formulation which appeared in [15], enhancing the latter article and developing a short-term/long-term model. [18] added the long-term spot price return as a third risk factor. Finally, [19] offered researchers a general N-factor model.
At this point, it should be stressed that the decision regarding the number of factors to be used in the model needs to be made a priori. According to the above literature consulted, the models are usually planned with two, three or four factors. However, in this study, the need to assume a fixed number of factors in the model is discounted. We propose a new method that identifies the number of factors one should consider in the model and the type of stochastic process to be followed. This method avoids the necessity of inaccurately suggesting a concrete number of factors in the model. This is very useful for researchers and practitioners because the optimal number of factors could change, depending on the accuracy needed in each problem. Clearly, if we do not use the optimal number of factors in modeling the commodity price dynamics, the results will not be optimal.
To the best of our knowledge, there are three previous studies applying principal component analysis [20] to the modeling of commodity futures price dynamics [21,22,23]. However, they only model the futures prices dynamic and ignore the dynamic followed by the spot price and, consequently carrying the risk of being incoherent, since futures price are the spot price expected value under the Q measure.
This study aims to add value to previous contributions by assuming that is the spot price can be modeled as a sum of several factors (long term and short term, seasonality, etc.). Therefore, since it is widely accepted (see, for example, [24]) that the futures price is the spot price expected value under the Q measure ( F t , T = E * [ S t + T | I t ] ), where S t + T is the spot price at time t + T , I t is the information available at time t and E * [   ] is the expected value under the Q measure.), from the variance–covariance matrix of the futures prices we can deduce the best structure for modelling the spot prices dynamic.
The remainder of this study is organized as follows. Section 2 presents a general theoretical model and explains the methodology proposed to set an optimal set of factors. In Section 3, we describe the datasets used to show the methodology and these results are described. Finally, Section 4 sets out the conclusions.

2. Theoretical Model

2.1. Theoretical Model

In the main literature to date (for example, [19]), it is assumed that the commodity log spot price is the sum of several stochastic factors: S t = exp ( C X t ) , t = 0 , , n where the vector of state variables X t = ( x 1 t , , x N t ) follows the process: d X t = Μ d t + A X t d t + R d W t , being C , M , A and R vectors’ and matrices’ parameters.
It is widely accepted that, for the model to be identifiable, some restrictions must be imposed. This means that if we assume that A is diagonalizable and all its eigenvalues are real (a different formula is available if some are complex), we can take C = ( 1 , , 1 ) , M = ( μ , 0 , , 0 ) and A = ( 0 0 0 0 k 2 0 0 0 k N ) , with k i , i = 1 , , N the eigenvalues and k 1 = 0 , by simply changing the state space basis. Therefore, we already have M , A and C .
It is also easy to prove that as d W t is a N × 1 vector of correlated Brownian motion increments, R can be assumed as R = ( σ 1 0 0 0 σ 2 0 0 0 σ N ) . Note that R is not important, but the product R R is what appears in all formulae. In fact, it can be proved that any factorization of R R corresponds to a different definition of the noise, so we can safely take R as any Choleski factorization of ( R R ). In the Black–Scholes world (risk-neutral world), knowing the real dynamics, the risk neutral one is d X t = M * d t + A X t d t + R d W t * where M * = M λ being λ = ( λ 1 , λ 2 , , λ N ) the vector formed from each state variable’s risk premium).
Following [25], the futures price is given by F t , T = exp ( g ( T ) + C e A T X t ) , where we know explicitly g ( T ) = C 0 T e A ( T s ) M * d s + C ( 0 T e A ( T s ) R R ( e A ( T s ) ) d s ) C and where both g ( T ) and C ( T ) = e A T are known deterministic functions independent of t and X t is a stochastic process with known dynamics.
Defining in a more compact form, we have:
{ d X t = ( M + A X t ) d t + R d W t F t , T = exp [ δ ( T ) + ϕ ( T ) X t + φ ( T ) M * + ε t , T ]

2.2. A General Procedure to Determine the Stochastic Factors

In the previous subsection, we have presented the general model for characterizing the commodity price dynamics based on the assumption that the log commodity spot price is the sum of several factors. However, to the best of the authors’ knowledge, the optimal number of stochastic factors has not yet been studied, for these models.
This subsection presents a theoretical procedure to establish the optimal number of factors. It also presents a way to determine how those factors should be aligned (long-term, short-term, seasonal, etc.).
To address this problem, let us suppose that there are M futures maturities and n observations of the forward curve, that is, the matrix U = log ( F t , T i ) ,   t = 0 , , n ;   i = 1 , , M has dimension M × ( n + 1 ) . We further assume, as usual, that n M . To determine the optimal number of stochastic factors needed to characterize the commodity price dynamic in the best way, first we must realize that the number of factors is equal to r a n k ( R ) and, from the previous expression, r a n k ( R ) has to be equal to the rank of the variance–covariance matrix of U . If, as usual, the process X t has a unit root, so it is non-stationary and the variance and covariances are infinity, we need another matrix to determine the rank of the variance–covariance matrix of U .
If we define volatility (instantaneous variance) as σ T i 2 = lim h 0 V a r ( log F t + h , T i log F t , T i ) h , i = 1 , , M and cross-volatility (instantaneous covariance) as σ T i , T j = lim h 0 C o v ( log F t + h , T i log F t , T i , log F t + h , T j log F t , T j ) h , i , j = 1 , , M (as expected, σ T i 2 = σ T i , T j ), we have the necessary matrix. Although we cannot compute the limit from the data, we can set h as the shortest time period available and estimate it directly as σ ^ T i , T j = [ C ^ o v ( log F t + h , T i log F t , T i , log F t + h , T j log F t , T j ) h ] , where C ^ o v is the sample covariance.
We thus define the matrix Θ = ( Θ i j ) (dim M × M ) as Θ i j = σ T i , T j , i , j = 1 , , M . We can estimate it directly from our database and we can also estimate its rank. Once we have this rank, as stated above r a n k ( Θ ) = r a n k ( R ) = N , we know the number of stochastic factors (N) that define the commodity price dynamics.
From a practical point of view, however, if we follow this procedure as explained above, unless one futures maturity is a linear combination of the rest (which is not likely), we obtain r a n k ( Θ ) = r a n k ( R ) = N . Nevertheless, the weights of these factors are going to be different and most of them will have an insignificant weight.
Fortunately, from this procedure, we can also estimate the eigenvalues k 1 , , k N and, from there, determine the factor weight through the eigenvalues’ relative weight. We can estimate the eigenvalues of A via a nonlinear search procedure by using the fact that σ T i , T j = C e A T i R R ( e A T j ) C (see García et al. 2008) and therefore, Θ can be expressed as Θ = C ( e A T 1 e A T M ) R R ( e A T 1 e A T M ) C . σ T i , T j is a linear combination of products of e k 1 T , , e k N T . In other words, if k 1 , , k N are the eigenvalues of A , e k 1 T , , e k N T must be the eigenvalues of Θ .
Moreover, from the eigenvalues of matrix A , it is also easy to determine the factors. Taking into account that factors´ Stochastic Differential Equation (SDE) is d X t = Μ d t + A X t d t + R d W t , if, for example, the eigenvalue is k = 0 , the factor is a long-term one because the SDE associated with this factor is a random walk (General Brownian Motion (GBM)): d x i t = μ i d t + σ i d W i t . On the other hand, if the eigenvalue is k ( 1 , 0 ) , the factor is a short-term one because the SDE associated with this factor is an Ornstein–Uhlenbeck: d x i t = λ x i t d t + σ i d W i t . If the eigenvalue is complex, the factor is a seasonal one.
From a practical point of view, when we carry out this procedure we get N eigenvalues and we need to decide how many of them to optimally choose. The way to decide this is through the relative weight of the eigenvalues. By normalizing the largest one to 1, the smallest eigenvalues represent negligible factors. This allows us to decide how many factors must be optimally chosen.
In order to clarify concepts, the following example could be useful, if we have M = 9 futures with maturities at times T 1 , , T 9 . The method is as follows.
  • Compute Θ ^ i j = [ C o v ^ ( log F t + h , T i log F t , T i , log F t + h , T j log F t , T j ) h ] .
  • Compute the rank of Θ ^ . Let us assume that this is 3.
  • As a result, we have three eigenvalues k 1 , k 2 and k 3 . It is usual to assume that k 1 = 0 as the futures process is not stationary, but k 1 can nevertheless be estimated. If we do assume it, however, we obtain that σ T i , T j is a linear combination of the products of e 0 T = 1 , e k 2 T and e k 3 T . Therefore, we obtain the general equation Θ i j = α 11 + α 12 e k 2 T j + α 13 e k 3 T j + α 21 e k 2 T i + α 31 e k 3 T i + α 22 e k 2 ( T i + T j ) + α 23 e k 2 T i + k 3 T j + α 23 e k 2 T j + k 3 T i + α 33 e k 3 ( T i + T j ) which can be estimated numerically as:
    • Select an initial estimate of ( k 2 , k 3 ) .
    • Regress Θ ^ i j and compute the error.
    • Iteratively select another estimate of ( k 2 , k 3 ) and get back to b.
To the best of the authors’ knowledge, no method has combined the knowledge of this concrete specification G = Φ ( T ) with a nonlinear search procedure to identify factors, which is one of the contributions made by this article.
Once we have determined the optimal number and form of the stochastic factors to characterize the commodity price dynamics, we can estimate model parameters using standard techniques. The Kalman filter (see, for example, [26]) uses a complex calibration technique. Other techniques include approximations such as [18] or [27]. Finally, the recently published option by [28] presents an optimal way of estimating model parameters by avoiding the use of the Kalman filter. Model parameters are estimated in the papers and so, for the sake of brevity we do not estimate the parameters in this study.

3. Data and Main Results

3.1. Data

In this subsection, we briefly describe the datasets used in this study. The datasets include weekly observations corresponding to futures prices for four commodities: WTI light sweet crude oil, heating oil, unleaded gasoline (RBOB) and Henry Hub natural gas. These futures were taken into consideration because they are the most representative and classic among the products. They are futures with many historical series and futures at many maturities. Therefore, they are considered as ideal for studying the optimal number of factors that should be chosen.
In this study, two data sets were considered for each commodity. Data set 1 contains less futures maturities, but more years of observations considered while data set 2 contains more futures maturities, but less years of observations. For dataset 1 (Table 1A), related to WTI crude oil, it comprised contracts from 4 September 1989 to 3 June 2013 (1240 weekly observations) for futures maturities from F1 to F17, F1 being the contract for the month closest to maturity, F2 the contract for the second-closest month to maturity, etc. In the case of heating oil, it contained contracts from 21 January 1991 to 3 June 2013 (1168 weekly observations) for futures maturities from F1 to F15. Meanwhile, RBOB gasoline first data set comprised contracts from 3 October 2005 to 3 June 2013 (401 weekly observations) for futures maturities from F1 to F12 and in the case of Henry Hub natural gas, it contained contracts from 27 January 1992 to 3 June 2013 (1115 weekly observations) for futures maturities from F1 to F16.
Looking at the dataset 2 (Table 1B), in the case of WTI crude oil, it comprised contracts from 18 September 1995 to 3 June 2013 (925 weekly observations) for futures maturities from F1 to F28 and in the case of heating oil it comprised contracts from 9 September 1996 to 3 June 2013 (874 weekly observations) for futures maturities from F1 to F18. In the meantime, RBOB gasoline comprised contracts from 2 February 2007 to 3 June 2013 (330 weekly observations) for futures maturities from F1 to F36 and, to end with, in the case of Henry Hub natural gas, dataset 2 (Table 1B) contained contracts from 24 March 1997 to 3 June 2013 (856 weekly observations) for futures maturities from F1 to F36.
Table 1 shows the main descriptive statistics of the futures, particularly the mean and volatility, for each dataset. It is interesting to note that the lack of low-cost transportation and the limited storability of natural gas made its supply unresponsive to seasonal variation in demand. Thus, natural gas prices were strongly seasonal [3]. The unleaded gasoline was also seasonal.

3.2. Main Results

We now present the results after applying the method proposed to the 4 commodities (2 datasets per commodity) described above in order to select the number of factors to model the behavior of commodity prices. The results correspond to the eigenvalues in decreasing order, the percentage of the overall variability that they explain and the cumulative proportion of explained variance. These are reported in Table 2, Table 3, Table 4 and Table 5.
As a general rule, we can consider that the first factor, which corresponds to the first eigenvalue, was clearly dominant in the sense that it can explain a percentage of the total variance ranging between 95.2% and 99.7%, depending on the commodity. It captures qualitative long-run effects. However, it is always necessary to consider a second factor capable of taking up short-term effects. Both the first and second factors explain a cumulative proportion of overall variance between 97.5% and 99.9%, depending on the case under study. In WTI light sweet crude oil, these two factors explain more than a 99.99% of the total variance is explained, while in heating oil case studies, these percentages were approximately 99.88% and in unleaded gasoline and Henry Hub natural gas, they were approximately 97–98%.
Consequently, in the first commodity (crude oil) it is recommended that just the first two factors are considered. The reason is that a third factor will impose a larger estimating effort and a minimum reduction in terms of error measures. The first factor will capture long-term effects, such as world economic events, which significantly impact on commodity prices. The second factor will capture the nature of short-term components such as temporary issues and unforeseen situations. The third and following stochastic factors can be considered as seasonal factors [28] and, as we know, crude oil is a non-seasonal commodity. This matter reinforces the idea that it is suitable to consider a model with only the first two factors.
The next commodity, heating oil, presents some seasonal behavior, which could be captured by a third factor. The fact that the gain in the percentage of cumulative proportion of overall variance goes from 99.88 to 99.94 and from 99.90 to 99.94 in its respective datasets suggest the inclusion of a third factor was not necessary.
Conversely, for the unleaded gasoline and Henry hub natural gas, at least a third factor seemed to be necessary. Both were seasonal commodities (see, for example, [3]). They were characterized by very limited storability and their prices were highly dependent on the commodity demand. Third and fourth factors will acknowledge this behavior. It seems necessary to capture more than long-term and short-term dynamics. Depending on the cumulative variance, if we would like to explain (98–99%), we need to consider at least a third factor or two more. In the unleaded gasoline case, the inclusion of a third factor would increase the cumulative proportion of overall variance from 98.48% to 99.73% and from 97.49% to 98.73%. However, with a fourth factor, we would reach 99.86% and 99.76%, respectively. When we apply the methodology proposed to Henry Hub natural gas datasets, we also verify the need to consider a third and even a fourth factor to explain 99.80% and 99.65% of the total variance, respectively.
These results are coherent with the patterns shown in the futures contracts of each commodity. By considering seasonality as a stochastic factor instead of a deterministic one, we can choose from two- to four-factor models to better model the behavior of commodity prices. It should be noted that the long-term and short-term effects, captured by the first two factors, are clearly dominant in terms of their eigenvalues’ relative weight. However, the seasonality should be considered if necessary.
It is important to bear in mind that the distinction between long term and short term is not always direct. It is related to the eigenvalue of the factor, which, as we have stated, is always in the form e k with k 0 (a positive k would mean an explosive process, which is clearly not observed in the data).
If k = 0 , we have a long-term effect (a unit root). The more negative k is, the shorter the effect. Therefore, k = 1 means a much shorter effect than k = 0.01 , for example.
Explanation capacities of each factor are measured according to their (relative) contribution to the global variance. For example, if there is a unique factor related to eigenvalue k = 0 that gives 90% of variance, we would conclude that long term dynamics explain 90% of the variance.
It should be noted that this article focuses on the econometric theory and identifies the optimal number of factors to characterize the dynamics of commodity prices. Apart from this econometric approach, where each factor represents a component—long term, short term, seasonal, etc.—these factors may also capture economic forces [29,30,31]. In other words, there are economic forces that are being captured by these factors, such as technology effects (long term) or the functioning of the market (short term). Following [15], we argue that the long-term factor reflects expectations of the exhaustion of the existing supply, improvements in technology for the production and discovery of the commodity, inflation, as well as political and regulatory effects. The short-term factor reflects short-term changes in demand or intermittent supply disruptions. An interpretation of seasonal factors can be found in [3].
This method provides a new selection criterion for obtaining the optimal number of factors. It is always important to keep in mind the purpose of modeling such commodity prices. If we need more accuracy because, for example, we are designing investment strategies, the consideration of more factors is understandable. We could also use fewer factors in a different case.
This is important because, on one hand, if we use too many factors the model will be too complex and parameter estimation may not be accurate. On the other hand, if we use too few factors the model will not be acceptable because it will not capture all the characteristic of the price dynamics that we need to consider in order to solve our problem.
We believe our findings to be very useful for researchers and practitioners. Based on our findings, a researcher who needs to model a commodity price dynamic can use our method to identify the number and the characteristics of the factors to be included in the model. Moreover, a practitioner who is investing or measuring risk can also use our methodology in order to identify the optimal number of factors needed and their characteristics.
Finally, as stated above, we have chosen to order the factors according to their relative (joint) contribution to variance because it is a direct and simple way to interpret the results. We are aware that collinearity and, in general, correlation structures can modify the results. However, since the first eigenvalue explains around 95% of variance, it seems unlikely that results are going to change substantially by a more refined analysis.

4. Summary and Conclusions

In this article, we propose a novel methodology for choosing the optimal number of stochastic factors to be included in a model of the term structure of futures commodity prices. With this method, we add to the research related to the way we characterize commodity price dynamics.
The procedure is based on the eigenvalues of the variance–covariance matrix. Moreover, in deciding how many of them to choose, we propose using the relative weight of the eigenvalues and the percentage of the total variance explained by them and balancing this with the effort of estimating more parameters.
In this article, we applied our method to eight datasets, corresponding with four different commodities: crude oil, heating oil, unleaded gasoline and natural gas. Results indicate that to model the first two commodity prices two factors are suitable, which corresponds with the two biggest eigenvalues, since they are sufficient to account for both long-term and short-term structures. Nevertheless, in the case of unleaded gasoline and natural gas, a third or even fourth factor is needed. We think that, in accordance with the literature, this is related to their seasonal behavior.
Our results support the notion that including too many or too few factors or factors with characteristics which are not optimal in a model for commodity prices could lead to results which may not be as accurate as they should be.

Author Contributions

Conceptualization, B.L. and J.P.; methodology, A.G.-M.; software, A.G.-M.; validation, Población and A.G.-M.; formal analysis, A.G.-M.; investigation, J.P. and A.G.-M.; resources, A.G.-M., B.L. and J.P.; data curation, A.G.-M., B.L. and J.P.; writing—original draft preparation, B.L.; writing—review and editing, B.L. y Población; visualization, B.L.; supervision, Población; project administration, B.L.; funding acquisition, B.L. All authors have read and agreed to the published version of the manuscript.

Funding

We acknowledge the financial support of the Spanish Ministerio de Economía, Industria y Competitividad Grant Number ECO2017-89,715-P (Javier Población).

Acknowledgments

This study should not be reported as representing the views of the Banco de España (BdE) or European Central Bank (ECB). The views in this study are those of the author and do not necessarily reflect those of the Banco de España (BdE) or European Central Bank (ECB). We thank the anonymous referees. Any errors are caused by the authors.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Jahn, F.; Cook, M.; Graham, M. Hydrocarbon Exploration and Production; Elsevier: Aberdeen, UK, 2008. [Google Scholar]
  2. Smit, H.T.J. Investment analysis of offshore concessions in the Netherlands. Financ. Manag. 1997, 26, 5–17. [Google Scholar] [CrossRef] [Green Version]
  3. García, A.; Población, J.; Serna, G. The Stochastic Seasonal Behaviour of Natural Gas Prices. Eur. Financ. Manag. 2012, 18, 410–443. [Google Scholar]
  4. García, A.; Población, J.; Serna, G. The stochastic seasonal behavior of energy commodity convenience yields. Energy Econ. 2013, 40, 155–166. [Google Scholar]
  5. García, A.; Población, J.; Serna, G. Analyzing the dynamics of the refining margin: Implications for valuation and hedging. Quant. Financ. 2013, 12, 1839–1855. [Google Scholar]
  6. Alquist, R.; Bhattarai, S.; Coibion, O. Commodity-price comovement and global economic activity. J. Monet. Econ. 2019. [Google Scholar] [CrossRef] [Green Version]
  7. Jacks, D.S. From boom to bust: A typology of real commodity prices in the long run. Cliometrica 2019, 13, 201–220. [Google Scholar] [CrossRef] [Green Version]
  8. Nazlioglu, S. Oil and Agricultural Commodity Prices. In Routledge Handbook of Energy Economics; Soytas, U., San, R., Eds.; Routledge: London, UK, 2020; pp. 385–405. [Google Scholar]
  9. Ayres, J.; Hevia, C.; Nicolini, J.P. Real exchange rates and primary commodity prices. J. Intern. Econ. 2020, 122. [Google Scholar] [CrossRef]
  10. Población, J.; Serna, G. A common long-term trend for bulk shipping prices. Marit. Econ. Logist. 2018, 20, 421–432. [Google Scholar] [CrossRef]
  11. García, A.; Población, J.; Serna, G. Hedging voyage charter rates on illiquid routes. Intern. J. Shipp. Transp. Logist. 2020, 12, 197–211. [Google Scholar]
  12. Morgan, J.P. Risk Metrics–Technical Document; Reuters: New York, NY, USA, 1996. [Google Scholar]
  13. Echaust, K.; Just, M. Value at Risk Estimation Using the GARCH-EVT Approach with Optimal Tail Selection. Mathematics 2020, 8, 114. [Google Scholar] [CrossRef] [Green Version]
  14. Schwartz, E.S. The stochastic behavior of commodity prices: Implication for valuation and hedging. J. Financ. 1997, 52, 923–973. [Google Scholar] [CrossRef]
  15. Schwartz, E.S.; Smith, J.E. Short-term variations and long-term dynamics in commodity prices. Manag. Sci. 2000, 46, 893–911. [Google Scholar] [CrossRef] [Green Version]
  16. Sorensen, C. Modeling seasonality in agricultural commodity futures. J. Futures Mark. 2002, 22, 393–426. [Google Scholar] [CrossRef]
  17. Gibson, R.; Schwartz, E.S. Stochastic convenience yield and the pricing of oil contingent claims. J. Financ. 1990, 45, 959–976. [Google Scholar] [CrossRef]
  18. Cortazar, G.; Schwartz, E.S. Implementing a stochastic model for oil futures prices. Energy Econ. 2003, 25, 215–218. [Google Scholar] [CrossRef]
  19. Cortazar, G.; Naranjo, L. An N-Factor gaussian model of oil futures prices. J. Futures Mark. 2006, 26, 209–313. [Google Scholar] [CrossRef]
  20. Camiz, S.; Pillar, V.D. Identifying the Informational/Signal Dimension in Principal Component Analysis. Mathematics 2018, 6, 269. [Google Scholar] [CrossRef] [Green Version]
  21. Cortazar, G.; Schwartz, E.S. The valuation of commodity-contingent claims. J. Deriv. 1994, 1, 27–39. [Google Scholar] [CrossRef]
  22. Clewlow, L.; Strickland, C. Energy Derivatives, Pricing and Risk Management; Lamina Publication: London, UK, 2000. [Google Scholar]
  23. Tolmasky, C.; Hindanov, D. Principal components analysis for correlated curves and seasonal commodities: The case of the petroleum market. J. Futures Mark. 2002, 22, 1019–1035. [Google Scholar] [CrossRef]
  24. Hull, J. Options, Futures and Other Derivatives, 5th ed.; Prentice Hall: Upper Saddle River, NJ, USA, 2003. [Google Scholar]
  25. García, A.; Población, J.; Serna, G. A Note on Commodity Contingent Valuation. J. Deriv. Hedge Fund. 2008, 13, 311–320. [Google Scholar] [CrossRef]
  26. Harvey, A.C. Forecasting Structural Time Series Models and the Kalman Filter; Cambridge University Press: Cambridge, UK, 1989. [Google Scholar]
  27. Kolos, S.P.; Rohn, E.I. Estimating the commodity market price of risk for energy prices. Energy Econ. 2008, 30, 621–641. [Google Scholar] [CrossRef]
  28. García, A.; Larraz, B.; Población, J. An alternative method to estimate parameters in modeling the behavior of commodity prices. Quant. Financ. 2016, 16, 1111–1127. [Google Scholar] [CrossRef]
  29. Coles, J.L.; Li, Z.F. An Empirical Assessment of Empirical Corporate Finance. SSRN 2019. [Google Scholar] [CrossRef] [Green Version]
  30. Coles, J.L.; Li, Z.F. Managerial Attributes, Incentives, and Performance. Rev. Corp. Financ. Stud. 2019. [Google Scholar] [CrossRef]
  31. Dang, C.; Foerster, S.R.; Li, Z.F.; Tang, Z. Analyst Talent, Information, and Insider Trading. SSRN 2020. [Google Scholar] [CrossRef]
Table 1. Descriptive statistics.
(A) Dataset 1 
(A) Dataset 1 
WTI Crude OilGasolineNatural GasHeating Oil
Mean ($/bbl)Volatility
(%)
Mean
($/bbl)
Volatility
(%)
Mean
($/MMBtu)
Volatility
(%)
Mean
($/bbl)
Volatility
(%)
F143.130%96.732%4.245%63.628%
F243.327%96.530%4.340%63.826%
F343.325%96.429%4.336%64.024%
F443.424%96.327%4.432%64.123%
F543.323%96.127%4.429%64.122%
F643.322%95.926%4.427%64.221%
F743.321%95.725%4.526%64.220%
F843.220%95.626%4.524%64.119%
F943.220%95.625%4.524%64.119%
F1043.119%95.426%4.522%64.118%
F1143.119%95.525%4.522%64.118%
F1243.018%95.525%4.521%64.017%
F1343.018% 4.520%63.717%
F1442.918% 4.520%63.417%
F1542.917% 4.520%63.017%
F1642.817% 4.520%
F1742.817%
(B) Dataset 2 
(B) Dataset 2 
WTI Crude OilGasolineNatural GasHeating oil
Mean ($/bbl)Volatility
(%)
Mean
($/bbl)
Volatility
(%)
Mean
($/MMBtu)
Volatility
(%)
Mean
($/bbl)
Volatility
(%)
F150.930%101.132%4.945%53.330%
F251.228%100.630%5.040%53.528%
F351.326%100.330%5.138%53.626%
F451.325%100.128%5.133%53.725%
F551.324%99.828%5.231%53.724%
F651.323%99.627%5.228%53.723%
F751.322%99.326%5.327%53.722%
F851.321%99.226%5.326%53.721%
F951.221%99.125%5.325%53.721%
F1051.220%99.127%5.324%53.620%
F1151.120%99.126%5.322%53.619%
F1251.119%99.126%5.321%53.519%
F1351.019%99.126%5.321%53.319%
F1450.919%99.025%5.321%53.019%
F1550.818%98.925%5.321%53.018%
F1650.818%98.823%5.320%53.518%
F1750.718%98.524%5.320%55.418%
F1850.718%98.323%5.319%58.418%
F1950.617%98.123%5.320%
F2050.517%98.023%5.319%
F2150.517%98.023%5.319%
F2250.417%97.924%5.320%
F2350.417%97.923%5.218%
F2450.317%97.924%5.218%
F2550.316%97.924%5.218%
F2650.216%97.823%5.218%
F2750.216%97.824%5.218%
F2850.116%97.723%5.218%
F29 97.623%5.218%
F30 97.522%5.218%
F31 97.422%5.218%
F32 97.423%5.219%
F33 97.322%5.218%
F34 97.123%5.219%
F35 97.023%5.218%
F36 97.023%5.217%
Table 2. Eigenvalues for both datasets of the WTI light sweet crude oil.
Table 2. Eigenvalues for both datasets of the WTI light sweet crude oil.
Dataset 1Dataset 2
EigenvaluesPercentage of
Total Variance
Cumulative
Variance (%)
EigenvaluesPercentage of
Total Variance
Cumulative
Variance (%)
10099.671399.671310099.544899.5448
0.32020.319199.99040.44280.440899.9855
0.00840.008499.99880.01260.012599.9980
0.00100.001099.99980.00170.001799.9997
0.00010.000199.99990.00020.000299.9998
2.9905 × 10−52.9806 × 10−51000.00010.000199.9999
1.2209 × 10−51.2169 × 10−51003.7318 × 10−53.7148 × 10−5100
5.4907 × 10−65.4727 × 10−61001.6898 × 10−51.6821 × 10−5100
2.7838 × 10−62.7746 × 10−61008.7477 × 10−68.7079 × 10−6100
1.5250 × 10−61.5200 × 10−61004.4713 × 10−64.4509 × 10−6100
7.5290 × 10−77.5043 × 10−71003.0355 × 10−63.0217 × 10−6100
4.3460 × 10−74.3317 × 10−71002.3004 × 10−62.2899 × 10−6100
3.3010 × 10−73.2901 × 10−71001.3628 × 10−61.3566 × 10−6100
1.8100 × 10−71.8041 × 10−71008.7940 × 10−78.7540 × 10−7100
1.1490 × 10−71.1452 × 10−71004.7100 × 10−74.6886 × 10−7100
1.0350 × 10−71.0316 × 10−71003.6450 × 10−73.6284 × 10−7100
3.9300 × 10−83.9171 × 10−81002.3880 × 10−72.3771 × 10−7100
1.8840 × 10−71.8754 × 10−7100
1.4180 × 10−71.4115 × 10−7100
1.1480 × 10−71.1428 × 10−7100
1.0070 × 10−71.0024 × 10−7100
7.7800 × 10−87.7446 × 10−8100
5.4200 × 10−85.3953 × 10−8100
4.7800 × 10−84.7582 × 10−8100
3.8600 × 10−83.8424 × 10−8100
2.3000 × 10−82.2895 × 10−8100
2.1600 × 10−82.1502 × 10−8100
8.9000 × 10−98.8595 × 10−9100
Table 3. Eigenvalues for both datasets of the heating oil.
Table 3. Eigenvalues for both datasets of the heating oil.
Dataset 1Dataset 2
EigenvaluesPercentage of
Total Variance
Cumulative
Variance (%)
EigenvaluesPercentage of
Total Variance
Cumulative
Variance (%)
10099.613399.613310099.536599.5365
0.26980.268799.88200.36660.364999.9014
0.06580.065599.94750.04750.047299.9486
0.04740.047299.99470.04480.044699.9932
0.00280.002899.99750.00370.003799.9969
0.00130.001399.99880.00120.001299.9981
0.00090.000899.99970.00110.001199.9992
0.00010.000199.99980.00050.000599.9997
0.00010.000199.99990.00010.000199.9998
4.7937 × 10−54.7752 × 10−599.99990.00010.000199.9999
1.9734 × 10−51.9658 × 10−51004.1450 × 10−54.1257 × 10−599.9999
1.1626 × 10−51.1581 × 10−51002.8767 × 10−52.8633 × 10−599.9999
1.0482 × 10−51.0441 × 10−51001.5784 × 10−51.5711 × 10−5100
9.6273 × 10−69.5901 × 10−61001.3478 × 10−51.3416 × 10−5100
6.3346 × 10−66.3101 × 10−61009.3425 × 10−69.2992 × 10−6100
7.9877 × 10−67.9507 × 10−6100
6.1859 × 10−66.1572 × 10−6100
5.7507 × 10−65.7240 × 10−6100
Table 4. Eigenvalues for both datasets of the unleaded gasoline (RBOB).
Table 4. Eigenvalues for both datasets of the unleaded gasoline (RBOB).
Dataset 1Dataset 2
EigenvaluesPercentage of
Total Variance
Cumulative
Variance (%)
EigenvaluesPercentage of
Total Variance
Cumulative
Variance (%)
10096.859196.859110095.247395.2473
1.67481.622298.48132.35702.245097.4924
1.29011.249699.73081.30501.242998.7353
0.13340.129299.86001.07621.025099.7603
0.05580.054099.91400.06390.060899.8212
0.03860.037499.95150.05990.057099.8782
0.02170.021099.97240.04370.041699.9198
0.01560.015199.98760.02170.020699.9405
0.00930.009099.99660.02080.019899.9602
0.00220.002299.99880.01710.016399.9765
0.00090.000999.99970.00740.007099.9835
0.00030.00031000.00490.004799.9882
0.00300.002999.9910
0.00250.002499.9935
0.00190.001899.9953
0.00120.001199.9964
0.00080.000899.9972
0.00060.000699.9978
0.00040.000499.9982
0.00040.000399.9986
0.00030.000399.9989
0.00030.000399.9991
0.00020.000299.9993
0.00010.000199.9995
0.00010.000199.9996
0.00010.000199.9997
0.00010.000199.9997
0.00010.000199.9998
0.00010.000099.9999
3.8439 × 10−53.6612 × 10−599.9999
2.8300 × 10−52.6955 × 10−599.9999
2.4205 × 10−52.3055 × 10−599.9999
1.9530 × 10−51.8602 × 10−5100
1.5016 × 10−51.4303 × 10−5100
1.2694 × 10−51.2091 × 10−5100
9.9475 × 10−69.4747 × 10−6100
Table 5. Eigenvalues for both datasets of the henry hub natural gas.
Table 5. Eigenvalues for both datasets of the henry hub natural gas.
Dataset 1Dataset 2
EigenvaluesPercentage of
Total Variance
Cumulative
Variance (%)
EigenvaluesPercentage of
Total Variance
Cumulative
Variance (%)
10097.817997.817910095.895795.8957
1.15641.131198.94912.79722.682498.5782
0.47850.468199.41720.59930.574799.1529
0.39600.387499.80460.52210.500799.6535
0.08390.082199.88670.11780.113099.7665
0.07300.071499.95800.09930.095299.8617
0.02230.021899.97980.07820.075099.9367
0.00520.005099.98490.01660.015999.9527
0.00390.003899.98870.00740.007199.9598
0.00310.003099.99170.00700.006799.9665
0.00270.002799.99440.00600.005799.9722
0.00230.002399.99670.00510.004999.9772
0.00120.001299.99790.00480.004699.9818
0.00100.001099.99880.00380.003799.9854
0.00070.000799.99950.00320.003199.9886
0.00050.00051000.00240.002399.9908
0.00200.001999.9927
0.00180.001799.9944
0.00170.001699.9961
0.00130.001299.9973
0.00100.000999.9983
0.00060.000699.9988
0.00030.000399.9991
0.00020.000299.9993
0.00020.000299.9995
0.00010.000199.9996
0.00010.000199.9997
0.00010.000199.9998
0.00010.000199.9998
0.00010.000199.9999
3.2345 × 10−53.1018 × 10−599.9999
2.8128 × 10−52.6974 × 10−5100
1.5565 × 10−51.4926 × 10−5100
1.2489 × 10−51.1977 × 10−5100
9.3473 × 10−68.9637 × 10−6100
7.6972 × 10−67.3813 × 10−6100

Share and Cite

MDPI and ACS Style

García-Mirantes, A.; Larraz, B.; Población, J. A Proposal to Fix the Number of Factors on Modeling the Dynamics of Futures Contracts on Commodity Prices . Mathematics 2020, 8, 973. https://0-doi-org.brum.beds.ac.uk/10.3390/math8060973

AMA Style

García-Mirantes A, Larraz B, Población J. A Proposal to Fix the Number of Factors on Modeling the Dynamics of Futures Contracts on Commodity Prices . Mathematics. 2020; 8(6):973. https://0-doi-org.brum.beds.ac.uk/10.3390/math8060973

Chicago/Turabian Style

García-Mirantes, Andrés, Beatriz Larraz, and Javier Población. 2020. "A Proposal to Fix the Number of Factors on Modeling the Dynamics of Futures Contracts on Commodity Prices " Mathematics 8, no. 6: 973. https://0-doi-org.brum.beds.ac.uk/10.3390/math8060973

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop