Next Article in Journal
A Self-Adaptive Shrinking Projection Method with an Inertial Technique for Split Common Null Point Problems in Banach Spaces
Previous Article in Journal
Singularly Perturbed Cauchy Problem for a Parabolic Equation with a Rational “Simple” Turning Point
Previous Article in Special Issue
On the Numerical Solution of Ordinary, Interval and Fuzzy Differential Equations by Use of F-Transform
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Bitcoin Analysis and Forecasting through Fuzzy Transform

by
Maria Letizia Guerra
1,*,
Laerte Sorini
2 and
Luciano Stefanini
2
1
Department of Statistical Sciences “Paolo Fortunati”, University of Bologna, 40126 Bologna, Italy
2
Department of Economics, Society, Politics, University of Urbino Carlo Bo, 61029 Urbino, Italy
*
Author to whom correspondence should be addressed.
Axioms 2020, 9(4), 139; https://doi.org/10.3390/axioms9040139
Submission received: 9 September 2020 / Revised: 24 November 2020 / Accepted: 25 November 2020 / Published: 28 November 2020
(This article belongs to the Special Issue Fuzzy Transforms and Their Applications)

Abstract

:
Sentiment analysis to characterize the properties of Bitcoin prices and their forecasting is here developed thanks to the capability of the Fuzzy Transform (F-transform for short) to capture stylized facts and mutual connections between time series with different natures. The recently proposed Lp-norm F-transform is a powerful and flexible methodology for data analysis, non-parametric smoothing and for fitting and forecasting. Its capabilities are illustrated by empirical analyses concerning Bitcoin prices and Google Trend scores (six years of daily data): we apply the (inverse) F-transform to both time series and, using clustering techniques, we identify stylized facts for Bitcoin prices, based on (local) smoothing and fitting F-transform, and we study their time evolution in terms of a transition matrix. Finally, we examine the dependence of Bitcoin prices on Google Trend scores and we estimate short-term forecasting models; the Diebold–Mariano (DM) test statistics, applied for their significance, shows that sentiment analysis is useful in short-term forecasting of Bitcoin cryptocurrency.

1. Introduction

The notion of a fuzzy transform (F-transform) as a tool for modeling with fuzzy rules as specific transformation and for general approximation of functions has been introduced by Perfilieva in [1] (see also [2]) and is now recognized as a powerful technique with important properties and potentials for various applications, as developed in several papers and special issues (see, e.g, [3,4,5] and the references therein).
In the present paper, we will focus on the use of quantile and expectile F-transform (and, more generally, on L p -norm-based fuzzy-valued F-transforms) in modeling time series. In particular, we will show the application of direct and inverse L p -norm F-transform to analyze the connection between Bitcoin returns and the level of interest in the world wide web. The Bitcoin observed time series is modeled through fuzzy-valued functions, whose level-cuts can be interpreted in the setting of expectile and quantile fuzzy regressions; these last are introduced in [6,7] as non-parametric smoothing methodologies and are constructed by defining fuzzy-valued L p -norm extensions of the F-transforms (in particular, expectile or L 2 -norm and quantile or L 1 -norm).
Quantile regression is also applied in [8] to show that Bitcoin reacts positively to uncertainty at both higher quantiles and shorter frequency movements of Bitcoin returns.
Following recent research on financial time series, where the properties of quantile and expectile modeling are discussed with respect to coherent and elicitable risk measures (see [9,10]), expectile methods seem to compete favourably with quantiles; furthermore, some recent papers (e.g., [11]) suggest to adopt L p -norm-based procedures with p between 1 and 2 (e.g., p 1.25 , 1.5 , 1.75 ), in order to consider (probabilistic) tail behaviour according, e.g., to robust Extreme Value Theory.
Literature about Bitcoin has hugely grown in recent years and some papers deserve a citation. An exhaustive analysis of Bitcoin and its statistical properties are explored in [12] by a comparison with standard currencies dynamics. Yermack in [13] shows that Bitcoin does not satisfy the three main properties called medium of exchange, unit of account and store of value, concluding that it is not a currency but rather a speculative asset. However, a debate is still open about the nature of Bitcoin and many hints can be found in [14,15,16]. A shared property by financial instruments is the day-of-the-week effect and in [17] the same effect is proved for Bitcoin returns and volatility through OLS and GARCH model. The possibility of global economic policy uncertainty to produce valid information to improve the prediction of returns and volatility in the Bitcoin market is detailed in [18].
Research on possible forecasting models to be used as decision support tools in investment strategies is more recent; in [19], monthly data are considered and it is shown that the predictive ability of the internet-based economic uncertainty related queries index is statistically stronger than the measure of uncertainty derived from newspapers in predicting Bitcoin returns.
The nexus between Bitcoin prices and market sentiment is further studied in many papers: in [20] sentiment is shown to explain about 2.5% to 5% of the unusual level of price clustering in Bitcoin. In [21] the cross-correlations between Google Trends and Bitcoin market is analyzed through the Multifractal Detrended Cross-correlation Analysis (MF-DCCA) method and in [22], within the more general context of Dow Jones Industrial Average, it is shown that Google searches are power-law correlated with Hurst exponents between 0.8 and 1.1; the authors conclude that globally on time domain, there is no relationship between the on-line search queries and some financial measures. In [23] investor sentiment regarding Bitcoin is introduced because of its significant information for explaining changes in Bitcoin volatility for future periods; on this basis Bitcoin is proved to be an investment asset with high volatility and dependence on investor sentiment rather than a monetary asset.
In a more general framework, in [24], interactions between (mass) media reporting and financial market movements are measured with particular focus on the property of sentiment as a predictors of securities prices.
The existing high correlation between Bitcoin prices and Google Trend scores is discovered and documented since the origins of digital currencies (see, e.g., [25]) and several pieces of research have discussed about its characteristics (see [26]) and about predicting prices using sentiment analysis (see references numbered from 23 to 27 in [26]). On the other hand, there is evidence that a bi-directional causal relationship exists between Bitcoin web-attention and Bitcoin returns, in particular for data in the left tail (poor performance) and the right tail (superior performance) of the observed statistical distribution (see [27]).
A non-parametric forecasting model based on technical analysis is presented in [28], focusing on the presence of predictive local non-linear trends that reflect the speculative nature of cryptocurrency trading. In [29] a computational intelligence technique that uses a hybrid Neuro-Fuzzy controller is introduced to forecast the direction in the change of the daily price of Bitcoin and its performance is shown to be good when compared with two other computational intelligence models based on a simpler neuro-fuzzy model and an artificial neural network.
Forecasting of Bitcoin risk measures is developed in [30] by comparing predictability of the one-step-ahead volatility with Value-at-Risk using several volatility models.
Many other authors approach general cryptocurrency properties. For example, in [31] there is evidence that Bitcoin is the most influential among digital coins both as a transmitter toward digital currencies and as a receiver of spillovers from virtual and traditional instruments. An extended analysis is also presented in [32] where the four cryptocurrencies Bitcoin, Ethereum, Ripple and Litecoin are predicted through a combination of eight models revealing that a combination of stochastic volatility and a student-t distribution gives the best results. The same topic of Bitcoin-realized volatility forecasting is studied in [33] where conventional regression models are substituted by least-squares model-averaging methods and no investor sentiment is modeled.
In [34,35] a continuous time model for Bitcoin price dynamics is studied to detect bubbles; regarding the existence of a bubble, in [36] it is proved it holds from early 2013 to mid-2014, but, not in late 2017 as supposed. Evidence of bubbly Bitcoin behaviour, mainly in the 2017–2018 period, is shown in [37], where it is also proved that economic policy uncertainty and stock market volatility play the most important role in Bitcoin values.
The evidence for the Bitcoin bubble is confirmed in [38] through the empirical validation of three properties: volume of trading is mainly explained in terms of price dynamics, trading is based exclusively on past prices and the price of Bitcoin is an explosive process.
In [39] a thorough analysis is conducted: several alternative univariate and multivariate models for point and density forecasting of crypto-series are compared, finding statistically significant improvements in point forecasting when using combinations of univariate models, and in density forecasting when relying on the selection of multivariate models.
Various deep learning-based Bitcoin price prediction models are studied in [40] using Bitcoin block-chain information; regression and classification problems are addressed in the sense that the first predicts the future Bitcoin price and the second one predicts whether the future price will go up or down.
In the case of Bitcoin prices using high frequency data, in [41] it is shown that it exists a large degree of multi-fractality in all examined time intervals which can be attributed to the high kurtosis and the fat distributional tails of the series returns; in [42] there is evidence about the leverage effect as the most powerful effect in volatility forecasting; volatility is also analyzed in [43] in terms of the property of the long memory parameter to be significant and quite stable for both unconditional and conditional volatilities at different time scales. Extending the study to several high frequency cryptocurrencies data, in [44] the investigation on stylized facts is developed in terms of the Hurst exponent of dependence between four different cryptocurrencies.
Also in [45] multi-fractality of Bitcoin time series is investigated, confirming that both temporal correlation and the fat-tailed distribution are the main sources, in addition in [46] a possible use of multi-fractal parameters in Technical Analysis is suggested.
The paper is organized into six sections. Preliminary facts on our methodology concerning F-transform are presented in section two. Our empirical experiments and analyses concerning Bitcoin prices and Google Trend scores are detailed in sections three and four: in section three, we apply the expectile and quantile (inverse) F-transforms to both time series and we examine their relationship on pre-clustered subsets of observations, subdivided in terms of three different clustering criteria; in section four we identify stylized facts for Bitcoin prices, based on local (low-order polynomial) trends obtained by direct F-transform, we study their clustering to obtain (centroid) typical forms and we use them to reconstruct the time series and to analyze their time evolution in terms of a transition matrix. Possible short-term forecasting models of Bitcoin prices using Google Trends are shown in section five and the Diebold–Mariano (DM) test statistics is applied for their significance. Section six closes with some comments and hints for future research paths.

2. Fuzzy-Transform Smoothing

We introduced in [47] and then we enhance in [6] two non-parametric smoothing methodologies called expectile and quantile fuzzy-transform; the first one is based on the classical direct F-transform and it is obtained by minimizing a least-squares (L2-norm) operator while the second one is based on the L1-type direct F-transform and it is obtained by minimizing an L1-norm operator.
Some preliminary notions compose the research framework: a fuzzy set is a mapping u : R [ 0 ,   1 ] and a fuzzy interval is a fuzzy set on R with the properties that the mapping u is (i) normal ( x ^ R with u ( x ^ ) = 1 ), (ii) upper semi-continuous, (iii) fuzzy convex ( u ( λ x + ( 1 λ ) x ) min { u ( x ) , u ( x ) } for all λ [ 0 ,   1 ] ), (iv) c l { x | u ( x ) > 0 } is a compact interval. A consequence of (ii) and (iii) is that the α-cuts [ u ] α = { x | u ( x ) α } = [ u α , u α + ] are compact intervals for all α [ 0 ,   1 ] . The 1-cut is the core [ u ] 1 = { x | u ( x ) = 1 } of u; the interval [ u ] 0 = c l ( { x | u ( x ) > 0 } ) is the 0-cut of u. A fuzzy interval is a fuzzy number if its core is a singleton [ u ] 1 = { u ^ } with u ^ R .
The space of real fuzzy intervals is denoted R F and the mapping u R F satisfies what follows:
u ( x ) = 0 if x [ u ] 0 sup { α | x [ u ] α } if x [ u ] 0 .
For a given real compact interval [ a , b ] , a generalized r-partition is defined by a triplet ( P , A , r ) where r 1 is integer, P = x j = a + j 1 n 1 ( b a ) ; j = 1 , 2 ,   ,   n , n 2 , is a uniform decomposition of [ a , b ] ; for simplicity of notation, if r > 1 we extend P by adding r 1 points x 1 j = a j b a n 1 , j = 1 ,   ,   r 1 on the left of a and r 1 points x n + j = b + j b a n 1 , j = 1 ,   ,   r 1 on the right of b. The second term of the triplet is a family A = { A r + 2 ,   ,   A 1 ,   A 2 ,   ,   A n ,   ,   A n + r 1 } of n + 2 r 2 continuous fuzzy sets on R , called basic functions, that satisfy the following condition for all x [ a , b ]
k = r + 2 n + r 1 A k ( x ) = r
and are such that A k ( x k ) = 1 , for k = 2 r ,   , 1 ,   2 ,   ,   n ,   ,   n + r 1 , A k ( x ) = 0 for all x [ x k r , x k + r ] .
If r = 1 , the partition ( P , A , 1 ) will be simply denoted by ( P , A ) .
Families of basic functions can be obtained in terms of increasing shape functions such as rational splines of the form
L ( t ; β 0 , β 1 ) = t 2 + β 0 t ( 1 t ) 1 + ( β 0 + β 1 2 ) t ( 1 t ) ,       t [ 0 ,   1 ]
with real parameters β 0 0 , β 1 0 ; the Hermite-type conditions L ( 0 ) = 0 , L ( 1 ) = 1 , L ( 0 ) = β 0 , L ( 1 ) = β 1 are satisfied and L ( t ) 0 for all t [ 0 ,   1 ] . By any pair of non-negative values β 0 , β 1 , a large number of shape functions can be generated; for example, if β 0 + β 1 = 2 (with 0 β 0 2 , β 1 = 2 β 0 ) we have a quadratic function L ( t ) = ( 1 β 0 ) t 2 + β 0 t , e.g., L ( t ; 2 , 0 ) = 2 t t 2 , L ( t ; 0 , 2 ) = t 2 and L ( t ; 1 , 1 ) = t is linear.
Each basic function A k , k = 2 r ,   ,   1 ,   2 ,   ,   n ,   ,   n + r 1 , increasing on x k r , x k and decreasing on x k , x k + r , is obtained by translating t L ( t , β 0 , β 1 ) and t L ( 1 t , β 0 , β 1 ) from [ 0 , 1 ] onto x k r , x k and x k , x k + r , respectively (each A k is finally extended to R by setting A k ( x ) = 0 on the left of x k r and on the right of x k + r ).

2.1. L2-Norm F-Transform in Expectile Smoothing

We just recall the discrete version of the direct F-transform.
Definition 1.
(from [1]) Given a set of m values Y = ( t i , f i ) | t i [ a , b ] , i = 1 ,   ,   m of a function f : [ a , b ] R and a fuzzy partition ( P , A ) of [ a , b ] such that each subinterval [ x k 1 , x k + 1 ] contains at least one point t i in its interior (so that i = 1 m A k ( t i ) > 0 for all k), then the discrete direct L 2 -type F-transform of Y with respect to ( P , A ) is the n-tuple of real numbers ( F 1 ,   ,   F n ) where each component F k minimizes the function Φ k ( y ) = i = 1 m | f i y | 2 A k ( t i ) , k = 1 , 2 , , n . The associated inverse F-transform function (iF-transform for short) is defined by f ^ ( P , A ) ( x ) = k = 1 n F k A k ( x ) for all x [ a , b ] .
More generally, we consider a r-partition ( P , A , r ) and substitute the direct F-transform components F k , with an ( n + 2 r 2 ) -tuple of polynomials of order q 0 , say ( φ 2 r ( x ) ,   ,   φ n + r 1 ( x ) ) with φ k ( x ) = F k , 0 + F k , 1 ( x x k ) +     + F k , q ( x x k ) q , k = 2 r ,   ,   n + r 1 . The q + 1 coefficients F k , j , j = 0 ,   1 ,   ,   q are obtained, for fixed k, by minimizing the function Φ k ( y 0 ,   ,   y q ) = i = 1 m | f i y 0 + y 1 ( t i x k ) +     + y q ( t i x k ) q | 2 A k ( t i ) with respect to the parameters y 0 ,   ,   y q , under the assumption that for each k, the data points ( t i , f i ) with t i in the interval x m a x ( a , k r ) , x m i n ( b , k + r ) produce a unique optimal solution. The details are shown in [6].
The corresponding (inverse) iF-transform function is given by
f ^ ( P , A , r ) ( q ) ( x ) = 1 r k = 2 r n + r 1 A k ( x ) φ k ( x ; F k , 0 ,   ,   F k , q )                   f o r   x [ a , b ] .
Consider for simplicity the F-transform of order zero ( q = 0 ).
The iF-transform function becomes
f ^ ( P , A , r ) ( x ) = 1 r k = 2 r n + r 1 F k A k ( x )                   f o r   x [ a , b ]
with the ( n + 2 r 2 ) -tuple of the direct F-transform ( F 2 r ,   ,   F n + r 1 ) .
We recall from [6] that the expectile direct F-transform components are defined to be the minimizers of the strictly convex functions, for k = 2 r ,   ,   1 ,   ,   n ,   ,   n + r 1 and ω [ 0 , 1 ] ,
Φ k , ω ( μ ) = j = 1 m w j ( ω ; μ ) ( f j μ ) 2 A k ( t j )
where
w j ( ω ; μ ) = ω if f j > μ 1 ω if f j μ .
The value μ k = μ k ( ω ) (depending on ω) is the expectile for the asymmetry parameter ω [ 0 ,   1 ] and if ω = 1 2 we obtain the direct F-transform component F k in Equation (5).
Proposition 1.
([6]) Given the set of minimizers { μ k ω | ω ] 0 , 1 [ } of Φ k , ω μ , consider α [ 0 , 1 ] ; then the compact intervals
U k , α = { μ k ( 1 2 ) } i f α = 1 [ μ k ( α 2 ) , μ k ( 1 α 2 ) ] i f α ] 0 ,   1 [ c l ( β > 0 U k , β ) i f α = 0
define the α-cuts of a fuzzy number u k R F with membership function
u k ( x ) = sup { α | x U k , α } if x U k , 0 0 if x U k , 0 .
Definition 2.
Given a set of m points Y = { ( t i , f i ) ; t i [ a , b ] , i = 1 ,   ,   m } and a fuzzy r-partition ( P , A , r ) of [ a , b ] , the ( n + 2 r 2 )-vector of fuzzy numbers
F ( P , A , r ) = ( F 2 r ,   ,   F n + r 1 ) ,
where each fuzzy interval F k has α-cuts U k , α given by (8) in Proposition 1, is called the discrete direct expectile fuzzy-valued F-transform of f with respect to ( P , A , r ) , based on the dataset Y. The corresponding inverse expectile fuzzy-valued iF-transform is the fuzzy-valued function defined by
f ^ ( P , A , r ) x = 1 r k = 2 r n + r 1 F k A k ( x )   f o r   x [ a , b ] .
The fuzzy-valued function f ^ ( P , A , r ) ( x ) is well defined as indeed each basic function A k has non-negative values for each x [ a , b ] . The α -cuts U k , α of F k will be denoted by
[ F k ] α = [ F k , α , F k , α + ] ,       k = 2 r ,   ,   n + r 1 ,       α [ 0 , 1 ]
and the α -cuts of the fuzzy-valued function f ^ ( P , A , r ) x , x [ a , b ] , will be given by
f ^ ( P , A , r ) x α = 1 r k = 2 r n + r 1 F k , α A k x , 1 r k = 2 r n + r 1 F k , α + A k x ,             α [ 0 , 1 ] .
When α = 1 we obtain the standard direct F transform and the standard i F transform function, corresponding to the core of the fuzzy-valued iF-transform.

2.2. L1-Norm F-Transform in Quantile Smoothing

The L1-norm direct and inverse F-transform are defined as follows.
Definition 3.
Given a set of m values Y = ( t i , f i ) | t i [ a , b ] , i = 1 ,   ,   m of a function f : [ a , b ] R and a fuzzy partition ( P , A ) of [ a , b ] such that each subinterval [ x k 1 , x k + 1 ] contains at least one point t i in its interior, then the discrete direct L 1 -type F-transform of Y with respect to ( P , A ) is the n-tuple of real numbers ( G 1 ,   ,   G n ) where each component G k minimizes the function Ψ k ( y ) = i = 1 m f i y A k ( t i ) , k = 1 ,   2 ,   ,   n . The associated inverse F-transform function (iF-transform for short) is defined by f ˜ ( P , A ) ( x ) = k = 1 n G k A k ( x ) for all x [ a , b ] .
Also in this case, we consider a generalized r-partition ( P , A , r ) and substitute the direct F-transform components G k with an ( n + 2 r 2 ) -tuple of polynomials of order q 0 , say ( ψ 2 r ( x ) ,   ,   ψ n + r 1 ( x ) ) with ψ k ( x ) = G k , 0 + G k , 1 ( x x k ) +     + G k , q ( x x k ) q , k = 2 r ,   ,   n + r 1 . The q + 1 coefficients G k , j , j = 0 ,   1 ,   ,   q are obtained, for fixed k, by minimizing the function Ψ k ( y 0 ,   ,   y q ) = i = 1 m f i y 0 + y 1 ( t i x k ) +     + y q ( t i x k ) q A k ( t i ) with respect to the parameters y 0 ,   ,   y q (see details in [6]).
The corresponding L 1 -type inverse F-transform function is given by
f ( P , A , r ) ( q ) ( x ) = 1 r k = 2 r n + r 1 A k ( x ) ψ k ( x ; G k , 0 ,   ,   G k , q )           f o r   x [ a , b ] .
The iF-transform function of order zero ( p = 0 ) becomes
f ( P , A , r ) ( x ) = 1 r k = 2 r n + r 1 G k A k ( x )           f o r   x [ a , b ]
with the ( n + 2 r 2 ) -tuple of the L 1 -type direct F-transform ( G 2 r ,   ,   G n + r 1 ) .
We recall from [6] that the quantile direct F-transform is defined in terms of the minimizers of the convex functions, for k = 1 ,   ,   n and ω [ 0 , 1 ] ,
Ψ k , ω η = j = 1 m w j ω ; η f j η A k t j
where
w j ω ; η = ω if f j > η 1 ω if f j η .
As detailed in [6], the minimization of Ψ k , ω η produces the family of compact intervals, for α [ 0 , 1 ] and ω α 2 , 1 α 2 ,
V k , α = η k 1 2 if α = 1 η k α 2 , η k 1 α 2 if α [ 0 , 1 ] c l β > 0 V k , β if α = 0 ;
and we obtain the α -cuts of a fuzzy number v k R F with membership function
v k ( x ) = sup { α | x V k , α } if x V k , 0 0 if x V k , 0 .
Definition 4.
Given a set of m points Y = { ( t i , f i ) ; t i [ a , b ] , i = 1 ,   ,   m } and a fuzzy r-partition ( P , A , r ) of [ a , b ] , the ( n + 2 r 2 )-vector of fuzzy numbers
G ( P , A , r ) = ( G 2 r ,   ,   G n + r 1 ) ,
where each fuzzy interval G k has α-cuts V k , α , is called the discrete direct quantile fuzzy transform of Y with respect to ( P , A , r ) .
The corresponding (inverse) quantile iF-transform of f is the fuzzy-valued function defined by
f ( P , A , r ) x = 1 r k = 2 r n + r 1 G k A k x       for x [ a , b ] .
Denoting the α -cuts V k , α of G k by
G k α = G k , α , G k , α + ,             k = 2 r ,   ,   n + r 1 ,             α [ 0 , 1 ] ,
then, the α -cuts of the corresponding fuzzy-valued function f ˜ ( P , A , r ) x , x [ a , b ] , will be given by
f ( P , A , r ) x α = 1 r k = 2 r n + r 1 G k , α A k x , 1 r k = 2 r n + r 1 G k , α + A k x ,     α [ 0 , 1 ] .

2.3. General Lp-Norm-Based Discrete F-Transform

The general L p -norm-based F-transform has been analyzed in detail in [48] for the continuous case. Its interest in time series applications is motivated by recent literature on tail behaviour of economic and financial time series (see, e.g., [11]) and in modeling risk measures ([9,10]): L p -norm estimation with 1 < p < 2 has been suggested to balance robustness and fitting properties.
For a dataset of m values Y = ( t i , f i ) | t i [ a , b ] , i = 1 ,   ,   m and a generalized r-partition ( P , A , r ) , the L p -norm direct F-transform is an ( n + 2 r 2 ) -tuple of polynomials of order q 0 ( ϑ 2 r ( x ) ,   ,   ϑ n + r 1 ( x ) ) with ϑ k ( x ) = θ k , 0 + θ k , 1 ( x x k ) +     + θ k , q ( x x k ) q , k = 2 r ,   ,   n + r 1 , where this time, the q + 1 coefficients θ k , j , j = 0 , 1 ,   ,   q , are obtained by minimizing the functions Θ k ( p ) ( θ 0 ,   ,   θ q ) = i = 1 m f i θ 0 + θ 1 ( t i x k ) +     + θ q ( t i x k ) q p A k ( t i ) with respect to the parameters θ 0 ,   ,   θ q .
If q = 0 , the direct fuzzy-valued F-transform components, for a fixed p [ 1 , 2 ] , are obtained by minimizing the strictly convex functions, for k = 2 r ,   ,   1 ,   ,   n ,   ,   n + r 1 and ω [ 0 , 1 ] ,
Θ k , ω ( p ) θ = j = 1 m w j ω ; θ f j θ p A k t j
where
w j ω ; θ = ω if f j > θ 1 ω if f j θ .
The minimizers ϑ k = ϑ k ( ω ) (depending on ω ) define the L p -norm direct F-transform components for the asymmetry parameter ω [ 0 , 1 ] . We have (the proof is similar to the case p = 2 in [6])
Proposition 2.
Given the set of minimizers ϑ k ( ω ) | ω [ 0 , 1 ] of Θ k , ω ( p ) θ , consider α [ 0 , 1 ] ; then, the compact intervals
W k , α = θ k 1 2 if α = 1 θ k α 2 , θ k 1 α 2 if α [ 0 , 1 ] c l β > 0 W k , β if α = 0
define the α-cuts of a fuzzy number w k R F with membership function
w k ( x ) = sup { α | x W k , α } if x W k , 0 0 if x W k , 0 .
Definition 5.
Given a set of m points Y = { ( t i , f i ) ; t i [ a , b ] , i = 1 ,   ,   m } and a fuzzy r-partition ( P , A , r ) of [ a , b ] , the ( n + 2 r 2 )-vector of fuzzy numbers
H ( P , A , r ) = ( H 2 r ,   ,   H n + r 1 ) ,
where each fuzzy interval H k has α-cuts W k , α given by (26) in Proposition 2, is called the discrete direct L p -norm fuzzy-valued F-transform with respect to ( P , A , r ) , based on the dataset Y. The corresponding inverse L p -norm fuzzy-valued iF-transform is the fuzzy-valued function defined by
f ( P , A , r ) ( p ) x = 1 r k = 2 r n + r 1 H k A k x           for x [ a , b ] .
The α -cuts W k , α of H k will be denoted by
H k α = H k , α , H k , α + ,           k = 2 r ,   ,   n + r 1 ,           α [ 0 , 1 ]
and the α -cuts of the fuzzy-valued function f ( P , A , r ) ( p ) x , x [ a , b ] , are
f ( P , A , r ) ( p ) x α = 1 r k = 2 r n + r 1 H k , α A k x , 1 r k = 2 r n + r 1 H k , α + A k x ,           α [ 0 , 1 ] .
When α = 1 we obtain the L p -norm direct F-transform and (inverse) iF-transform function, corresponding to the core of the fuzzy-valued L p -norm iF-transform.

3. Analysis of Bitcoin Prices and Google Trends by F-Transform

To focus on the strength of fuzzy-valued L p -norm F-transform smoothing, we will apply the proposed models to the time series of Bitcoin prices, which has received much attention by regulators and investors in the last decade.
Bitcoin was released at the beginning of 2009 as a digital currency in the market; it remained under $0.20 for three years and it began to increase during the first quarter of 2013. By the end of 2017, Bitcoin was valued at nearly $18,000 per “coin”. In 2018, the price plummeted $4000 and it grew again in 2019.
The second dataset we consider is Google Trends, the search index that measures what people are currently interested in and curious about. In particular, we consider Google Trends with value 100 out of 100 meaning that trend (word Bitcoin) is on its peak in the considered time period.
Here, we work on two daily time series, as in Figure 1, from April 2013 to June 2019: Bitcoin prices (from www.blockchain.info) and Google Trends (from https://trends.google.com). The label time in the figures refers to daily observation number (t = 1 corresponds to the first observation considered); the labels Bitcoin and GT100 denote the arrays with the data.
Remark that the F-transform (direct and inverse) is linear with respect to the data-set, in particular it is homogeneous and scale invariant: we can normalize the two time series and the direct F-transform components (or the iF-transform function) are multiplied by the same factor. In this way, we can compare the F-transform results for the two series in terms of the obtained smoothing effect and by visualizing the scatter-plots of the of each series and the obtained iF-transform reconstructions.
The degree of smoothness of a given time series f t , t = 1 , 2 ,   ,   M is measured in terms of its (average) absolute variation, given by
V ( f ) = t = 1 M 1 | f t + 1 f t | M 1 ;
on the other hand, it is well known that the inverse L p -norm F-transform function, for a fixed r-partition ( P , A , r ) , allows the computation of the smoothing values
f t ( r ) = 1 r k = 2 r n + r 1 ϑ k ( t ) A k t     for   t = 1 ,   2 ,   ,   M
corresponding to the estimated (local) polynomials of order q, ϑ k ( t ) = θ k , 0 + θ k , 1 ( t x k ) +     + θ k , q ( t x k ) q , k = 2 r ,   ,   n + r 1 . The corresponding absolute variation, given by
V ( f ( r ) ) = t = 1 M 1 | f t + 1 ( r ) f t ( r ) | M 1 ,
is in general smaller than V ( f ) ; the ratio
L ( f , f ( r ) ) = V ( f ( r ) ) V ( f )
represents the proportion of absolute variation which remains in the smoothed time series f ( r ) with respect to the original data f, while 1 L ( f , f ( r ) ) is the amount of removed variation.
We have computed the L p -norm F-transforms for different values of p 1 , 1.25 , 1.5 , 1.75 , 2 and orders q 0 , 1 , 3 . The excellent performance of the smoothing based on F-transform are strongly confirmed for these two special time series, as summarized in Table 1 and Table 2.
All computations are performed with a 1-partition ( r = 1 ), a decomposition P of [ 1 , M ] into n = 326 equispaced nodes and basic functions obtained from L ( τ ) = τ 2 1 2 τ ( 1 τ ) , τ [ 0 , 1 ] . For both series, covering the same time period, we have M = 2276 so that each subinterval x k , x k + 1 , k = 1 ,   ,   n 1 of P has exactly 8 observations (13 observations internal to intervals x k 1 , x k + 1 , k = 2 ,   ,   n 1 on which basic function A k is non-zero); remark that the two series are observed all days of the week, including holidays, so that all internal nodes x k of P correspond to the same day in the week.
For the results of Table 1, the observed values are normalized in the range 0 , 1000 ; on this common range, the computed absolute variations are V ( B i t c o i n ) = 4.6354 and V ( G T 100 ) = 6.9986 and Bitcoin is 66.23 % less fluctuating (on average) than GT100. For all values of p, the reduction in total variation expressed by the ratios L ( f , f ( r ) ) , for both series, is more depending on the order q than on the used norm L p ; this is not surprising, because increasing the degree of local polynomials will reduce the average fitting errors but increase their variation.
In Table 2 the smoothing and fitting of the time series by L p -norm F-transforms are compared in terms of three well-known indices: the mean square error MSE, the mean absolute percentage error % M A E and the Kendall τ rank correlation.
Here the series are not normalized (in particular, the value of index M S E depends on the scale of the series). In all cases, the F-transform fitting for Bitcoin series has significantly smaller errors than for GT100, as demonstrated by indices % M A E and τ (the Kendall τ is always significantly positive with p-value less than 10 8 ).
For the fitting F-transform functions obtained by L p -norm with p = 1.5 and q = 0 , the scatter-plots of time series ( f t , f t r ) are pictured in Figure 2; Figure 3 plots f t and f t r with respect to time. Remark that peaks in the series tend always to be smoothed, as a characteristics of the smoothing effect produced by F-transform.
Assuming a bi-directional dependence between G T 100 and B i t c o i n time series, empirically demonstrated, e.g., in [27], and focusing on the impact of G T 100 on B i t c o i n , we want now to investigate the form of dependence by use of expectile (fuzzy-valued) F-transform; in particular, we see that while the fitting of iF-transform obtained on the totality of observed values ( B i t C o i n t , G T 100 t ) presents a high dispersion, a very big improvement in the fitting quality is obtained if F-transform is applied to clustered subsets of observed data. A relatively small number of clusters (from 20 to 24) in sufficient to obtain a fitting with correlation coefficient greater than 0.99 .
First of all, a scatter plot of the pairs is pictured in Figure 4 (the two time series are normalized in the range [ 0 , 1000 ] and GT100 appears in horizontal axis); observations in our data-set cover the cited time period 2013–2019, are concentrated on the bottom-left part of the positive quadrant and have rare points with big values in both series. At a first look, no evident functional relationships emerge from the data; they simply show a tendency to be co-monotonic, but the points are very sparse.
For a deeper analysis, we propose a model based on F-transform, relating the pairs of data ( G T 100 t , B i t C o i n t ) . F-transform is thus used to model BitCoin as a function of GT100. The L 1 -norm and L 2 -norm-based inverse iF-transforms of the data-set ( G T 100 t , B i t C o i n t ) , t = 1 , 2 ,   ,   M are computed; taking into account the sparsity of the values G T 100 t (in particular above the threshold 40), we use a non-uniform 1-partition ( P , A ) of the range [ 0 , 100 ] of the observed G T 100 t , namely the set of 25 nodes 0 , 2 , 3 , 4 , 5 , 6 , 7 , 8 , 10 , 12 , 14 , 16 , 18 , 20 , 22 , 24 , 26 , 28 , 30 , 35 , 40 , 45 , 50 , 60 , 100 , as pictured in Figure 5. The two curves give the predominant relationship between B i t C o i n and G T 100 . It is evident that both iF-transforms of B i t C o i n t are not increasing on the whole range of G T 100 (in particular, they decrease when GT100 is around 7–8, around 20 and 40). On the other hand, observing the dispersion of points in Figure 4, both iF-transforms are not good fitting of the complete data-set and it appears that the points can be “clustered”, e.g., for different levels of Bitcoin prices, and better sub-fittings can be attempted.
This can be better analyzed by applying the quantile and the expectile F-transforms to our data-set ( G T 100 t , B i t C o i n t ) , pictured in Figure 6, which shows the fuzzy-valued expectile F-transform of Bitcoin as a function of GT100 for five different α -cuts corresponding to α { 0.01 , 0.25 , 0.5 , 0.75 , 1 } (i.e., ten values ω { α 2 , 1 α 2 ; a l l α } for the asymmetry expectile parameter ω ).
We see that corresponding to different values of α , i.e., corresponding to subintervals in the range of Bitcoin prices, the relationship between our time series changes significantly.
This suggests that possibly, the clustering of the data into subsets may significantly improve the quality of fitting.
Clearly, there are several procedures and criteria to cluster the observed data ( G T 100 t , B i t C o i n t ) ; we use the well-known k-means method and the number of clusters is selected according to the silhouette measures available in MATLAB R2018b. We have performed three types of clusters, the first on the basis of variable B i t C o i n , the second using the pair ( B i t C o i n , G T 100 ) and the third using observations ( B i t C o i n , G T 100 , Δ B i t C o i n , Δ G T 100 ) where Δ is the first difference operator Δ f t = f t f t 1 . The clustering measure is the standard Euclidean distance.
Let n C l denote the number of clusters and let j = 1 , 2 ,   ,   n C l be the labels of cluster S j . Each observation ( B i t C o i n t , G T 100 t ) is assigned to cluster S c ( t ) , i.e., the observation t is assigned to cluster labelled c ( t ) 1 , 2 ,   ,   n C l .
For a given clustering, identified by clusters S 1 , S 2 ,   ,   S n C l , the L 2 -norm-based F-transform is applied (independently) on each subset of data, for j = 1 , 2 ,   ,   n C l ,
S j = ( B i t C o i n t , G T 100 t ) | c ( t ) = j .
Finally, for the observations of each cluster, the inverse iF-transform is computed and the fitted values, for each cluster, are obtained and recomposed to obtain the fitted values for the whole dataset.
If all the data are collected in a unique cluster and the F-transform is applied to the whole data-set, we obtain the fitted B i t c o i n series pictured in Figure 7: the green points give the observed B i t C o i n t , the red points are the G T 100 t series and in blue is the fitting of B i t C o i n t . We see that the fitting preserves the qualitative (gross) form of the observed Bitcoin prices, but in several portions of time period the fitting is not good.
Significant improvements are obtained by adopting the three described pre-clustering, denoted respectively by labels A, B and C.
The computations are performed on the second half portion of the time period, starting with observation time t 1121 ; the first part is less interesting because, from observation t 350 to t 1220 both time series have small variations and relatively flat curves. Without performing pre-clustering, the L 2 -norm F-transform reconstruction (of order 1) of B i t C o i n in terms of G T 100 has Kendall rank correlation τ = 0.6049 and Spearman correlation ρ = 0.7830 . We will compare τ and ρ indices as preliminary evaluation of the effect of pre-clustering on the fitting quality.
Clustering A. Clusters are based on variable B i t C o i n : the number of clusters is n C l = 20 . The L 2 -norm F-transform reconstruction (of order 1) of B i t C o i n in terms of G T 100 with pre-clustering A has much higher Kendall rank correlation τ = 0.9457 and Spearman correlation ρ = 0.9956 .
In Figure 8 we plot the observed and fitted Bitcoin series for the second half of observations, with evidence that clustering A allows a much better fitting. The 20 clusters are pictured and expanded in Figure 9 where also the sub-fittings are visible.
Clustering B. Clusters are based on both variables ( B i t C o i n , G T 100 ) : the number of clusters is n C l = 21 . The L 2 -norm F-transform reconstruction (of order 1) of B i t C o i n in terms of G T 100 with pre-clustering B has high Kendall correlation τ = 0.9447 and Spearman correlation ρ = 0.9954 , similar to clustering A.
In Figure 10 we plot the observed and fitted Bitcoin series for the second half of observations, with evidence that clustering B allows a good fitting. The 21 clusters are pictured and expanded in Figure 11 where also the sub-fittings are visible.
Clustering C. Clusters are based on variables ( B i t C o i n , G T 100 , Δ B i t C o i n , Δ G T 100 ) ; the number of clusters is n C l = 24 . The L 2 -norm F-transform reconstruction (of order 1) of B i t C o i n in terms of G T 100 with this pre-clustering has high Kendall correlation τ = 0.9183 and Spearman correlation ρ = 0.9905 , similar but not better than pre-clustering A and B.
In Figure 12 we plot the observed and fitted Bitcoin series for the second half of observations, with evidence that also clustering C allows a good fitting. The 24 clusters are pictured (blue colours) in Figure 13 and expanded in Figure 14 where also the sub-fittings are pictured.
The overall result is that pre-clustering of the data, even based on very simple clustering strategies and a relatively small number of clusters (from 20 to 24) significantly improves the fitting ability of F-transform.
It is also interesting to see that the form of relationships between B i t C o i n and G T 100 is very different for each cluster; this has important consequences on the analysis and modeling of Bitcoin time series as, in particular, it follows different paths in various sub-periods of time and in cases of rare values of the data (e.g., big values and/or big absolute changes).

4. Stylized Facts of Bitcoin Prices Identified by F-Transform Components

The empirical identification of stylized facts, emerging from the analysis of financial time series, is a common tool in data-based approaches to time series modeling (see [49]) and is consolidated by availability of large data-sets and by application of computer-intensive efficient methods for analyzing their properties. In this section, we will analyze the local F-transform components, in particular the form of polynomials φ k of L 2 -norm F-transform, of orders q = 1 and q = 2 described in Section 2.1, applied to the Bitcoin time series.
We have selected the number m of daily observations such that m 1 is a multiple of 7 ( B i t C o i n and G T 100 are observed all the days in the year): in this way, the available data are m = 2276 .
Consider an r-partition ( P , A , r ) with nodes x k , k = 1 , 2 ,   ,   n ; denoting the time points of observations simply by t = 1 , 2 ,   ,   m (or t j = j for j = 1 ,   ,   m ) we consider two uniform ( P , A , r ) :
( P a )—a dense partition with n = m and x k = k (i.e., each observation is a node) and the bandwidth r is chosen such that each open interval I k ( r ) = [ x k r , x k + r ] contains (internally) a prescribed sufficient number of data. E.g., with r = 4 each I k ( r ) contains all data of the week centred at x k ; with r = 7 , I k ( r ) contains the two weeks ending and starting with x k .
( P b )—a sparse partition with n = 326 and x k = 1 + 7 ( k 1 ) (i.e., there is a node every 7 observations) and the bandwidth r = 3 is chosen. The (direct) F-transform components will span 21 observed values on each side (left or right) of the nodes.

4.1. F-Transform Fitting with Dense r-Partition

In the dense partition case, we obtain the best L 2 -norm F-transform components φ k ( t ) associated with all observations (indeed, x k = k corresponds to all observed times k = 1 ,   ,   n = m ); in this way, we are able to estimate the local trend around every observation and se can follow the time evolution of trends by plotting φ k ( t ) around x k on subintervals I k ( r ) (see Figure 15 and Figure 16).
On the other hand, if we translate vertically the polynomials φ k ( t ) , the polynomials φ ^ k ( t ) = φ k ( t ) ϕ k , 0 are such that φ ^ k ( x k ) = 0 for all k and, if q > 0 , we can cluster the φ ^ k by clustering the set of vectors ( ϕ 1 , k ,   ,   ϕ q , k ) of the estimated coefficients. If q = 1 we obtain a set of lines through the origin with different slopes (in terms of a single variable ϕ 1 , k ); if q = 2 we obtain a set of parabolic functions through the origin, in terms of two variables ϕ 1 , k and ϕ 2 , k .
Using the k-means clustering method with the Euclidean distance and testing the number of clusters using the silhouette values, we have that the best number of clusters is n C l = 9 when q = 1 and n C l = 15 when q = 2 .
The interpretation of n C l = 9 clusters (characterized by variable ϕ 1 , k ) is interesting, because we have a central cluster of local trends with slope around 0 and other eight clusters characterized by slopes ranging from very negative values (cluster 1) to intermediate negative values (cluster 3) up to intermediate positive (cluster 7) to very positive slopes (cluster 9).
Analogous interpretation is applied to the case of n C l = 15 clusters (and q = 2 ), where again cluster 1 corresponds to the most negative slope, cluster 8 to almost zero slope and cluster 15 to the most positive slope; clearly, also the degrees of concavity and convexity (represented by the second variable ϕ 2 , k ) are taken into account in this case.
Figure 17 pictures, for each cluster S c ( 1 ) , c = 1 , 2 ,   ,   9 , the first order shifted polynomials φ ^ k ( t ) = ϕ 1 , k ( t x k ) assigned to S c ( 1 ) (red colors) and the centroid polynomial (blue color) obtained by averaging the parameters ϕ 1 , k .
Similarly, Figure 18 pictures, for each cluster S c ( 2 ) , c = 1 , 2 ,   ,   15 , the second order shifted polynomials ϕ 1 , k ( t x k ) + ϕ 2 , k ( t x k ) 2 assigned to S c ( 2 ) (red colors) and the centroid polynomial (blue color) obtained by averaging the pairs of parameters ( ϕ 1 , k , ϕ 2 , k ) .
As we have said, the n C l centroid polynomials, identified by averaging the parameters of all elements assigned to each cluster S c ( 1 ) or S c ( 2 ) , can be considered to be the stylized forms of the local trends. If we identify each estimated trend by the centroid of its cluster, we then have n C l stylized forms, one for each cluster, that form the possible typical trends around the observed points of the time series.
As a last step, we can produce a simple analysis of how good the stylized trends represent the effective observations: let’s denote by f ( P a , A , r ) q ( t j ) the standard inverse iF-transform values at times t j obtained with estimated direct F-transform (polynomial) components φ k ( t ) , i.e.,
f ( P a , A , r ) q ( t ) = 1 r k = 2 r n + r 1 φ k ( t ) A k ( t ) ;
denote by f c , ( P a , A , r ) q ( t j ) the analogous expression obtained by substituting each local polynomial φ k ( t ) by φ ^ k ( t ) = ϕ 0 , k + ϕ ^ 1 , k ( t x k ) +     + ϕ ^ q , k ( t x k ) q (here q = 1 or q = 2 ), where the parameters ϕ ^ 1 , k or the pairs ( ϕ ^ 1 , k , ϕ ^ 2 , k ) are the ones that identify the centroid of cluster containing time k, i.e., if an observation belongs to cluster S c we substitute the computed local trend with the local trend of the corresponding centroid:
f c , ( P a , A , r ) q ( t ) = 1 r k = 2 r n + r 1 φ ^ k ( t ) A k ( t ) .
Essentially, we identify the elements of each cluster by its centroid and we estimate its goodness in terms of the vicinity between the modified version f c , ( P a , A , r ) q ( t ) at times t j = 1 ,   ,   M and the observed data f j = B i t C o i n j . In Figure 19 and Figure 20 the data f j (green colors) and f c , ( P a , A , r ) q ( t j ) (black colors) are plotted for all the data with r = 4 and r = 7 , respectively; remark in particular that the iF-transform values and the modified values have a very high correlation and the two values are very near each other on the whole range of small and big values of observed prices.
Finally, it is interesting to observe the time evolution of the different clusters (see Figure 21).
Remark that in the first part of the time series, the data (i.e., the local trends) persist into the quasi-zero slope (quasi-constant time series); after observation 1400, frequent changes in the local trends appear evident, but changes seem to be gradual from one class to a near one and only rarely the local trends jump from a form to a very different one. This appears clearly from the transition matrix P = p r o b ( i , j ) , i , j = 1 , 2 ,   ,   9 given below ( p r o b ( i , j ) is the probability that local trend of cluster i moves to class j): we see that matrix P is essentially tridiagonal.
P 123456789
10.570.290.14000000
20.080.440.4400.040000
30.020.120.390.250.120.030.050.020
400.020.100.500.310.040.010.010
50000.030.930.03000
6000.10.040.320.530.100.010
70000.030.040.270.560.100
800.050.090.0500.090.180.500.05
900000000.500.50

4.2. F-Transform Fitting with Sparse r-Partition

The results obtained with the dense r-partition P a are confirmed when using the sparse r-partition P b . The computations are performed only for the bandwidth r = 3 , corresponding to n = 326 nodes of P b and 42 = 2 7 3 data belonging to each interval x k r , x k + r .
Figure 22 and Figure 23 plot the local trends of orders q = 1 and q = 2 , respectively (we have plotted only the second part of the Bitcoin time series).
The local trends are clustered into n C l = 9 groups when q = 1 and n C l = 15 groups when q = 2 , pictured in Figure 24 and Figure 25. It appears that , for q = 1 , clusters 8 and 9 (and clusters 1, 13, 14, 15, when q = 2 ) contain very few elements and possibly, in this sparse case, the number of clusters should be reduced to 7 and 11.
The substitution of estimated local trends with the ones obtained from centroids of each cluster, produces the smooth reconstructions represented in Figure 26 and Figure 27, respectively; the scatter-plots of the data and their smoothing with standard F-transform and the modified versions, are plotted in Figure 28 and Figure 29. We remark that corresponding to a stronger smoothing effect obtained with a smaller number of nodes in the decomposition (now it is sparse) the quality of the fitting is reduced.

5. Forecasting Bitcoin Prices with Gt100 Index

As described in the Introduction, there is empirical evidence that causal relationship between Bitcoin prices/returns and Google Trend scores is bi-directional and, we expect, this will be useful in designing short-term models that relate B i t C o i n ( t ) to G T 100 ( t l ) for small values of lag l 1 . Clearly, as illustrated in sections three and four, the type of functional relationship will change with time and, in particular, the form of local polynomials is expected to persist only for short times around the actual time t and we can estimate their form (coefficients) from the data up to the last available observations.
In the setting of L p -norm F-transform, we suggest using the (polynomial) direct F-transform components ϑ k such as
ϑ k ( t ) = θ k , 0 + θ k , 1 ( t x k ) +     + θ k , q ( t x k ) q + θ k , q + 1 g 1 ( t ) + θ k , q + 2 g 2 ( t ) +     + θ k , q + s g s ( t )
where ϑ k , k =   ,   N 2 , N 1 , N is the k-th local trend function (k-th direct F-transform component) and g 1 ( t ) ,   ,   g s ( t ) are delayed versions of the Google Trends G T 100 ( t ) and/or B i t C o i n ( t ) series.
We are interested to a forecasting model by which, with available observations of B i t C o i n ( t ) and G T 100 ( t ) at times t =   ,   T 2 , T 1 , T up to time T, we like to construct a forecast of B i t C o i n ( T + l ) for l steps ahead. To do this, we estimate the direct F-transform components (39) with appropriate q, s and values g 1 ( t l ) ,   ,   g s ( t l ) , obtained from observed values of B i t c o i n ( t l ) and/or G T 100 ( t l ) for t =   ,   T 2 , T 1 , T ; then, e.g., using the last estimated trend function ϑ N ( t ) we approximate B i t C o i n ( T + l ) with ϑ N ( T + l ) . This is always possible if the fuzzy r-partition is such that T + l T = x N , x N + r . Alternatively, if r > 1 , we can approximate B i t C o i n ( T + l ) by computing the inverse F-transform at time T + l as the combination of local trends ϑ k ( T + l ) that have positive weights A k ( T + l ) , the basic functions active at time T + l .
Clearly, this construction may be good and reasonable only for short-term forecasting. In our experiments we have used the first approximation only to forecast Bitcoin prices B i t C o i n ( T + l ) with l 1 , 2 , 3 , 4 , 5 . The reported results are given for the last 1200 values of the available time series. We have used q 0 , 1 , 2 , r 1 , 2 and two cases of functions g j ( t ) :
Model A: s = 1 and g 1 ( t ) = G T 100 ( t ) ;
Model B: s = 2 and g 1 ( t ) = G T 100 ( t ) , g 2 ( t ) = B i t C o i n ( t 1 ) , i.e., by adding an autoregressive term.
Our simple forecasting model is obtained by the following three steps:
Step 1. We start with one of the fitting model obtained in the analysis in previous sections and we chose the pair of values ( r , m r ) where r 1 is the bandwidth of the r-partition and the associated value m r is the number of time observations used to estimate the parameters of local trend functions ϑ k ( t ) . We have found that m r observations on each subinterval of the partition, in the range m r = 11 ,   ,   25 , produces in general the best fitting results: two or three weeks of data are sufficient to obtain the forecast.
Step 2. The parameters in ϑ k ( t ) are estimated using the Lp-norm-based criterion: we assume p = 1.5 as a good intermediate value between quantile ( p = 1 ) and expectile ( p = 2 ) estimators.
Step 3. Each l-steps ahead forecast value f T + l for the last more recent 1200 available observations ending at time T f i n a l = 2242 is obtained from ϑ N ( t ) , where N = 2 + r is the number on intervals in the partition covering m r data that terminate at time T: forecast f T + l is then estimated from data B i t C o i n t , t = T m r + 1 ,   ,   T , G T 100 t , t = T m r l + 1 ,   ,   T l for g 1 ( t l ) and, in Model B, B i t C o i n t , t = T m r l + 1 ,   ,   T l for g 1 ( t l ) . In this way, we can compute f T + l = ϑ N ( T + l ) as the needed values g 1 ( T + l ) = G T 100 T and g 2 ( T + l ) = B i t C o i n T are available from observations.
For the fitting (i.e., with l = 0 ) and forecasting models (with l > 0 ) we report the mean square error ( M S E = 1 m i = 1 m ( f i f i r ) 2 ), the mean absolute percentage error ( % M A E = 100 m i = 1 m f i f i r f i ) and the well-known Kendall τ rank correlation (a measure of ordinal association between f i and f i r ).
We see from Table 3 that the fitting of Bitcoin and GT100 time series become significantly better by increasing the order q; it is also interesting to remark that GT100 fitting is much less precise that Bitcoin fitting (with the same p and q). On the other hand, polynomials of orders q > 1 are not useful for extrapolation as they tend to be highly oscillating for a lag l > 1 .
For these reasons, only three pairs ( p , q ) as in Table 4 and Table 5 are considered and we see that for forecasting, q = 1 gives good results for small lag l while q = 0 is better for higher lags.
The forecast Bitcoin time series for lags l = 1 , 2 obtained with model A are pictured in Figure 30; for lags l = 4 , 5 and model B are plotted in Figure 31.
To conclude this section on forecasting Bitcoin time series, we shortly explore the statistical significance of the proposed models A and B. Following some ideas in [28], we compare our forecast estimates with the so-called random walk model, assumed as a benchmark for single step forecasting, i.e., with lag l = 1 . Defining the return series as f R e t t = l o g ( f S e r t f S e r t 1 ) , the random walk forecast r ^ T of the returns at time T is defined in terms of a fixed time horizon of s observations f R e t t , t = T s + 1 ,   ,   T ending at time T by
R e t t = r ^ T + ε t ,       ε t I D ( 0 , σ T ) ,       t = T s + 1 ,   ,   T .
Then, considering that the return values f R e t t are known up to time T, the random walk forecast of f R e t at time T + 1 is simply the average r ^ T = 1 s t = T s + 1 T R e t t , we can estimate the random walk forecast of f S e r T + 1 from the definition of return at time T + 1 and obtain f T + 1 r w = e x p ( r ^ T ) f S e r T . This calculations are performed for T being each of the last 730 available observations (two years).
As in [28], the Diebold–Mariano (DM) test statistics is applied to test the significance of the M S E measures for our forecasts (with models A and B) in comparison with the random walk forecast. The results are reported in Table 6.
Table 6 (in particular the small p-value) confirm that Bitcoin prices can be forecasted using F-transform of order q = 1 , as indeed both (simple) short-term models A and B outperform the random walk model.

6. Final Comments and Conclusions

In this paper, we apply the F-transform setting to analyze Bitcoin and the associated Google Trend time series. The direct and inverse F-transforms provide flexible and highly adaptive non-parametric smoothing in data analysis and they are adopted to develop a quantitative approach to sentiment analysis; considering empirical evidence, demonstrated in recent literature, that there is a bi-directional causal relationship between the two time series, we suggest a model to evaluate the dependence of Bitcoin prices on Google Trends scores and we show how and at which extend it is useful for short-term forecasting.
This research topic has rapidly increased in recent years and however deserves more investigation. Thanks to the high flexibility of smoothing techniques and modeling based on F-transform, we show that the web interest (querying) in Bitcoin phenomenon has an influence on the values of Bitcoin prices; the type and form of relationship has an essentially local nature as it may change from one period to the other. Remark that the reverse relationship remains important, but its interest is clearly less interesting.
In Section 3 and Section 4 the two time series are deeply analyzed in terms of non-parametric smoothing techniques, different clustering methodologies and efficient stylized fact research. Our results confirm the general hypothesis that short-term local trends characterize the Bitcoin time series, including the possibility to forecast its values at least for small steps ahead (from 1 to 5, daily, in our study).
Future research directions include improved clustering models identified by specific forms of local trends such as, e.g., exponential functions or other dependencies more suitable and stable for extrapolation than polynomials.
In the paper we do not argue about the superiority of forecasting based on F-transform with respect to many other methodologies; however, a comparison may be the opportunity to better investigate the theoretical properties in future research.

Author Contributions

The authors contributed equally in writing the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Acknowledgments

The authors would like to thank the editors and the anonymous reviewers for their meaningful and constructive suggestions that have led to the present improved version of the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Perfilieva, I. Fuzzy Transforms: Theory and Applications. Fuzzy Sets Syst. 2006, 157, 993–1023. [Google Scholar] [CrossRef]
  2. Perfilieva, I. Fuzzy Transforms: A Challenge to conventional transforms. In Advances in Images and Electron Physics; Hawkes, P.W., Ed.; Elsevier Academic Press: Cambridge, MA, USA, 2007; Volume 147, pp. 137–196. [Google Scholar]
  3. Perfilieva, I.; Novák, V.; Dvorak, A. Fuzzy transform in the analysis of data. Int. J. Approx. Reason. 2008, 48, 36–46. [Google Scholar] [CrossRef] [Green Version]
  4. Perfilieva, I. F-Transform. In Springer Handbook of Computational Intelligence; Kacprzyk, J., Pedrycz, W., Eds.; Springer: Berlin, Germany, 2015; Chapter 7; pp. 113–130. [Google Scholar]
  5. Kreinovich, V.; Kosheleva, O.; Sriboonchitta, S. Why Use a Fuzzy Partition in F-Transform? Axioms 2019, 8, 94. [Google Scholar] [CrossRef] [Green Version]
  6. Guerra, M.L.; Sorini, L.; Stefanini, L. Quantile and Expectile Smoothing based on L1-norm and L2-norm F-transforms. Int. J. Approx. Reason. 2019, 107, 17–43. [Google Scholar] [CrossRef]
  7. Guerra, M.L.; Sorini, L.; Stefanini, L. On the approximation of a membership function by empirical quantile functions. Int. J. Approx. Reason. 2020, 124, 133–146. [Google Scholar] [CrossRef]
  8. Bouri, E.; Gupta, R.; Tiwari, A.K.; Roubaud, D. Does Bitcoin hedge global uncertainty? Evidence from wavelet-based quantile-in-quantile regressions. Financ. Res. Lett. 2017, 23, 87–95. [Google Scholar] [CrossRef] [Green Version]
  9. Bellini, F.; Klar, B.; Muller, A.; Gianin, E.R. Generalized quantiles as risk measures. Insur. Math. Econ. 2014, 54, 41–48. [Google Scholar] [CrossRef]
  10. Bellini, F.; Bignozzi, V. On elicitable risk measures. Quant. Financ. 2015, 15, 725–733. [Google Scholar] [CrossRef]
  11. Daouia, A.; Girard, S.; Stupfler, G. Extreme M-quantiles as risk measures: From L1 to Lp optimization. Bernoulli 2019, 25, 264–309. [Google Scholar] [CrossRef] [Green Version]
  12. Bariviera, A.F.; Basgall, M.J.; Hasperuéb, W.; Naiouf, M. Some stylized facts of the Bitcoin market. Phys. A 2017, 484, 82–90. [Google Scholar] [CrossRef] [Green Version]
  13. Yermack, D. Is Bitcoin a Real Currency? An Economic Appraisal. In Handbook of Digital Currency: Bitcoin, Innovation, Financial Instruments, and Big Data; Chuen, D.L.K., Ed.; Academic Press: Cambridge, MA, USA, 2015; Chapter 2; pp. 113–130. [Google Scholar]
  14. Demir, E.; Gozgor, G.; Lau, C.K.M.; Vigne, S.A. Does economic policy uncertainty predict the Bitcoin returns? An empirical investigation. Financ. Res. Lett. 2018, 26, 145–149. [Google Scholar] [CrossRef] [Green Version]
  15. Corbet, S.; Lucey, B.; Urquhart, A.; Yarovaya, L. Cryptocurrencies as a financial asset: A systematic analysis. Int. Financ. Anal. 2019, 62, 182–199. [Google Scholar] [CrossRef] [Green Version]
  16. Gronwald, M. Is Bitcoin a Commodity? On price jumps, demand shocks,and certainty of supply. J. Int. Money Finance 2019, 97, 86–92. [Google Scholar] [CrossRef]
  17. Aharon, D.Y.; Qadan, M. Bitcoin and the day-of-the-week effect. Financ. Res. Lett. 2019, 31, 415–424. [Google Scholar] [CrossRef]
  18. Qin, M.; Su, C.; Tao, R. A new basket for eggs? Econ. Model. 2020. [Google Scholar] [CrossRef]
  19. Bouri, E.; Gupta, R. Predicting Bitcoin returns: Comparing the roles of newspaper- and internet search-based measures of uncertainty. Financ. Res. Lett. 2019. [Google Scholar] [CrossRef] [Green Version]
  20. Baiga, A.; Blaub, B.M.; Sabaha, N. Price clustering and sentiment in bitcoin. Financ. Res. Lett. 2019, 29, 111–116. [Google Scholar] [CrossRef]
  21. Zhang, W.; Wanga, P.; Li, X.; Shen, D. Quantifying the cross-correlations between online searches and Bitcoin market. Phys. A 2018, 509, 657–672. [Google Scholar] [CrossRef]
  22. Kristoufek, L. Power-law correlations in finance-related Google searches and their cross-correlations with volatility and traded volume: Evidence from the Dow Jones Industrial components. Phys. A 2015, 428, 194–205. [Google Scholar] [CrossRef] [Green Version]
  23. Eom, C.; Kaizoji, T.; Kang, S.H.; Pichl, L. Bitcoin and investor sentiment: Statistical characteristics and predictability. Phys. A 2019, 514, 511–521. [Google Scholar] [CrossRef]
  24. Karalevicius, V.; Degrande, N.; Weerdt, J.D. Using sentiment analysis to predict interday Bitcoin price movements. J. Risk Financ. 2018, 19, 56–75. [Google Scholar] [CrossRef]
  25. Kristoufek, L. BitCoin meets Google Trends and Wikipedia: Quantifying the relationship between phenomena in the Internet era. Sci. Rep. 2013, 3, 1–7. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Stolarski, P.; Lewoniewski, W.; Abramowicz, W. Chryptocurrencies Perceptions Using Wikipedia and Google Trends. Information 2020, 11, 234. [Google Scholar] [CrossRef]
  27. Dastgir, S.; Demir, E.; Downing, G.; Gozgor, G.; Lau, C.K.M. The causal relationship between Bitcoin attention and Bitcoin returns: Evidence from the Copula-based Granger causality test. Financ. Res. Lett. 2019, 28, 160–164. [Google Scholar] [CrossRef]
  28. Adcock, R.; Gradojevic, N. Non-fundamental, non-parametric Bitcoin forecasting. Phys. Stat. Mech. Its Appl. 2019, 531, 121727. [Google Scholar] [CrossRef]
  29. Atsalakis, G.S.; Atsalaki, I.G.; Pasiouras, F.; Zopounidis, C. Bitcoin price forecasting with neuro-fuzzy techniques. Eur. J. Oper. Res. 2019, 276, 770–780. [Google Scholar] [CrossRef]
  30. Trucios, C. Forecasting Bitcoin risk measures: A robust approach. Int. J. Forecast. 2019, 35, 836–847. [Google Scholar] [CrossRef]
  31. Kyriazis, N.A. A Survey on Empirical Findings about Spillovers in cryptocurrency Markets. J. Risk Financ. Manag. 2019, 12, 170. [Google Scholar] [CrossRef] [Green Version]
  32. Bohte, R.; Rossini, L. Comparing the Forecasting of cryptocurrencies by Bayesian Time-Varying Volatility Models. J. Risk Financ. Manag. 2019, 12, 150. [Google Scholar] [CrossRef] [Green Version]
  33. Xie, T. Forecast Bitcoin Volatility with Least Squares Model Averaging. Econometrics 2019, 7, 40. [Google Scholar] [CrossRef] [Green Version]
  34. Cretarola, A.; Figà-Talamanca, G.; Patacca, M. Market Attention and Bitcoin Price Modeling: Theory, Estimation and Option Pricing. 2017. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3042029 (accessed on 8 May 2019).
  35. Cretarola, A.; Figà-Talamanca, G.; Patacca, M. A continuous time model for bitcoin price dynamics. In Mathematical and Statistical Methods for Actuarial Sciences and Finance; Springer: Berlin, Germany, 2018; pp. 273–277. [Google Scholar]
  36. Chaim, P.; Laurini, M.P. Is Bitcoin a bubble? Phys. A 2019, 517, 222–232. [Google Scholar] [CrossRef]
  37. Panagiotidis, T.; Stengos, T.; Vravosinos, O. A Principal Component-Guided Sparse Regression Approach for the Determination of Bitcoin Returns. J. Risk Financ. Manag. 2020, 13, 33. [Google Scholar] [CrossRef] [Green Version]
  38. Moosa, I.A. The bitcoin: A sparkling bubble or price discovery? J. Ind. Bus. Econ. 2020, 47, 93–113. [Google Scholar] [CrossRef]
  39. Catania, L.; Grassi, S.; Ravazzolo, F. Forecasting cryptocurrencies under model and parameter instability. Int. J. Forecast. 2019, 35, 485–501. [Google Scholar] [CrossRef]
  40. Ji, S.; Kim, J.; Im, Ḣ. A Comparative Study of Bitcoin Price Prediction Using Deep Learning. Mathematics 2019, 7, 898. [Google Scholar] [CrossRef] [Green Version]
  41. Stavroyiannis, S.; Babalos, V.; Bekiros, S.; Lahmiri, S.; Uddin, G.S. The high frequency multifractal properties of Bitcoin. Phys. A 2019, 520, 62–71. [Google Scholar] [CrossRef]
  42. Yu, M.; Gao, R.; Su, X.; Jin, X.; Zhang, H.; Song, J. Forecasting Bitcoin volatility: The role of leverage effect and uncertainty. Phys. A 2019. [Google Scholar] [CrossRef]
  43. Zargar, F.N.; Kumar, D. Long range dependence in the Bitcoin market: A study based on high-frequency data. Phys. A 2019, 515, 625–640. [Google Scholar] [CrossRef]
  44. Zhang, Y.; Chan, S.; Chu, J.; Nadarajah, S. Stylised facts for high frequency cryptocurrency data. Phys. A 2018, 513, 598–612. [Google Scholar] [CrossRef]
  45. Takaishi, T. Statistical properties and multifractality of Bitcoin. Phys. Stat. Mech. Its Appl. 2018, 506, 507–519. [Google Scholar] [CrossRef] [Green Version]
  46. Da Silva Filho, A.C.; Maganini, N.D.; de Almeida, E.F. Multifractal analysis of Bitcoin market. Phys. A 2018, 512, 954–967. [Google Scholar] [CrossRef]
  47. Guerra, M.L.; Stefanini, L. Expectile smoothing of time series using F-transform. In Proceedings of the 8th Conference of the European Society for Fuzzy Logic and Technology (EUSFLAT 2013), Milan, Italy, 9–13 September 2013; pp. 559–564, ISBN 978-162993219-4. [Google Scholar]
  48. Coroianu, L.; Stefanini, L. Properties of fuzzy transform obtained from Lp-minimization and a connection with Zadeh extension principle. Inf. Sci. 2019, 478, 331–354. [Google Scholar] [CrossRef]
  49. Cont, R. Empirical properties of asset returns: Stylized facts and statistical issues. Quant. Financ. 2001, 1, 223–236. [Google Scholar] [CrossRef]
Figure 1. Daily Bitcoin prices (blue color, left scale) and daily Google Trends series (red color, right scale) from 28 April 2013 to 17 June 2019.
Figure 1. Daily Bitcoin prices (blue color, left scale) and daily Google Trends series (red color, right scale) from 28 April 2013 to 17 June 2019.
Axioms 09 00139 g001
Figure 2. Scatter-plots of ( f t , f t r ) for daily Bitcoin prices (left picture) and daily GT100 trends (right), from April 2013 to June 2019.
Figure 2. Scatter-plots of ( f t , f t r ) for daily Bitcoin prices (left picture) and daily GT100 trends (right), from April 2013 to June 2019.
Axioms 09 00139 g002
Figure 3. L p -based smoothing for Bitcoin (top picture, blue points) and GT100 (bottom, blue points) series, obtained with p = 1.5 and q = 0 . The green curves plot the observed series.
Figure 3. L p -based smoothing for Bitcoin (top picture, blue points) and GT100 (bottom, blue points) series, obtained with p = 1.5 and q = 0 . The green curves plot the observed series.
Axioms 09 00139 g003
Figure 4. Scatterplot ( G T 100 t , B i t C o i n t ) of daily Bitcoin prices vs daily GT100 series. For this visualization, the two time series are normalized to the common range [ 0 , 1000 ] .
Figure 4. Scatterplot ( G T 100 t , B i t C o i n t ) of daily Bitcoin prices vs daily GT100 series. For this visualization, the two time series are normalized to the common range [ 0 , 1000 ] .
Axioms 09 00139 g004
Figure 5. The figure shows (inverse) L 1 -norm (on top) and L 2 -norm (on bottom) iF-transform functions obtained for the observations ( G T 100 t , B i t C o i n t ) ; here, the daily Bitcoin prices are considered to be functions in the domain [ 0 , 100 ] of GT100.
Figure 5. The figure shows (inverse) L 1 -norm (on top) and L 2 -norm (on bottom) iF-transform functions obtained for the observations ( G T 100 t , B i t C o i n t ) ; here, the daily Bitcoin prices are considered to be functions in the domain [ 0 , 100 ] of GT100.
Axioms 09 00139 g005
Figure 6. Fuzzy-valued Espectile F-transform function of the data-set ( B i t C o i n t , G T 100 t ) ; the five computed α -cuts correspond to α { 0.01 , 0.25 , 0.5 , 0.75 , 1 } .
Figure 6. Fuzzy-valued Espectile F-transform function of the data-set ( B i t C o i n t , G T 100 t ) ; the five computed α -cuts correspond to α { 0.01 , 0.25 , 0.5 , 0.75 , 1 } .
Axioms 09 00139 g006
Figure 7. The fitted B i t C o i n time series (blue color) is obtained from L 2 -norm iF-transform function applied to the data-set ( G T 100 t , B i t C o i n t ) ; the observed B i t C o i n t values are green and the observed G T 100 t values are red.
Figure 7. The fitted B i t C o i n time series (blue color) is obtained from L 2 -norm iF-transform function applied to the data-set ( G T 100 t , B i t C o i n t ) ; the observed B i t C o i n t values are green and the observed G T 100 t values are red.
Axioms 09 00139 g007
Figure 8. Clustering A: the fitted B i t C o i n time series (blue colour) is obtained from L 2 -norm iF-transform function applied to the data-set ( G T 100 t , B i t C o i n t ) ; the observed B i t C o i n t values are green and the observed G T 100 t values are red.
Figure 8. Clustering A: the fitted B i t C o i n time series (blue colour) is obtained from L 2 -norm iF-transform function applied to the data-set ( G T 100 t , B i t C o i n t ) ; the observed B i t C o i n t values are green and the observed G T 100 t values are red.
Axioms 09 00139 g008
Figure 9. Clustering A: for each cluster, the fitted B i t C o i n subseries (blue colour) are obtained from L 2 -norm iF-transform function applied to the subset of data assigned to each cluster; the observed ( G T 100 t , B i t C o i n t ) values are green.
Figure 9. Clustering A: for each cluster, the fitted B i t C o i n subseries (blue colour) are obtained from L 2 -norm iF-transform function applied to the subset of data assigned to each cluster; the observed ( G T 100 t , B i t C o i n t ) values are green.
Axioms 09 00139 g009
Figure 10. Clustering B: the fitted B i t C o i n time series (blue colour) is obtained from L 2 -norm iF-transform function applied to the data-set ( G T 100 t , B i t C o i n t ) ; the observed B i t C o i n t values are green and the observed G T 100 t values are red.
Figure 10. Clustering B: the fitted B i t C o i n time series (blue colour) is obtained from L 2 -norm iF-transform function applied to the data-set ( G T 100 t , B i t C o i n t ) ; the observed B i t C o i n t values are green and the observed G T 100 t values are red.
Axioms 09 00139 g010
Figure 11. Clustering B: the fitted B i t C o i n time series is in blue colour; the observed B i t C o i n t values are green.
Figure 11. Clustering B: the fitted B i t C o i n time series is in blue colour; the observed B i t C o i n t values are green.
Axioms 09 00139 g011
Figure 12. Clustering C: the fitted B i t c o i n time series (blue color) is obtained from L 2 -norm iF-transform function applied to the data-set ( G T 100 t , B i t c o i n t ) ; the observed B i t c o i n t values are green and the observed G T 100 t values are red.
Figure 12. Clustering C: the fitted B i t c o i n time series (blue color) is obtained from L 2 -norm iF-transform function applied to the data-set ( G T 100 t , B i t c o i n t ) ; the observed B i t c o i n t values are green and the observed G T 100 t values are red.
Axioms 09 00139 g012
Figure 13. The blue points correspond to the observations ( G T 100 t , B i t C o i n t ) assigned to each cluster by clustering C. The red points give the centroid values relative to two variables G T 100 and B i t C o i n .
Figure 13. The blue points correspond to the observations ( G T 100 t , B i t C o i n t ) assigned to each cluster by clustering C. The red points give the centroid values relative to two variables G T 100 and B i t C o i n .
Axioms 09 00139 g013
Figure 14. The fitted B i t C o i n time series (blue color) is obtained from L 2 -norm iF-transform function applied to the data-set ( G T 100 t , B i t C o i n t ) ; the observed B i t C o i n t values are green and the observed G T 100 t values are red.
Figure 14. The fitted B i t C o i n time series (blue color) is obtained from L 2 -norm iF-transform function applied to the data-set ( G T 100 t , B i t C o i n t ) ; the observed B i t C o i n t values are green and the observed G T 100 t values are red.
Axioms 09 00139 g014
Figure 15. L 2 -norm F-transform components of order 1 for Bitcoin series. The fuzzy r-partition is P a with r = 4 (top picture) and r = 7 (bottom).
Figure 15. L 2 -norm F-transform components of order 1 for Bitcoin series. The fuzzy r-partition is P a with r = 4 (top picture) and r = 7 (bottom).
Axioms 09 00139 g015
Figure 16. L 2 -norm F-transform components of order 2 for Bitcoin series. The fuzzy r-partition is P a with r = 4 (top picture) and r = 7 (bottom).
Figure 16. L 2 -norm F-transform components of order 2 for Bitcoin series. The fuzzy r-partition is P a with r = 4 (top picture) and r = 7 (bottom).
Axioms 09 00139 g016
Figure 17. Clustering of L 2 -norm F-transform components (polynomials) of order 1 for Bitcoin series. The clusters correspond to the fuzzy r-partition P a with r = 4 (bottom picture) and r = 7 (top).
Figure 17. Clustering of L 2 -norm F-transform components (polynomials) of order 1 for Bitcoin series. The clusters correspond to the fuzzy r-partition P a with r = 4 (bottom picture) and r = 7 (top).
Axioms 09 00139 g017
Figure 18. Clustering of L 2 -norm F-transform components (polynomials) of order 2 for Bitcoin series. The clusters correspond to the fuzzy r-partition P a with r = 4 (bottom picture) and r = 7 (top).
Figure 18. Clustering of L 2 -norm F-transform components (polynomials) of order 2 for Bitcoin series. The clusters correspond to the fuzzy r-partition P a with r = 4 (bottom picture) and r = 7 (top).
Axioms 09 00139 g018
Figure 19. L 2 -norm modified iF-transform values f c , ( P a , A , r ) q ( t j ) (top: q = 1 , bottom: q = 2 ) for Bitcoin series. The fuzzy r-partition is P a with r = 4 .
Figure 19. L 2 -norm modified iF-transform values f c , ( P a , A , r ) q ( t j ) (top: q = 1 , bottom: q = 2 ) for Bitcoin series. The fuzzy r-partition is P a with r = 4 .
Axioms 09 00139 g019
Figure 20. L 2 -norm modified iF-transform values f c , ( P a , A , r ) q ( t j ) (top: q = 1 , bottom: q = 2 ) for Bitcoin series. The fuzzy r-partition is P a with r = 7 .
Figure 20. L 2 -norm modified iF-transform values f c , ( P a , A , r ) q ( t j ) (top: q = 1 , bottom: q = 2 ) for Bitcoin series. The fuzzy r-partition is P a with r = 7 .
Axioms 09 00139 g020
Figure 21. Time evolution of clusters obtained with the L 2 -norm F-transform smoothing of order 1 for Bitcoin series. The fuzzy r-partition is P a with r = 4 .
Figure 21. Time evolution of clusters obtained with the L 2 -norm F-transform smoothing of order 1 for Bitcoin series. The fuzzy r-partition is P a with r = 4 .
Axioms 09 00139 g021
Figure 22. L 2 -norm F-transform components of order 1 for Bitcoin series. The fuzzy r-partition is P b with r = 3 .
Figure 22. L 2 -norm F-transform components of order 1 for Bitcoin series. The fuzzy r-partition is P b with r = 3 .
Axioms 09 00139 g022
Figure 23. L 2 -norm F-transform components of order 2 for Bitcoin series. The fuzzy r-partition is P b with r = 3 .
Figure 23. L 2 -norm F-transform components of order 2 for Bitcoin series. The fuzzy r-partition is P b with r = 3 .
Axioms 09 00139 g023
Figure 24. Clustering of L 2 -norm F-transform components (polynomials) of order 1 for Bitcoin series. The fuzzy r-partition is P b with r = 3 .
Figure 24. Clustering of L 2 -norm F-transform components (polynomials) of order 1 for Bitcoin series. The fuzzy r-partition is P b with r = 3 .
Axioms 09 00139 g024
Figure 25. Clustering of L 2 -norm F-transform components (polynomials) of order 2 for Bitcoin series. The fuzzy r-partition is P b with r = 3 .
Figure 25. Clustering of L 2 -norm F-transform components (polynomials) of order 2 for Bitcoin series. The fuzzy r-partition is P b with r = 3 .
Axioms 09 00139 g025
Figure 26. L 2 -norm modified iF-transform values f c , ( P b , A , r ) q ( t j ) for Bitcoin series. Here, q = 1 and the fuzzy r-partition is P b with r = 3 .
Figure 26. L 2 -norm modified iF-transform values f c , ( P b , A , r ) q ( t j ) for Bitcoin series. Here, q = 1 and the fuzzy r-partition is P b with r = 3 .
Axioms 09 00139 g026
Figure 27. L 2 -norm modified iF-transform values f c , ( P b , A , r ) q ( t j ) for Bitcoin series. Here, q = 2 and the fuzzy r-partition is P b with r = 3 .
Figure 27. L 2 -norm modified iF-transform values f c , ( P b , A , r ) q ( t j ) for Bitcoin series. Here, q = 2 and the fuzzy r-partition is P b with r = 3 .
Axioms 09 00139 g027
Figure 28. Pairwise scatter-plots of values f j , f ( P b , A , r ) q ( t j ) and f c , ( P b , A , r ) q ( t j ) using L 2 -norm F-transform smoothing of order q = 1 for Bitcoin series. The fuzzy r-partition is P b with r = 3 .
Figure 28. Pairwise scatter-plots of values f j , f ( P b , A , r ) q ( t j ) and f c , ( P b , A , r ) q ( t j ) using L 2 -norm F-transform smoothing of order q = 1 for Bitcoin series. The fuzzy r-partition is P b with r = 3 .
Axioms 09 00139 g028
Figure 29. Pairwise scatter-plots of values f j , f ( P b , A , r ) q ( t j ) and f c , ( P b , A , r ) q ( t j ) using L 2 -norm F-transform smoothing of order q = 2 for Bitcoin series. The fuzzy r-partition is P b with r = 3 .
Figure 29. Pairwise scatter-plots of values f j , f ( P b , A , r ) q ( t j ) and f c , ( P b , A , r ) q ( t j ) using L 2 -norm F-transform smoothing of order q = 2 for Bitcoin series. The fuzzy r-partition is P b with r = 3 .
Axioms 09 00139 g029
Figure 30. Model A: L p -norm F-transform forecast (denoted as fFor) for the last 1200 days of available data for Bitcoin series (observed time series is denoted as fSer). Here, p = 1.5 , q = 1 and two lags l = 1 (top two pictures), l = 2 (bottom). The percent errors 100 f S e r t f F o r t f S e r t are also plotted.
Figure 30. Model A: L p -norm F-transform forecast (denoted as fFor) for the last 1200 days of available data for Bitcoin series (observed time series is denoted as fSer). Here, p = 1.5 , q = 1 and two lags l = 1 (top two pictures), l = 2 (bottom). The percent errors 100 f S e r t f F o r t f S e r t are also plotted.
Axioms 09 00139 g030
Figure 31. Model B: L p -norm F-transform forecast (denoted as fFor) for the last 1200 days of available data for Bitcoin series (observed time series is denoted as fSer). Here, p = 1.5 , q = 1 and two lags l = 4 (top two pictures), l = 5 (bottom). The percent errors 100 f S e r t f F o r t f S e r t are also plotted.
Figure 31. Model B: L p -norm F-transform forecast (denoted as fFor) for the last 1200 days of available data for Bitcoin series (observed time series is denoted as fSer). Here, p = 1.5 , q = 1 and two lags l = 4 (top two pictures), l = 5 (bottom). The percent errors 100 f S e r t f F o r t f S e r t are also plotted.
Axioms 09 00139 g031
Table 1. L p -norm F-transform of Bitcoin and GT100 time series For five values p and three orders q 0 , 1 , 3 the table shows ratios L ( f , f ( r ) ) with f B i t c o i n , G T 100 .
Table 1. L p -norm F-transform of Bitcoin and GT100 time series For five values p and three orders q 0 , 1 , 3 the table shows ratios L ( f , f ( r ) ) with f B i t c o i n , G T 100 .
pqBitcoinGT100
100.35260.1646
110.42980.2643
130.58130.4922
1.2500.34930.1628
1.2510.43150.2677
1.2530.56930.4813
1.500.34650.1649
1.510.42820.2702
1.530.55580.4785
1.7500.34480.1777
1.7510.42500.2783
1.7530.54830.4798
200.34280.1715
210.42210.2859
230.54320.4815
Table 2. Approximation indices for L p -norm F-transform of Bitcoin and GT100 For five values p 1 , 1.25 , 1.5 , 1.75 , 2 and three orders q 0 , 1 , 3 the table shows indices M S E , % M A E , and Kendall τ correlation for the pairs ( f , f ( r ) ) with f B i t c o i n , G T 100 .
Table 2. Approximation indices for L p -norm F-transform of Bitcoin and GT100 For five values p 1 , 1.25 , 1.5 , 1.75 , 2 and three orders q 0 , 1 , 3 the table shows indices M S E , % M A E , and Kendall τ correlation for the pairs ( f , f ( r ) ) with f B i t c o i n , G T 100 .
Bitcoin GT100
pq M S E % M A E τ M S E % M A E τ
10363.982.51730.97662.76707.82190.8897
11201.662.01520.98151.86106.71390.9049
13148.751.47390.98601.56144.82930.9331
1.250219.722.51710.97682.39397.93070.8911
1.251191.582.03150.98171.82116.83420.9074
1.253139.341.48740.98631.45564.92100.9345
1.50217.612.56410.97652.33398.13740.8910
1.51190.202.08050.98141.80277.02400.9068
1.53137.721.52840.98611.41695.11720.9331
1.750217.792.62320.97622.30208.39320.8896
1.751191.262.12980.98101.79207.25050.9057
1.753137.751.57340.98581.40315.34300.9311
20219.412.68590.97572.28558.68580.8877
21192.742.17780.97081.79647.48790.9038
23138.461.61550.98561.40405.57620.9289
Table 3. Approximation indices for L p -norm F-transform of Bitcoin and GT100 For five values p 1 , 1.25 , 1.5 , 1.75 , 2 and three orders q 0 , 1 , 3 the table shows indices M S E , % M A E , and Kendall τ correlation for the pairs ( f , f ( r ) ) with f B i t c o i n , G T 100 .
Table 3. Approximation indices for L p -norm F-transform of Bitcoin and GT100 For five values p 1 , 1.25 , 1.5 , 1.75 , 2 and three orders q 0 , 1 , 3 the table shows indices M S E , % M A E , and Kendall τ correlation for the pairs ( f , f ( r ) ) with f B i t c o i n , G T 100 .
Bitcoin GT100
pq M S E % M A E τ M S E % M A E τ
10363.982.51730.97662.76707.82190.8897
11201.662.01520.98151.86106.71390.9049
13148.751.47390.98601.56144.82930.9331
1.250219.722.51710.97682.39397.93070.8911
1.251191.582.03150.98171.82116.83420.9074
1.253139.341.48740.98631.45564.92100.9345
1.50217.612.56410.97652.33398.13740.8910
1.51190.202.08050.98141.80277.02400.9068
1.53137.721.52840.98611.41695.11720.9331
1.750217.792.62320.97622.30208.39320.8896
1.751191.262.12980.98101.79207.25050.9057
1.753137.751.57340.98581.40315.34300.9311
20219.412.68590.97572.28558.68580.8877
21192.742.17780.97081.79647.48790.9038
23138.461.61550.98561.40405.57620.9289
Table 4. Model A: L p -norm Fitting and Forecasting results For p = 1.5 , the table shows indices % M A E and Kendall τ correlation for the fitting (columns 4,5 with label f i t ) and the forecasting (columns 6,7 with label f o r ) corresponding to five lags l 1 , 2 , 3 , 4 , 5 and three pairs ( q , r ) of order q and bandwidth r.
Table 4. Model A: L p -norm Fitting and Forecasting results For p = 1.5 , the table shows indices % M A E and Kendall τ correlation for the fitting (columns 4,5 with label f i t ) and the forecasting (columns 6,7 with label f o r ) corresponding to five lags l 1 , 2 , 3 , 4 , 5 and three pairs ( q , r ) of order q and bandwidth r.
qrl % MAE -fit τ -fit % MAE -for τ -for
0111.650.983.300.96
21.670.984.410.95
31.670.985.350.94
41.580.986.170.93
51.580.986.880.92
1110.620.993.150.96
20.590.995.080.94
30.600.996.590.93
40.600.998.090.91
50.600.9910.130.89
0212.240.973.700.95
22.280.974.770.94
32.310.975.650.93
42.220.976.430.92
52.210.977.110.92
Table 5. Model B: L p -norm Fitting and Forecasting results For p = 1.5 , the table shows indices % M A E and Kendall τ correlation for the fitting (columns 4,5 with label f i t ) and the forecasting (columns 6,7 with label f o r ) corresponding to five lags l 1 , 2 , 3 , 4 , 5 and three pairs of order q and bandwidth r.
Table 5. Model B: L p -norm Fitting and Forecasting results For p = 1.5 , the table shows indices % M A E and Kendall τ correlation for the fitting (columns 4,5 with label f i t ) and the forecasting (columns 6,7 with label f o r ) corresponding to five lags l 1 , 2 , 3 , 4 , 5 and three pairs of order q and bandwidth r.
qrl % MAE -fit τ -fit % MAE -for τ -for
0111.560.983.210.96
21.570.984.330.95
31.560.985.310.93
41.570.986.120.93
51.570.986.830.92
1111.170.983.160.96
21.200.984.540.94
31.170.985.840.93
41.160.986.980.92
51.180.987.890.91
0211.560.983.210.96
21.570.984.340.95
31.560.985.310.93
41.570.986.120.93
51.570.986.830.92
Table 6. Diebold–Mariano test statistics for L p -norm forecasting results The table shows indexes s M S E = M S E and Kendall rank correlation τ , corresponding to lag l = 1 forecasts for Models A and B, obtained with p = 1.5 , q = 1 and bandwidth r = 1 .
Table 6. Diebold–Mariano test statistics for L p -norm forecasting results The table shows indexes s M S E = M S E and Kendall rank correlation τ , corresponding to lag l = 1 forecasts for Models A and B, obtained with p = 1.5 , q = 1 and bandwidth r = 1 .
sMSE DM-statp-Value τ
Model A488.2−2.94680.00160.9265
Model B487.0−3.83010.0000640.9186
Random Walk585.1--0.9078
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Guerra, M.L.; Sorini, L.; Stefanini, L. Bitcoin Analysis and Forecasting through Fuzzy Transform. Axioms 2020, 9, 139. https://0-doi-org.brum.beds.ac.uk/10.3390/axioms9040139

AMA Style

Guerra ML, Sorini L, Stefanini L. Bitcoin Analysis and Forecasting through Fuzzy Transform. Axioms. 2020; 9(4):139. https://0-doi-org.brum.beds.ac.uk/10.3390/axioms9040139

Chicago/Turabian Style

Guerra, Maria Letizia, Laerte Sorini, and Luciano Stefanini. 2020. "Bitcoin Analysis and Forecasting through Fuzzy Transform" Axioms 9, no. 4: 139. https://0-doi-org.brum.beds.ac.uk/10.3390/axioms9040139

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop