Deep Learning Methods for Modeling Bitcoin Price

Lamothe-Fernández, Prosper; Alaminos, David; Lamothe-López, Prosper; Fernández-Gámez, Manuel A.

doi:10.3390/math8081245

Open AccessArticle

Deep Learning Methods for Modeling Bitcoin Price

¹

Department of Financing and Commercial Research, UDI of Financing, Calle Francisco Tomás y Valiente, 5, Universidad Autónoma de Madrid, 28049 Madrid, Spain

²

Department of Economic Theory and Economic History, Campus El Ejido s/n, University of Malaga, 29071 Malaga, Spain

³

Rho Finanzas Partner, Calle de Zorrilla, 21, 28014 Madrid, Spain

⁴

Department of Finance and Accounting, Campus El Ejido s/n, University of Malaga, 29071 Malaga, Spain

^*

Author to whom correspondence should be addressed.

Mathematics 2020, 8(8), 1245; https://0-doi-org.brum.beds.ac.uk/10.3390/math8081245

Submission received: 25 June 2020 / Revised: 24 July 2020 / Accepted: 28 July 2020 / Published: 30 July 2020

(This article belongs to the Special Issue Quantitative Methods for Economics and Finance)

Download

Browse Figures

Versions Notes

Abstract

:

A precise prediction of Bitcoin price is an important aspect of digital financial markets because it improves the valuation of an asset belonging to a decentralized control market. Numerous studies have studied the accuracy of models from a set of factors. Hence, previous literature shows how models for the prediction of Bitcoin suffer from poor performance capacity and, therefore, more progress is needed on predictive models, and they do not select the most significant variables. This paper presents a comparison of deep learning methodologies for forecasting Bitcoin price and, therefore, a new prediction model with the ability to estimate accurately. A sample of 29 initial factors was used, which has made possible the application of explanatory factors of different aspects related to the formation of the price of Bitcoin. To the sample under study, different methods have been applied to achieve a robust model, namely, deep recurrent convolutional neural networks, which have shown the importance of transaction costs and difficulty in Bitcoin price, among others. Our results have a great potential impact on the adequacy of asset pricing against the uncertainties derived from digital currencies, providing tools that help to achieve stability in cryptocurrency markets. Our models offer high and stable success results for a future prediction horizon, something useful for asset valuation of cryptocurrencies like Bitcoin.

Keywords:

bitcoin; deep learning; deep recurrent convolutional neural networks; forecasting; asset pricing

1. Introduction

Bitcoin is a cryptocurrency built by free software based on peer-to-peer networks as an irreversible private payment platform. Bitcoin lacks a physical form, is not backed by any public body, and therefore any intervention by a government agency or other agent is not necessary to transact [1]. These transactions are made from the blockchain system. Blockchain is an open accounting book, which records transactions between two parties efficiently, leaving such a mark permanently and impossible to erase, making this tool a decentralized validation protocol that is difficult to manipulate, and with low risk of fraud. The blockchain system is not subject to any individual entity [2].

For Bitcoin, the concept originated from the concept of cryptocurrency, or virtual currency [3]. Cryptocurrencies are a monetary medium that is not affected by public regulation, nor is it subject to a regulatory body. It only affects the activity and rules developed by the developers. Cryptocurrencies are virtual currencies that can be created and stored only electronically [4]. The cryptocurrency is designed to serve as a medium of exchange and for this, it uses cryptography systems to secure the transaction and control the subsequent creation of the cryptocurrency. Cryptocurrency is a subset of a digital currency designed to function as a medium of exchange and cryptography is used to secure the transaction and control the future creation of the cryptocurrency.

Forecasting Bitcoin price is vitally important for both asset managers and independent investors. Although Bitcoin is a currency, it cannot be studied as another traditional currency where economic theories about uncovered interest rate parity, future cash-flows model, and purchasing power parity matter, since different standard factors of the relationship between supply and demand cannot be applied in the digital currency market like Bitcoin [5]. On the one hand, Bitcoin has different characteristics that make it useful for those agents who invest in Bitcoin, such as transaction speed, dissemination, decentrality, and the large virtual community of people interested in talking and providing relevant information about digital currencies, mainly Bitcoin [6].

Velankar and colleagues [7] attempted to predict the daily price change sign as accurately as possible using Bayesian regression and generalized linear model. To do this, they considered the daily trends of the Bitcoin market and focused on the characteristics of Bitcoin transactions, reaching an accuracy of 51% with the generalized linear model. McNally and co-workers [8] studied the precision with which the direction of the Bitcoin price in United States Dollar (USD) can be predicted. They used a recurrent neural network (RNN), a long short-term memory (LSTM) network, and the autoregressive integrated moving average (ARIMA) method. The LSTM network obtains the highest classification accuracy of 52% and a root mean square error (RMSE) of 8%. As expected, non-linear deep learning methods exceeded the ARIMA method’s prognosis. For their part, Yogeshwaran and co-workers [9] applied convolutional and recurrent neural networks to predict the price of Bitcoin using data from a time interval of 5 min to 2 h, with convolutional neural networks showing a lower level of error, at around 5%. Demir and colleagues [10] predicted the price of Bitcoin using methods such as long short-term memory networks, naïve Bayes, and the nearest neighbor algorithm. These methods achieved accuracy rates between 97.2% and 81.2%. Rizwan, Narejo, and Javed [11] continued with the application of deep learning methods with the techniques of RNN and LSTM. Their results showed an accuracy of 52% and an 8% RMSE by the LSTM. Linardatos and Kotsiantis [12] had the same results, after using eXtreme Gradient Boosting (XGBoost) and LSTM; they concluded that this last technique yielded a lower RMSE of 0.999. Despite the superiority of computational techniques, Felizardo and colleagues [13] showed that ARIMA had a lower error rate than methods, such as random forest (RF), support vector machine (SVM), LSTM, and WaveNets, to predict the future price of Bitcoin. Finally, other works showed new deep learning methods, such as Dutta, Kumar, and Basu [14], who applied both LSTM and the gated recurring unit (GRU) model; the latter showed the best error result, with an RMSE of 0.019. Ji and co-workers [15] predicted the price of Bitcoin with different methodologies such as deep neural network (DNN), the LSTM model, and convolutional neural network. They obtained a precision of 60%, leaving the improvement of precision with deep learning techniques and a greater definition of significant variables as a future line of research. These authors show the need for stable prediction models, not only with data in and out of the sample, but also in forecasts of future results.

To contribute to the robustness of the Bitcoin price prediction models, in the present study a comparison of deep learning methodologies to predict and model the Bitcoin price is developed and, as a consequence, a new model that generates better forecasts of the Bitcoin price and its behavior in the future. This model can predict achieving accuracy levels above 95%. This model was constructed from a sample of 29 variables. Different methods were applied in the construction of the Bitcoin price prediction model to build a reliable model, which is contrasted with various methodologies used in previous works to check with which technique a high predictive capacity is achieved; specifically, the methods of deep recurrent neural networks, deep neural decision trees, and deep support vector machines, were used. Furthermore, this work attempts to obtain high accuracy, but it is also robust and stable in the future horizon to predict new observations, something that has not yet been reported by previous works [7,8,9,10,11,12,13,14,15], but which some authors demand for the development of these models and their real contribution [9,12].

We make two main contributions to the literature. First, we consider new explanatory variables for modeling the Bitcoin price, testing the importance of these variables which have not been considered so far. It has important implications for investors, who will know which indicators provide reliable, accurate, and potential forecasts of the Bitcoin price. Second, we improve the prediction accuracy concerning that obtained in previous studies with innovative methodologies.

This study is structured as follows: Section 2 explains the theory of methods applied. Section 3 offers details of the data and the variables used in this study. Section 4 develops the results obtained. Section 5 provides conclusions of the study and the purposes of the models obtained.

2. Deep Learning Methods

As previously stated, different deep learning methods have been applied for the development of Bitcoin price prediction models. We use this type of methodology thanks to its high predictive capacity obtained in the previous literature on asset pricing to meet one of the objectives of this study, which is to achieve a robust model. Specifically, deep recurrent convolution neural network, deep neural decision trees, and deep learning linear support vector machines have been used. The characteristics of each classification technique used are detailed below. In addition, the method of analysis of the sensitivity of variables used in the present study, in particular, the method of Sobol [16], which is necessary to determine the level of significance of the variables used in the prediction of Bitcoin price is recorded, fulfilling the need presented by the previous literature in the realization of the task of feature selection [15].

2.1. Deep Recurrent Convolution Neural Network (DRCNN)

Recurrent neural networks (RNN) have been applied in different fields for prediction due to its huge prediction performance. The previous calculations made are those that form the result within the structure of the RNN [17]. Having an input sequence vector x, the hidden nodes of a layer s, and the output of a hidden layer y, can be estimated as explained in Equations (1) and (2).

s_{t} = σ (W_{xs} x_{t} + W_{ss} s_{t - 1} + b_{s})

(1)

y_{t} = o (W_{so} s_{t} + b_{y})

(2)

where W_xs, W_ss, and W_so define the weights from the input layer x to the hidden layer s, by the biases of the hidden layer and output layer. Equation (3) points out σ and o as the activation functions.

S T F T {z (t)} (τ, ω) \equiv T (τ, ω) = \int_{- \infty}^{+ \infty} z (t) ω (t - τ) e^{- j ω t} d t

(3)

where z(t) is the vibration signals, and ω(t) is the Gaussian window function focused around 0. T(τ, ω) is the function that expresses the vibration signals. To calculate the hidden layers with the convolutional operation, Equations (4) and (5) are applied.

S_{t} = σ (W_{TS} {* T}_{t} + W_{ss} {* S}_{t - 1} + B_{s})

(4)

Y_{t} = o (W_{YS} {* S}_{t} + B_{y})

(5)

where W indicates the convolution kernels.

Recurrent convolutional neural network (RCNN) can be heaped to establish a deep architecture, called the deep recurrent convolutional neural network (DRCNN) [18,19]. To use the DRCNN method in the predictive task, Equation (6) determines how the last phase of the model serves as a supervised learning layer.

\hat{r} = σ (W_{h} * h + b_{h})

(6)

where W_h is the weight and b_h is the bias. The model calculates the residuals caused by the difference between the predicted and the actual observations in the training stage [20]. Stochastic gradient descent is applied for optimization to learn the parameters. Considering that the data at time t is r, the loss function is determined as shown in Equation (7).

L (r, \hat{r}) = \frac{1}{2} ‖ r - \hat{r} ‖_{2}^{2}

(7)

2.2. Deep Neural Decision Trees (DNDT)

Deep neural decision trees are decision tree (DT) models performed by deep learning neural networks, where a weight division corresponding to the DNDT belongs to a specific decision tree and, therefore, it is possible to interpret its information [21]. Stochastic gradient descent (SGD) is used to optimize the parameters at the same time; this partitions the learning processing in mini-batches and can be attached to a larger standard neural network (NN) model for end-to-end learning with backward propagation. In addition, standard DTs gain experience through a greedy and recursive factor division. This can make a selection of functions more efficient [22]. The method starts by performing a soft binning function to compute the residual rate for each node, making it possible to make decisions divided into DNDTs [23]. The input of a binning function is a real scalar x which makes an index of the containers to which x belongs.

The activation function of the DNDT algorithm is carried out based on the NN represented in Equation (8).

π = fw,b,τ (x) = softmax((wx + b)/τ)

(8)

where w is a constant with value w = [1, 2, ..., n + 1], τ > 0 is a temperature factor, and b is defined in Equation (9).

b = [0, −β1, −β1, −β2, ..., −β1 − β2 − · · · − βn]

(9)

The coding of the binning function x is given by the NN according the expression of Equation (9) [24]. The key idea is to build the DT with the applied Kronecker product from the binning function defined above. Connecting every feature x_d with its NN f_d (x_d), we can determine all the final nodes of the DT as appears in Equation (10).

z = f1(x1) ⊗ f2(x2) ⊗ · · · ⊗fD(xD)

(10)

where z expresses the leaf node index obtained by instance x in vector form. The complexity parameter of the model is determined by the number of cut points of each node. There may be inactive points since the values of the cut points are usually not limited.

2.3. Deep Learning Linear Support Vector Machines (DSVR)

Support vector machines (SVMs) were created for binary classification. Training data are denoted by its labels

(x_{n}, y_{n}), n = 1, \dots, N, x_{n} \in ℝ^{D}, t_{n} \in {- 1, + 1}

; SVMs are optimized according to Equation (11).

\begin{matrix} \min_{w ξ_{n}} \frac{1}{2} W^{T} W + C \sum_{n = 1}^{N} ξ_{n} \\ s . t . W^{T} x_{n} t_{n} \geq 1 - ξ_{n} > \forall n \\ ξ_{n} \geq 0 \forall n \end{matrix}

(11)

where

ξ_{n}

are features that punish observations that do not meet the margin requirements [25]. The optimization problem is defined as appears in Equation (12).

\min_{w} \frac{1}{2} W^{T} W + C \sum_{n = 1}^{N} m a x (1 - W^{T} x_{n} t_{n}, 0)

(12)

Usually the Softmax or 1-of-K encoding method is applied in the classification task of deep learning algorithms. In the case of working with 10 classes, the Softmax layer is composed of 10 nodes and expressed by p_i, where i = 1, ..., 10; p_i specifies a discrete probability distribution,

\sum_{i}^{10} p_{i} = 1

.

Equation (13) is defined by h as the activation of the penultimate layer nodes, W as the weight linked by the penultimate layer to the Softmax layer, and the total input into a Softmax layer. The next expression is the result.

a_{i} = \sum_{k} h_{k} W_{k i}

(13)

p_{i} = \frac{e x p (a_{i})}{\sum_{j}^{10} e x p (a_{j})}

(14)

The predicted class î would be as follows in Equation (15).

î = a r g \max_{i} p_{i} = a r g \max_{i} a_{i}

(15)

Since linear-SVM is not differentiable, a popular variation is known as the DSVR, which minimizes the squared hinge loss as indicated in Equation (16).

\min_{w} \frac{1}{2} W^{T} W + C \sum_{n = 1}^{N} m a x {(1 - W^{T} x_{n} t_{n}, 0)}^{2}

(16)

The target of the DSVR is to train deep neural networks for prediction [24,25]. Equation (17) expresses the differentiation of the activation concerning the penultimate layer, where l (w) is said differentiation, changing the input x for the activation h.

\frac{\partial l (w)}{\partial h_{n}} = - C t_{n} w (1 {1 > w^{T} h_{t} t_{n}})

(17)

where

𝟙 {\cdot}

is the indicator function. Likewise, for the DSVR, we have Equation (18).

\frac{\partial l (w)}{\partial h_{n}} = - 2 C t_{n} w (m a x (1 - W^{T} h_{n} t_{n}, 0))

(18)

2.4. Sensitivity Analysis

Data mining methods have the virtue of offering a great amount of explanation to the authors’ studied problem. To know what the degree is, sensitivity analysis is performed. This analysis tries to quantify the relative importance of the independent variables concerning the dependent variable [26,27]. To do this, the search for the reduction of the set of initial variables continues, leaving only the most significant ones. The variance limit follows, where one variable is significant if its variance increases concerning the rest of the variables as a whole. The Sobol method [16] is applied to decompose the variance of the total output V (Y) offered by the set of equations expressed in Equation (19).

V (Y) = \sum_{i} V_{i} + \sum_{i} \sum_{j > 1} V_{i j} + \dots + V_{1, 2, \dots k}

(19)

where

V_{i} = V E (Y | X_{i})

and

V_{i j} = V E (Y | X_{i}, X_{j})) - V_{i} - V_{j}

.

S_{i} = V i / V

and

S_{i j} = V_{i j} / V

define the sensitivity indexes, with S_ij being the effect of interaction between two variables. The Sobol decomposition allows the estimation of a total sensitivity index, STi, which measures the sum of all the sensitivity effects involved in the independent variables.

3. Data and Variables

The sample period selected is from 2011 to 2019, with a quarterly frequency of data. To obtain the information of the independent variables, data from the IMF’s International Financial Statistics (IFS), the World Bank, FRED Sant Louis, Google Trends, Quandl, and Blockchain.info were used.

The dependent variable used in this study is the Bitcoin price and is defined as the value of Bitcoin in USD. In addition, we used 29 independent variables, classified into demand and supply variables, attractiveness, and macroeconomic and financial variables, as possible predictors of the Bitcoin future price (Table 1). These variables were used throughout the previous literature [1,3,4,14].

The sample is fragmented into three mutually exclusive parts, one for training (70% of the data), one for validation (10% of the data), and the third group for testing (20% of the data). The training data are used to build the intended models, while the validation data attempt to assess whether there is overtraining of those models. As for the test data, they serve to evaluate the built model and measure the predictive capacity. The percentage of correctly classified cases is the precision results and RMSE measures the level of errors made. Furthermore, for the distribution of the sample data in these three phases, cross-validation 10 times with 500 iterations was used [28,29].

4. Results

4.1. Descriptive Statistics

Table 2 shows a statistical summary of the independent variables for predicting Bitcoin price. It is observed that all the variables obtain a standard deviation not higher than each value of the mean. Therefore, the data show initial stability. On the other hand, there is a greater difference between the minimum and maximum values. Variables like mining commissions and cost per transaction show a small minimum value compared to their mean value. The same fact happens with the hash variable. Despite these extremes, they do not affect the values of the standard deviations of the respective variables.

4.2. Empirical Results

Table 3 and Figure 1, Figure 2 and Figure 3 show the level of accuracy, the root mean square error (RMSE), and the mean absolute percentage error (MAPE). In all models, the level of accuracy always exceeds 92.61% for testing data. For its part, the RMSE and MAPE levels are adequate. The model with the highest accuracy is that of deep recurrent convolution neural network (DRCNN) with 97.34%, followed by the model of deep neural decision trees (DNDT) method with 96.94% on average by regions. Taken together, these results provide a level of accuracy far superior to that of previous studies. Thus, in the work of Ji and co-workers [15], an accuracy of around 60% is revealed; in the case of McNally and co-workers [8], it is close to 52%; and in the study of Rizwan, Narejo, and Javed [11], it approaches 52%. Finally, Table 4 shows the most significative variables by methods after applying the Sobol method for the sensitivity analysis.

Table 4 shows additional information on the significant variables. Block size, cost per transaction, and difficulty were significant in the three models for each method applied. This demonstrates the importance of the cost to carry out the Bitcoin transaction, of the block of Bitcoins to buy, as well as the difficulty of the miners to find new Bitcoins, as the main factors in the task of determining the price of Bitcoin. This contrasts with the results shown in previous studies, where these variables are not significant or are not used by the initial set of variables [5,7,8]. The best results were obtained by the DRCNN method, where in addition to the aforementioned variables, the transaction value, transaction volume, block size, dollar exchange rate, Dow Jones, and gold were also significant. This shows that the demand and supply variables of the Bitcoin market are essential to predict its price, something that has been shown by some previous works [1,30]. Yet significant macroeconomic and financial variables have not been observed as important factors by other recent works [30,31], since they were shown as variables that did not influence Bitcoin price fluctuations. In our results, the macroeconomic variables of Dow Jones and gold have been significant in all methods.

On the other hand, the models built by the DNDT and DSVR methods show high levels of precision, although lower than those obtained by the DRCNN. Furthermore, these methods show some different significant variables. Such is the case of the variables of forum posts, a variable popularly used as a proxy for the level of future demand that Bitcoin could have, although with divergences in previous works regarding its significance to predict the price of Bitcoin, where some works show that this variable is not significant [11,14]. Finally, these methods show another macroeconomic variable that is more significant, in the case of the dollar exchange rate. This represents the importance that changes in the price of the USD with Bitcoin can be decisive in estimating the possible demand and, therefore, a change in price. This variable, like the rest of the macroeconomic variables, has not been shown as a significant variable [5,31].

This set of variables observed as significant represents a group of novel factors that determine the price of Bitcoin and therefore, is different from that shown in the previous literature.

4.3. Post-Estimations

In this section, we try to perform estimations of models to generate forecasts in a future horizon. For this, we used the framework of multiple-step ahead prediction, applying the iterative strategy and models built to predict one step forward are trained [32]. At time t, a prediction is made for moment t + 1, and this prediction is used to predict for moment t + 2 and so on. This means that the predicted data for t + 1 are considered real data and are added to the end of the available data [33]. Table 5 and Figure 4, Figure 5 and Figure 6 show the accuracy and error results for t + 1 and t + 2 forecasting horizons. For t + 1, the range of precision for the three methods is 88.34–94.19% on average, where the percentage of accuracy is higher in the DRCNN (94.19%). For t + 2, this range of precision is 85.76–91.37%, where the percentage of accuracy is once again higher in the DRCNN (91.37%). These results show the high precision and great robustness of the models.

5. Conclusions

This study developed a comparison of methodologies to predict Bitcoin price and, therefore, a new model was created to forecast this price. The period selected was from 2011 to 2019. We applied different deep learning methods in the construction of the Bitcoin price prediction model to achieve a robust model, such as deep recurrent convolutional neural network, deep neural decision trees and deep support vector machines. The DRCNN model obtained the highest levels of precision. We propose to increase the level of performance of the models to predict the price of Bitcoin compared to previous literature. This research has shown significantly higher precision results than those shown in previous works, achieving a precision hit range of 92.61–95.27%. Likewise, it was possible to identify a new set of significant variables for the prediction of the price of Bitcoin, offering great stability in the models developed predicting in the future horizons of one and two years.

This research allows us to increase the results and conclusions on the price of Bitcoin concerning previous works, both in matters of precision and error, but also on significant variables. A set of significant variables for each methodology applied has been selected analyzing our results, but some of these variables are recurrent in the three methods. This supposes an important addition to the field of cryptocurrency pricing. The conclusions are relevant to central bankers, investors, asset managers, private forecasters, and business professionals for the cryptocurrencies market, who are generally interested in knowing which indicators provide reliable, accurate, and potential forecasts of price changes. Our study suggests new and significant explanatory variables to allow these agents to predict the Bitcoin price phenomenon. These results have provided a new Bitcoin price forecasting model developed using three methods, with the DCRNN model as the most accurate, thus contributing to existing knowledge in the field of machine learning, and especially, deep learning. This new model can be used as a reference for setting asset pricing and improved investment decision-making.

In summary, this study provides a significant opportunity to contribute to the field of finance, since the results obtained have significant implications for the future decisions of asset managers, making it possible to avoid big change events of the price and the potential associated costs. It also helps these agents send warning signals to financial markets and avoid massive losses derived from an increase of volatility in the price.

Opportunities for further research in this field include developing predictive models considering volatility correlation of the other new alternative assets and also safe-haven assets such as gold or stable currencies, that evaluate the different scenarios of portfolio choice and optimization.

Author Contributions

Conceptualization, P.L.-F., D.A., P.L.-L. and M.A.F.-G.; Data curation, D.A. and M.A.F.-G.; Formal analysis, P.L.-F., D.A. and P.L.-L.; Funding acquisition, P.L.-F., P.L.-L. and M.A.F.-G.; Investigation, D.A. and M.A.F.-G.; Methodology, D.A.; Project administration, P.L.-F. and M.A.F.-G.; Resources, P.L.-F. and M.A.F.-G.; Software, D.A.; Supervision, D.A.; Validation, D.A. and P.L.-L.; Visualization, P.L.-F. and D.A.; Writing—original draft, P.L.-F. and D.A.; Writing—review & editing, P.L.-F., D.A., P.L.-L. and M.A.F.-G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Cátedra de Economía y Finanzas Sostenibles, University of Malaga, Spain.

Conflicts of Interest

The authors declare no conflict of interest.

References

Kristoufek, L. What Are the Main Drivers of the Bitcoin Price? Evidence from Wavelet Coherence Analysis. PLoS ONE 2015, 10, e0123923. [Google Scholar] [CrossRef]
Wamba, S.F.; Kamdjoug, J.R.K.; Bawack, R.E.; Keogh, J.G. Bitcoin, Blockchain and Fintech: A systematic review and case studies in the supply chain. Prod. Plan. Control Manag. Oper. 2019, 31, 115–142. [Google Scholar] [CrossRef]
Chen, W.; Zheng, Z.; Ma, M.; Wu, J.; Zhou, Y.; Yao, J. Dependence structure between bitcoin price and its influence factors. Int. J. Comput. Sci. Eng. 2020, 21, 334–345. [Google Scholar] [CrossRef]
Balcilar, M.; Bouri, E.; Gupta, R.; Roubaud, D. Can volume predict bitcoin returns and volatility? A quantiles-based approach. Econ. Model. 2017, 64, 74–81. [Google Scholar] [CrossRef] [Green Version]
Ciaian, P.; Rajcaniova, M.; Artis Kancs, D. The economics of BitCoin price formation. Appl. Econ. 2016, 48, 1799–1815. [Google Scholar] [CrossRef] [Green Version]
Schmidt, R.; Möhring, M.; Glück, D.; Haerting, R.; Keller, B.; Reichstein, C. Benefits from Using Bitcoin: Empirical Evidence from a European Country. Int. J. Serv. Sci. Manag. Eng. Technol. 2016, 7, 48–62. [Google Scholar] [CrossRef]
Velankar, S.; Valecha, S.; Maji, S. Bitcoin Price Prediction using Machine Learning. In Proceedings of the 20th International Conference on Advanced Communications Technology (ICACT), Chuncheon-si, Korea, 11–14 February 2018. [Google Scholar]
McNally, S.; Roche, J.; Caton, S. Predicting the Price of Bitcoin Using Machine Learning. In Proceedings of the 26th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, Cambridge, UK, 21–23 March 2018. [Google Scholar]
Yogeshwaran, S.; Kaur, M.J.; Maheshwari, P. Project Based Learning: Predicting Bitcoin Prices using Deep Learning. In Proceedings of the 2019 IEEE Global Engineering Education Conference (EDUCON), Dubai, UAE, 9–11 April 2019. [Google Scholar]
Demir, A.; Akılotu, B.N.; Kadiroğlu, Z.; Şengür, A. Bitcoin Price Prediction Using Machine Learning Methods. In Proceedings of the 2019 1st International Informatics and Software Engineering Conference (UBMYK), Ankara, Turkey, 6–7 November 2019. [Google Scholar]
Rizwan, M.; Narejo, S.; Javed, M. Bitcoin price prediction using Deep Learning Algorithm. In Proceedings of the 13th International Conference on Mathematics, Actuarial Science, Computer Science and Statistics (MACS), Karachi, Pakistan, 14–15 December 2019. [Google Scholar]
Linardatos, P.; Kotsiantis, S. Bitcoin Price Prediction Combining Data and Text Mining. In Advances in Integrations of Intelligent Methods. Smart Innovation, Systems and Technologies; Hatzilygeroudis, I., Perikos, I., Grivokostopoulou, F., Eds.; Springer: Singapore, 2020. [Google Scholar]
Felizardo, L.; Oliveira, R.; Del-Moral-Hernández, E.; Cozman, F. Comparative study of Bitcoin price prediction using WaveNets, Recurrent Neural Networks and other Machine Learning Methods. In Proceedings of the 6th International Conference on Behavioral, Economic and Socio-Cultural Computing (BESC), Beijing, China, 28–30 October 2019. [Google Scholar]
Dutta, A.; Kumar, S.; Basu, M. A Gated Recurrent Unit Approach to Bitcoin Price Prediction. J. Risk Financ. Manag. 2020, 13, 23. [Google Scholar] [CrossRef] [Green Version]
Ji, S.; Kim, J.; Im, H. A Comparative Study of Bitcoin Price Prediction Using Deep Learning. Mathematics 2019, 7, 898. [Google Scholar] [CrossRef] [Green Version]
Saltelli, A. Making best use of model evaluations to compute sensitivity indices. Comput. Phys. Commun. 2002, 145, 280–297. [Google Scholar] [CrossRef]
Wang, S.; Chen, X.; Tong, C.; Zhao, Z. Matching Synchrosqueezing Wavelet Transform and Application to Aeroengine Vibration Monitoring. IEEE Trans. Instrum. Meas. 2017, 66, 360–372. [Google Scholar] [CrossRef]
Huang, C.-W.; Narayanan, S.S. Deep convolutional recurrent neural network with attention mechanism for robust speech emotion recognition. In Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, Hong Kong, China, 10–14 July 2017; pp. 583–588. [Google Scholar]
Ran, X.; Xue, L.; Zhang, Y.; Liu, Z.; Sang, X.; Xe, J. Rock Classification from Field Image Patches Analyzed Using a Deep Convolutional Neural Network. Mathematics 2019, 7, 755. [Google Scholar] [CrossRef] [Green Version]
Ma, M.; Mao, Z. Deep Recurrent Convolutional Neural Network for Remaining Useful Life Prediction. In Proceedings of the 2019 IEEE International Conference on Prognostics and Health Management (ICPHM), San Francisco, CA, USA, 17–20 June 2019; pp. 1–4. [Google Scholar]
Yang, Y.; Garcia-Morillo, I.; Hospedales, T.M. Deep Neural Decision Trees. In Proceedings of the 2018 ICML Workshop on Human Interpretability in Machine Learning (WHI 2018), Stockholm, Sweden, 14 July 2018. [Google Scholar]
Norouzi, M.; Collins, M.D.; Johnson, M.; Fleet, D.J.; Kohli, P. Efficient non-greedy optimization of decision trees. In Proceedings of the 28th International Conference on Neural Information Processing Systems, Montreal, QC, Canada, 8–13 December 2015; pp. 1729–1737. [Google Scholar]
Dougherty, J.; Kohavi, R.; Sahami, M. Supervised and unsupervised discretization of continuous features. In Proceedings of the 12th International Conference on Machine Learning (ICML), Tahoe City, CA, USA, 9–12 July 1995. [Google Scholar]
Jang, E.; Gu, S.; Poole, B. Categorical reparameterization with Gumbel-Softmax. arXiv 2017, arXiv:1611.01144. [Google Scholar]
Tang, Y. Categorical reparameterization with Gumbel-Softmax. arXiv 2013, arXiv:1306.0239. [Google Scholar]
Delen, D.; Kuzey, C.; Uyar, A. Measuring firm performance using financial ratios: A decision tree approach. Expert Syst. Appl. 2013, 40, 3970–3983. [Google Scholar] [CrossRef]
Efimov, D.; Sulieman, H. Sobol Sensitivity: A Strategy for Feature Selection. In Mathematics Across Contemporary Sciences. AUS-ICMS 2015; Springer Proceedings in Mathematics & Statistics: Cham, Switzerland, 2017; Volume 190. [Google Scholar]
Alaminos, D.; Fernández, S.M.; García, F.; Fernández, M.A. Data Mining for Municipal Financial Distress Prediction, Advances in Data Mining, Applications and Theoretical Aspects. Lect. Notes Comput. Sci. 2018, 10933, 296–308. [Google Scholar]
Zhang, G.P.; Qi, M. Neural network forecasting for seasonal and trend time series. Eur. J. Oper. Res. 2005, 160, 501–514. [Google Scholar] [CrossRef]
Polasik, M.; Piotrowska, A.I.; Wisniewski, T.P.; Kotkowski, R.; Lightfoot, G. Price fluctuations and the use of Bitcoin: An empirical inquiry. Int. J. Electron. Commer. 2015, 20, 9–49. [Google Scholar] [CrossRef] [Green Version]
Al-Khazali, O.; Bouri, E.; Roubaud, D. The impact of positive and negative macroeconomic news surprises: Gold versus Bitcoin. Econ. Bull. 2018, 38, 373–382. [Google Scholar]
Koprinska, I.; Rana, M.; Rahman, A. Dynamic ensemble using previous and predicted future performance for Multi-step-ahead solar power forecasting. In Proceedings of the ICANN 2019: Artificial Neural Networks and Machine Learning, Munich, Germany, 17–19 September 2019; pp. 436–449. [Google Scholar]
Makridakis, S.; Spiliotis, E.; Assimakopoulos, V. Statistical and Machine Learning forecasting methods: Concerns and ways forward. PLoS ONE 2018, 13, e0194889. [Google Scholar] [CrossRef] [PubMed] [Green Version]

Figure 1. Results of accuracy evaluation: classification (%).

Figure 2. Results of accuracy evaluation: RMSE.

Figure 3. Results of accuracy evaluation: MAPE.

Figure 4. Multiple-step ahead forecasts in forecast horizon: accuracy.

Figure 5. Multiple-step ahead forecasts in forecast horizon: RMSE.

Figure 6. Multiple-step ahead forecasts in forecast horizon: MAPE.

Table 1. Independent variables.

Variables	Description
(a) Demand and Supply
Transaction value	Value of daily transactions
Number of Bitcoins	Number of mined Bitcoins currently circulating on the network
Bitcoins addresses	Number of unique Bitcoin addresses used per day
Transaction volume	Number of transactions per day
Unspent transactions	Number of valid unspent transactions
Blockchain transactions	Number of transactions on blockchain
Blockchain addresses	Number of unique addresses used in blockchain
Block size	Average block size expressed in megabytes
Miners reward	Block rewards paid to miners
Mining commissions	Average transaction fees (in USD)
Cost per transaction	Miners’ income divided by the number of transactions
Difficulty	Difficulty mining a new blockchain block
Hash	Times a hash function can be calculated per second
Halving	Process of reducing the emission rate of new units
(b) Attractive
Forum posts	Number of new members in online Bitcoin forums
Forum members	New posts in online Bitcoin forums
(c) Macroeconomic and Financial
Texas oil	Oil Price (West Texas)
Brent oil	Oil Price (Brent, London)
Dollar exchange rate	Exchange rate between the US dollar and the euro
Dow Jones	Dow Jones Index of the New York Stock Exchange
Gold	Gold price in US dollars per troy ounce

Table 2. Summary statistics.

Variables	Obs	Mean	SD	Min	Max
Transaction value	112	342,460,106,866,711.0000	143,084,554,727,531.0000	59,238,547,391,199.6000	735,905,260,141,564.0000
Number of bitcoins	112	13,634,297.4824	3,709,010.0736	5,235,454.5455	18,311,982.5000
Bitcoins addresses	112	285,034.2515	219,406.3874	1576.8333	849,668.1000
Transaction volume	112	154,548.8041	117,104.3686	1105.5000	373,845.6000
Unspent transactions	112	28,581,914.9054	22,987,595.3012	78,469.7273	66,688,779.9000
Blockchain transactions	112	156,444,312.9120	161,252,448.1997	237,174.8889	520,792,976.5000
Blockchain addresses	112	4,812,692.05	13,735,245.35	−14,437,299.03	117,863,226.2
Block size	112	0.4956	0.3638	0.0022	0.9875
Miners reward	112	420,160,582,581,028.0000	174,396,895,338,462.0000	101,244,436,734,897.0000	796,533,076,376,536.0000
Mining commissions	112	9,581,973,325,205.4400	42,699,799,790,392.8000	0.2591	315,387,506,596,395.0000
Cost per transaction	112	155,354,364,458,705.0000	156,696,788,525,225.0000	0.1179	757,049,771,708,905.0000
Difficulty	112	187,513,499,336,866.0000	195,421,886,528,251.0000	212,295,141,771.2000	836,728,509,520,663.0000
Hash	112	110,434,372.2765	154,717,725.3881	0.5705	516,395,703.4338
Halving	112	279,853,454,485,387.0000	162,806,469,642,875.0000	6,473,142,955,255.1700	804,437,327,302,638.0000
Forum posts	112	9279.8844	8585.0583	455.0000	53132.0000
Forum members	112	2432.2545	3394.4635	30.6364	14,833.3409
Texas Oil	112	72.4878	23.7311	21.1230	135.6700
Brent Oil	112	78.4964	26.5819	19.1900	139.3800
Dollar exchange rate	112	1.3767	0.9604	1.0494	8.7912
Dow Jones	112	15,926.7161	3324.8875	11,602.5212	22,044.8627
Gold	112	1329.400847	244.4099259	739.15	1846.75

Table 3. Results of accuracy evaluation: classification (%).

Sample	DRCNN			DNDT			DSVR
Sample	Acc. (%)	RMSE	MAPE	Acc. (%)	RMSE	MAPE	Acc. (%)	RMSE	MAPE
Training	97.34	0.66	0.29	95.86	0.70	0.33	94.49	0.75	0.38
Validation	96.18	0.71	0.34	95.07	0.74	0.37	93.18	0.81	0.43
Testing	95.27	0.77	0.40	94.42	0.79	0.42	92.61	0.84	0.47

DRCNN: deep recurrent convolution neural network; DNDT: deep neural decision trees; DSVR: deep learning linear support vector machines; Acc: accuracy; RMSE: root mean square error; MAPE: mean absolute percentage error.

Table 4. Results of accuracy evaluation: greater sensitivity variables.

DRCNN	DNDT	DSVR
Transaction value	Transaction volume	Transaction value
Transaction volume	Block size	Block size
Block size	Blockchain transactions	Blockchain transactions
Cost per transaction	Cost per transaction	Cost per transaction
Difficulty	Difficulty	Difficulty
Dollar exchange rate	Forum posts	Forum posts
Dow Jones	Dow Jones	Dollar exchange rate
Gold	Gold	Dow Jones
		Gold

Table 5. Multiple-step ahead forecasts in forecast horizon = t + 1 and t + 2.

Horizon	DRCNN			DNDT			DSVR
Horizon	Acc. (%)	RMSE	MAPE	Acc. (%)	RMSE	MAPE	Acc. (%)	RMSE	MAPE
t + 1	94.19	0.81	0.52	92.35	0.87	0.59	88.34	0.97	0.65
t + 2	91.37	0.92	0.63	89.41	1.03	0.67	85.76	1.10	0.78

Acc: accuracy.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Lamothe-Fernández, P.; Alaminos, D.; Lamothe-López, P.; Fernández-Gámez, M.A. Deep Learning Methods for Modeling Bitcoin Price. Mathematics 2020, 8, 1245. https://0-doi-org.brum.beds.ac.uk/10.3390/math8081245

AMA Style

Lamothe-Fernández P, Alaminos D, Lamothe-López P, Fernández-Gámez MA. Deep Learning Methods for Modeling Bitcoin Price. Mathematics. 2020; 8(8):1245. https://0-doi-org.brum.beds.ac.uk/10.3390/math8081245

Chicago/Turabian Style

Lamothe-Fernández, Prosper, David Alaminos, Prosper Lamothe-López, and Manuel A. Fernández-Gámez. 2020. "Deep Learning Methods for Modeling Bitcoin Price" Mathematics 8, no. 8: 1245. https://0-doi-org.brum.beds.ac.uk/10.3390/math8081245

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Deep Learning Methods for Modeling Bitcoin Price

Abstract

1. Introduction

2. Deep Learning Methods

2.1. Deep Recurrent Convolution Neural Network (DRCNN)

2.2. Deep Neural Decision Trees (DNDT)

2.3. Deep Learning Linear Support Vector Machines (DSVR)

2.4. Sensitivity Analysis

3. Data and Variables

4. Results

4.1. Descriptive Statistics

4.2. Empirical Results

4.3. Post-Estimations

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI