Next Article in Journal
Mediating Effects of Emotional Support Reception and Provision on the Relationship between Group Interaction and Psychological Well-Being: A Study of Young Patients
Next Article in Special Issue
Cardiovascular Health Peaks and Meteorological Conditions: A Quantile Regression Approach
Previous Article in Journal
Feasibility of Delivering an on-Campus Food Distribution Program in a Community College Setting: A Mixed Methods Sequential Explanatory Investigation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Assessing the Effect of Climate Variables on the Incidence of Dengue Cases in the Metropolitan Region of Panama City

by
Vicente Navarro Valencia
1,
Yamilka Díaz
2,
Juan Miguel Pascale
3,4,
Maciej F. Boni
5 and
Javier E. Sanchez-Galan
1,4,6,*
1
Facultad de Ciencias y Tecnología, Universidad Tecnológica de Panamá (UTP), El Dorado 0819-07289, Panama
2
Department of Research in Virology and Biotechnology, Gorgas Memorial Institute of Health Studies, Justo Arosemena Avenue and 35st Street, Panama 0816-02593, Panama
3
Unit of Diagnosis, Clinical Research and Tropical Medicine, Gorgas Memorial Institute of Health Studies, Justo Arosemena Avenue and 35st Street, Panama 0816-02593, Panama
4
Sistema Nacional de Investigación (SNI) SENACYT, Panama 0816-02852, Panama
5
Center for Infectious Disease Dynamics, Department of Biology, Pennsylvania State University, University Park, PA 16802, USA
6
Grupo de Investigaciones en Biotecnología, Bioinformática y Biología de Sistemas (GIBBS), Facultad de Ingenieria de Sistemas Computacionales, Universidad Tecnológica de Panamá (UTP), El Dorado 0819-07289, Panama
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2021, 18(22), 12108; https://0-doi-org.brum.beds.ac.uk/10.3390/ijerph182212108
Submission received: 30 September 2021 / Revised: 8 November 2021 / Accepted: 9 November 2021 / Published: 18 November 2021
(This article belongs to the Special Issue Statistical Methods in Environmental Epidemiology)

Abstract

:
The present analysis uses the data of confirmed incidence of dengue cases in the metropolitan region of Panama from 1999 to 2017 and climatic variables (air temperature, precipitation, and relative humidity) during the same period to determine if there exists a correlation between these variables. In addition, we compare the predictive performance of two regression models (SARIMA, SARIMAX) and a recurrent neural network model (RNN-LSTM) on the dengue incidence series. For this data from 1999–2014 was used for training and the three subsequent years of incidence 2015–2017 were used for prediction. The results show a correlation coefficient between the climatic variables and the incidence of dengue were low but statistical significant. The RMSE and MAPE obtained for the SARIMAX and RNN-LSTM models were 25.76, 108.44 and 26.16, 59.68, which suggest that any of these models can be used to predict new outbreaks. Although, it can be said that there is a limited role of climatic variables in the outputs the models. The value of this work is that it helps understand the behaviour of cases in a tropical setting as is the Metropolitan Region of Panama City, and provides the basis needed for a much needed early alert system for the region.

1. Introduction

As the global population grows and becomes more connected, human pathogens that were previously localized may expand their range and require greater collective action for disease control. One of the more likely candidates for geographic expansion is dengue virus (DENV), an Aedes-transmitted flavivirus that infects hundreds of millions of people per year [1] and is predicted to expand its range as the climate warms [2]. DENV circulates in the global tropics as four distinct serotypes (called DENV1, DENV2, DENV3 and DENV4) that show seasonal epidemic patterns, endemic behavior, and infrequent outbreaks, depending on the regional climate and altitude. Dengue hospitalizations number in the single-digit millions per year globally [3,4] presenting both a health and an economic burden [4,5]. After reintroduction of DENV to the Americas in 1980, the first endemic case of dengue was recorded in Panamanian territory in 1993 [6]. Since then, all four serotypes of DENV have been circulating in the canal lands for more than 25 years [7].
Proximity to breeding grounds of either Aedes aegypti and Aedes albopictus mosquitoes enhances the risk of transmission of the virus from one host to another [8]. It is known that typically an increase number of mosquitoes is reflected in an relative increase of the dengue cases and in Panama as in most tropical places, this increase occurs in the rainy season [9]. Moreover, dengue cases in Panama have been studied as spacial-temporal clusters of larvae and incidence [10,11], have been related to the existence and prevalence of used tires [12], and even cases have been related to socio-economical status [13]. However, to the best of our knowledge, there is no study relating climatic conditions with the incidence of dengue cases in the Metropolitan region of Panama.

Modeling the Dengue Incidence as a Time Series

Correlation, modeling, and prediction methods have been applied extensively to understand epidemics such as dengue. Most studies focus on the human host, accounting for the confirmed cases and from there trying to understand its relationship with climatic variables [14]. Other focus on the vector proliferation in all its stages in order to understand the dynamic of the vector incidences and its relation to cases incidence [15].
A wealth of studies trying to model Dengue incidence use statistical techniques, borrowed from time series analysis, to be able to model and predict new events of the series. This studies often make use of univariate methods, as the Autoregressive Integrated Moving Average-ARIMA, which tries to explain the values of the event of the series based on its own lags, moving averages and autoregressive components [16,17].
Other studies make use of multivariate methods, that are able to relate the incidence series to other variables that might be influential to the series. In that case, seasonal effects are taking into consideration, for instance: natural climate variations that happen during the year and the change of seasons. Some of the methods under this category are: Seasonal Autoregressive Integrated Moving Average-SARIMA and Seasonal Autoregressive Integrated Moving Average with eXogenous factors-SARIMAX [18,19,20,21,22].
More recently, studies have started to use machine learning techniques (neural networks and deep neural networks) focused on time-series analysis for studying this type of relationships [23].
For instance, recurrent neural networks (RNN) with long-short-term memory (LSTM) type, have been applied to forecast the incidence in 20 Chinese cities from climatic variables and historical cases [24]. Furthermore, hybrid methods using both traditional statistical and LSTM have been studied, evaluating their ability to reproduce epidemic events seen for dengue, as well as predict future incidence [25]. Also, implementations of stacking models have been made, where stacking involves averaging predictions of multiple models using a weighted average, these models outperform the Bayesian moving average [26]. Finally, social networks have been used be a potential predictor in forecasting the number of reported cases [27].
The objectives of this study are as follows: (1) assess the relation between dengue incidence cases and its relation with rural or urban settings in the metropolitan region of Panama; (2) to assess the correlation between confirmed dengue cases and air temperature, precipitation and relative humidity in the metropolitan region of Panama; (3) Assess if time series regression based models (SARIMA and SARIMAX) or a recursive neural network based model (RNN-LSTM) can learn the incidence pattern and able to predict the future behaviour of the series.

2. Materials and Methods

2.1. Dengue Incidence Data

Dengue incidence data were compiled by the Gorgas Memorial Institute of Health Studies (ICGES) and the Ministry of Health (MINSA). The data collection program was founded by the MINSA in 1988, and the data used in the present analysis is a subset of that described in Diaz et al. [7]. The definition of a dengue case follows the guidelines of the World Health Organization (WHO). From 1993 to 2011, the 1997 classification of clinical cases of Dengue fever (DF), Dengue with hemorrhagic fever (DHF), and Dengue shock syndrome (DSS) was applied. From 2012 forward, the definition of dengue with symptoms or without symptoms and severe dengue provided by WHO 2009 was applied [28]. ICGES, located in Panama City, was the primary site for conducting DENV laboratory assays for inpatients and outpatients in provincial hospitals in Panama who had fever that was clinically suspected to be dengue.
The data were aggregated by epidemic week and include all laboratory-confirmed dengue cases from January 1999 to December 2017, as it is shown in Figure 1.
The Metropolitan region, comprises the regions with the highest population density of the country, with more than 1.5 millions inhabitants. It comprises the provinces of Panamá, Panamá Oeste and Colón. It has with an area of approximately 14,034 km 2 and an average elevation of 80 (meters above sea level-MASL) [29]. Figure 2 shows the distribution by township of the accumulated confirmed cases of Dengue.
Many authors mention that the direct factors in the transmission of Dengue are both the climatic variables and the population density [12,13,30]. In order to assess the second premise about population density the national definition of rural areas, basically speaking, areas formed by the group of townships with less than 1500 inhabitants [29]. The whole metropolitan areas and the number of cases per-capita (cases per thousand people) in rural vs. urban areas were compared.

2.2. Climate Data

The data set of climatic variables were taken from the Metropolitan Park meteorological station (8.9944, −79.5430), belonging to the network of The Smithsonian Tropical Research Institute (STRI) [31,32,33]. The station is located in the extreme northwest of Panama City, at the eastern end of the Metropolitan Natural Park. It is surrounded by an 80-years-old semi-deciduous lowland forest. Figure 3 shows the exact location of the station, in relation to Panama City and the rest of the Country.
The full data set covers climate variables from 1995 to present, but it was arranged to cover period as dengue cases incidence. This weather station was chosen because it had an elevation similar to the average elevation of the studied area of about 94 MASL.
Three time series of interest were selected from this station: air temperature, precipitation and relative humidity. Empty data points were imputed using a multivariate deep learning LSTM model and the nearby station of Barro Colorado, similar to the procedure described in [34].

2.3. Time Series Creation

The confirmed cases were indexed by epidemiological week as a time series. The same was done for the climatic variables. However, for the air temperature and relative humidity the weekly average was used. As for precipitation, the accumulated amount of rain during the week was used.
Having all four series at the same interval (Equation (1)), was the first step to a series of analyses that will be explained in the next subsections.
Y i = Y i , 1 , Y i , 2 , Y i , 3 , , Y i , t : t T ; | T | = 991 weeks
where i is the time series = 1 , dengue incidence 2 , air temperature 3 , relative humidity 4 , precipitation ( rain )

2.3.1. Correlation

Once all time series were created, the second objective was to define if there was a correlation among them. For this matter, both the Pearson correlation coefficients (Equation (2)) [35] and the Spearman’s rank correlation coefficient (Equation (3)) [36], were calculated.
ρ Y i , Y j = c o v ( Y i , Y j ) σ Y i σ Y j
ρ R ( Y i ) , R ( Y j ) = c o v ( R ( Y i ) , R ( Y j ) ) σ R ( Y i ) σ R ( Y j )
where c o v ( · ) is the represents the covariance calculation between two time series, σ · is the standard deviation of the series and R ( · ) are the series values converted to ranks.

2.3.2. Prediction

Resulting significant correlations were used to predict the Dengue incidence time series Y 1 , t . Three different types of models M capable of learning the parameters of this time series and predicting y ^ 1 , t = M x 1 , x 2 , , x T were used.
The first two models being of the Moving Average Models (MA) family and the last one being a recurrent neural network method:
  • SARIMA: these models use non-seasonal differences, auto-regressions and moving average data from previous samples, and seasonal differences, auto-regressions and moving average from previous periods, which allows them to accurately predict the next steps of a time series, with the assumption that the same behavior is maintained in the causes of these events.
    The mathematical formulation of SARIMA models can be generalized as described in (Equation (4)).
    Δ S D Δ d Y t = Θ Q ( B S ) θ q ( B ) ϵ t Φ P ( B S ) ϕ p ( B )
    In this equation p, d and q represent the non-seasonal order of Auto-Regression (AR), differentiation and Moving Average (MA, respectively. P, D, and Q represent the seasonal order of AR, differentiation, and MA respectively. Moreover, Y t represents the time series data in period t. ϵ t represents the Gaussian white noise process (random walk) in period t. B represent the backward shift operator ( B k x t = x t k ). Δ S D represents the seasonal difference. Δ d represents the non-seasonal difference. S represent the seasonal order ( S = 52 for weekly data analysed yearly).
  • SARIMAX: these models are related to SARIMA and share the same variables of the model, however they include exogenous or explanatory variables of the time series. Also, they can be formulated adding a vector or matrix that represents the exogenous variables and his respective weights that represents the influence or contribution of these variables to the regression, as it can be seen in (Equation (5)):
    Δ S D Δ d Y t = Θ Q ( B S ) θ q ( B ) ϵ t + β k x k , t Φ P ( B S ) ϕ p ( B )
    In SARIMAX models, x k , t represents the vector or matrix that includes the k-th exogenous variable. β k represents the coefficient or weight that accompanies the k-th exogenous variable, all the coefficients are adjusted to the data in order to obtain the value that best relates their influences to the time series. Models that use seasonal variables are also called models with long-term memory.
  • RNN-LSTM: Recurrent Neural Networks (RNN), as with any neural network, can be used to adjust a non-linear model, by learning long-term dependencies that could be present in a data set, thus can be used to describe and model a time series. Although in practice what is known as an explosion or disappearance of the gradient happens [37]. A solution to this technical problem, is using networks with long-short-term memory (LSTM) [38] that in general perform better than RNNs.
    The advantage of LSTM networks lies in the way the hidden state is computed, in the iteration t, the output state (see Equation (11)), is calculated using the result of four components as input, known as follows: input gate (Equation (6)), forget gate (Equation (7)), output gate (Equation (8)) and a cell state (Equations (9) and (10)).
    Gates:
    i g = σ R i a h t 1 + W i x x t + b i
    f g = σ R f a h t 1 + W f x x t + b f
    o g = σ R o a h t 1 + W o x x t + b o
    where σ is a nonlinear activation function, for example, the sigmoid function, R i x and W i a are the recurring and input weights respectively, b i is the activation threshold or bias.
    Cell state:
    g t = tanh R g a h t 1 + W g x x t + b g
    C t = f g C t 1 + i g g t
    Hidden state:
    h t = o g tanh C t
    ⊙ represents the Hadamard’s product.
Figure 4 provides a visual representation on how this gates and cells interact inside each unit (neuron) of the recurrent neural network.
The architecture of the LSTM network used for this task, had an input layer, an LSTM Layer, a dropout Layer, a second LSTM Layer, a dropout Layer, and a final regression layer at the output. The dropout layers in the training process have and input parameter of 0.45, a learning rate of 5 × 10 6 was used, mini-batch size of 204 values (4 years of data were used), the LSTM have the same amount of hidden units (H.U.).
Once the network was trained the learned weights approximate/predict the following weekly values by actualizing the inner state with all the previous data an then using 12 weeks of data to predict the next one successively.
The features used to predict the Dengue incidence in the step t were the incidence, air temperature, precipitation and relative humidity of the step t 1 . The in-sample prediction take the past true value of the Dengue Incidence to predict the future one, the out-sample prediction take the past prediction to predict the next one successively.

2.3.3. Error Metrics

All three models were compared under the Root Mean Square Error (RMSE) (Equation (12)) and Mean Absolute Percentage Error (MAPE) (Equation (13)) performance metrics [37].
RMSE = 1 n i = 1 n y ^ 1 , t Y 1 , t 2
MAPE = 100 n i = 1 n y ^ 1 , t Y 1 , t Y 1 , t
The RMSE metric is a way of measuring how much a model fits to a set of data, and it can be a regression or prediction model as in our case, and MAPE measures the cumulative percentage error of a data set generated by a model on another set of target data.

2.3.4. Data Pre-Processing and Software

The data set was normalized within the [0,1] interval. In total 16 years (1999–2014) of data were used to train the models, and performance metrics were compared by predicting 3 years. For the organization of the data a spreadsheet software (Apache OpenOffice Calc 4.1.11, Apache Software Foundation) was used. For all the visualizations, correlation analysis and regression models (SARIMA and SARIMAX), the R statistical software was used, in specific the forecast, stats, xts packages. The RNN-LSTM network was implemented using the Deep-Learning Toolbox of Matlab 2021a.

3. Results

3.1. Urban vs. Rural by Population Density in the Metropolitan Region

As the first objective of this work was to assess the relation between dengue incidence cases and its relation with population density. First the incidence was tabulated per township size, using all data collected on dengue cases, including laboratory confirmed and epidemiological link (suspicious) cases. Also, the cumulative number cases, were aggregated distinguishing between rural and urban areas metropolitan region of Panama. Figure 5 shows the numbers of cases in rural and urban areas per 1000 cases in the 2004–2017 (unregistered and/or missing values caused to drop the 1999–2003 period). Furthermore, a significant difference was confirmed via a T-test (p < 0.01) when comparing both areas per year, see Appendix A. Confirming the relationship between population density and cases for the metropolitan region of Panama.
The data presented in Figure 5 shows a relationship between population density and the number of cases per-capita. This information have been used by local authorities to direct efforts to control Dengue outbreaks. The main strategies include the elimination of hatcheries and the eradication of vectors. Although, the same does not happen in less populated areas, where the virus is circulating almost freely. This is important to the light of recent studies by Whiteman et al. [13] and Bennett et al. [12] which state that in the not so populated areas the vector Aedes Albopictus is more active and in the more densely populated areas it is the Aedes Aegypti the most active vector.

3.2. Preliminary Series Analysis

This subsection compiles the preliminary statistical analysis carried out on the Dengue incidence and the climatic variables. The data throughout the study were grouped by week in order to better understand the influence of climatic variables and how they change over the year. As it can be seen in Figure 6, the distributions of confirmed Dengue cases over the years allows us to see that approximately 7–8 weeks after the onset of rain season (highlighted with light blue square) the incidence of Dengue cases tends to increase.
The intervals comprised by weeks 1–3 and 33–42 registered the highest average number of cases (>48 cases per week), with a value greater than the third quartile of the averages of all the weeks. In weeks 10–22, the lowest average number of cases was recorded (<12 cases per week), corresponding to a value lower than the first quartile. The highest number of cases registered in one week was 392.
The results of the incidence of Dengue, are directly influenced by the actions taken to prevent infections, public health strategies have demonstrated a high effectiveness in the control of vectors and with it in the control of Dengue outbreaks. However, there is a relationship with the climatic variables that could shed light on the critical moments that could improve the effectiveness of these strategies, then the climatic variables will be analyzed statistically to seek to understand the key moments in the prevention of Dengue epidemics.
Figure 7 shows that the highest average air temperatures (>26.3 C) were recorded in the weeks interval of 7–21 corresponding to the values above the third quartile, from weeks 42–51 the lowest average temperatures (<25.6 C) were recorded. This fits in with the sunniest months of summer and the rainiest months of winter respectively.
Figure 8 shows that from week 11–22 the lowest values of average relative humidity (<82%) were recorded, weeks 1–2 and 34–52 present the highest average relative humidity (>90%) with values higher than the third quartile, this includes the rainiest months (mid-September-October-November-mid-December) and it can be seen that even after the rains are over, the environment remains humid until about 4 weeks later. Panama tends to be a region with a high relative humidity, even more so in the summer season it is possible to find itself dry enough to influence the decrease in the breeding grounds of mosquitoes.
As it can be seen in Figure 9, from week 4 to week 13 the lowest amount of accumulated precipitation was recorded (<11 mm/week), this correspond to the lowest part of the first quartile. Weeks 38–50 register the highest amount of accumulated precipitation (>52 mm/week) as part of the upper third quartile. The beginning of the rainy season marks a turning point where the average temperature begins to decrease and the average relative humidity to increase.

3.3. Correlation Analysis

Results confirm that there is a correlation between the climatic variables and the incidence of Dengue. Both correlation coefficients (Spearman’s and Pearson’s) provide a measure of how strong these relationships are in different moments or lags. The summary of the correlation coefficients using the Spearman method are shown in Figure 10. Where air temperature is refered as AT, relative humidity as RH, and precipitation as RA.
The monotonic correlation has higher values and which vary more abruptly, while the linear correlation is smaller and varies smoothly. The correlation analysis supports what has been found statistically about the relationship in an increase in the incidence of Dengue with the climatic variables, it gives us light on the time in which these variables are reflected in the incidence of Dengue, being approximately at the time of the highest correlation (8–9 weeks before). The most significant correlations (p < 0.001) are those with a coefficient greater than 0.11.
Figure 11 shows the correlation calculated by the Pearson coefficient and its lags.
In this case, Air Temperature shows its strongest relationship with Dengue incidence two weeks later, Precipitation has its highest correlation at a lag of 8 weeks, and Relative Humidity maintains it’s stronger correlation without lag. The most significant correlations (p < 0.001) are those with a correlation coefficient greater than 0.10.

3.4. Prediction

The three different models were assessed with the variables and lags extracted from the correlation analysis described in the previous section.
  • SARIMA: The predictions were made using 16 years of training data (835 weeks) to predict the future index in the next 3 years (156 weeks), seasonality of the series was achieved after a (1) difference, in Figure 12 the prediction can be seen, the minimum amount of data necessary to make the prediction was also tested where a minimum of 6 years of data were needed to maintain a similar error metric to the one shown.
  • SARIMAX: The SARIMAX model uses the data of the Dengue incidence, the Air temperature with a lag of 2, the precipitation with a lag of 8 and the relative humidity with a lag if 0. As it can be seen in Figure 13 the results are very similar to those obtained using SARIMA.
In both cases, regression models for SARIMA and SARIMAX make a prediction error, presenting a peak just at the beginning of the predicted series. This could be attributed to the presence of a heavy periodic temporal component and the existence of a similar peak just in the previous period (see 2014 peek in Figure 1).
3.
LSTM: In the prediction using RNN-LSTM, an initial configuration of parameters that best adjusted to the training set was first tested. Then, it was verified by making a prediction within the sample set (using the real Dengue incidence to predict the future incidence in each step).
Figure 14 shows the seamlessly fit (shown in dotted line pattern) of the RNN-LSTM model tested by making a prediction within the sample.
After the adjustment and verification of the RNN preliminary results, the prediction was carried out outside the sample set, using only the training data to successively generate each point of the future Dengue incidence, as it can be seen in Figure 15.
It is of importance to notice, that even when out-of-sample prediction were made it still is able to predict a general pattern of the incidence series, with predicted peaks close to every elevation of cases in the 2015–2017 period.

3.5. Model Evaluation

The SARIMAX prediction model had the lowest RMSE results, while the RNN-LSTM obtained the lowest MAPE, which suggest that any of these models can be used to predict new outbreaks with the LSTM having a slight advantage. In Table 1 a summary of the results of the performance metrics, is shown.
Table 2 shows the predicted cases for each year, with the values in parentheses representing the relative percentual error, calculated from ground truth. The models over-predicted cases in 2015 and 2016, and under-predicted cases for 2017. But in general, under-predicted the cases for the complete 2015–2017 period.

4. Discussion

Dengue virus was reintroduced in Panama in 1993, since then all four serotypes has circulating causing several outbreaks [7]. Panama is located in the tropical zone, where climatic factors are favorable to the proliferation of mosquitoes. Temperature have been described as a best predictor of the increase of Dengue cases. Pinto et al. [39], found that minimum and maximum temperature are strong weather predictors for the increase of dengue cases, whereas rainfall and relative humidity have little correlation with dengue cases, using a data from 2000 to 2007 from Singapore. Optimal temperatures for mosquitoes proliferation are 20 C to 35 C. Aedes aegypti mosquitoes will amplify and transmit dengue viruses only if exposed to temperatures withing the range of 20 C to 35 C [40].
In our study the distribution of Dengue incidences was accumulated in the intervals of 1–3 and 33–42 weeks, which correlate with a drop of temperature due to the raining season. Although, environmental temperature in Panama remains all the year between 24 C to 36 C, other factors as of relative humidity and the accumulated precipitation could influence in the increment of dengue incidence. Models to predict the dynamic of a vector disease based only environmental variables, has the challenge of the introduction and reintroduction of different dengue serotypes or genotype, during the study period, could also influence the increment of Dengue cases [7], as well as the introduction of other arboviruses like Zika in 2015 and Chikungunya in 2014, that are transmitted by same vector and cause a similar clinical disease [41,42]
The results of this study shows that climatic variables could be important factors to predict Dengue incidence behavior, especially those whom affected the Aedes infestation index. It was observed that incidence of Dengue cases increased around of 7–8 weeks after raining season begins, similar to what Vargas et al. [43], and Diaz et al. [44], reported previously.
The results of the correlations reflect the effect of a larger population of mosquitoes, it is due to a greater availability of places as breeding sites during the rainy season, which are also connected to urban regions with higher population density.
The correlations indicate the existence of a strong relationship between the climatic variables and the Dengue Incidence cases, with precipitation being an indicator that warns up to 8 weeks before the start of the Dengue case season, knowing the weeks where the highest average of cases are registered as well as an estimate of the total number of cases, the necessary supplies can be obtained for the different interventions that are carried out to preserve public health during Dengue epidemics, the predictive models will not tell us exactly when a case will occur and the exact quantity but they will provide lights when facing the unknown.
It is a public health priority to study diseases with the potential to affect an increasingly large population. In this spirit, this manuscript proposes traditional statistical time-series analysis (SARIMA and SARIMAX) and RNN-LSTM networks to study the effect of climate variables on the incidence of dengue cases in the Metropolitan region of Panama for the 1999–2017 period. In general both, SARIMAX and RNN-LSTM could be suggest as predictor of new outbreaks, with the LSTM having a slight advantage.
LSTM models have great potential due to their flexibility and the ability to learn from previous events of a series. In future work, these capabilities can be put to work to predict future outbreaks, special by novel Attention based RNN-LSTM models. They allow the network to automatically search for the values within the past values that are most influential for the prediction [45,46].
From the modeling results, it seems that although the initial correlations of Dengue incidence with climatic variables were statistical significant. These variables play a minor role in the effectiveness of the SARIMAX and RNN-LSTM models. This conclusions are not surpising as Dengue is a complex problem with many exogenous variables that are not controlled in models. For instance, some authors suggest that it can be attributed to the possibility of having almost constant temperatures throughout the year [47]. Also, there are other factors that can influence the dengue cases, such as: the distribution of objects unintentionally dispersed by man that can serve as a breeding ground once the water can recharge in them, collection of water in containers for human consumption due the limited access to drinking water, which is truly relevant in the case of the metropolitan area of Panama City [48]. A way to further understand this relation, would be to apply spatio-temporal analysis with climatic measurements for the area studied as presented by Nascimineto et al. [49], for brazilian dry climate regions. Also, a numerical analysis based on differential equations, in which compartments could be used to represent human-vector interactions, as well quantifying local vector erradication and control efforts [15,50,51], could help understand the transmission strategies beyond climatic variables.
Finally, Out-of-sample forecasting is more challenging as the changing nature of dengue in Panama over the past two decades does not guarantee that past incidence will be a an accurate predictor of future incidence. This underlines the crucial nature of long-term population and climate changes that will need to be incorporated into dengue forecasting models. In this regards a great deal of work has been advanced in the determination of climate at the end of the century for Panama, mostly focusing on precipitation. Many studies concur that precipitation will shift to earlier months of the year and that their intensity and duration (Consecutive Wet Days, CWD) will increase [52,53,54,55,56].
One of the limitations of this study, is that its scope was to only look at short-medium term predictions. Another limitation is the availability of the data, as some parts of the time series had to be imputed using known methodologies. Also, measurements from a single metoreological station is used to represent the climatic used in this study.
In recent years public health has become a priority, when facing the challenges that the future holds. For future work, it would be advantageous for Panama to use prediction techniques and the use of early warning strategies, would provide a clear path for mitigation of a future risk.

Author Contributions

Conceptualization, V.N.V., Y.D., M.F.B., J.M.P. and J.E.S.-G.; methodology, V.N.V. and J.E.S.-G.; software, V.N.V. and J.E.S.-G.; validation, V.N.V., Y.D., M.F.B., J.M.P. and J.E.S.-G.; resources, M.F.B., Y.D., J.M.P. and J.E.S.-G.; writing—original draft preparation, V.N.V. and J.E.S.-G.; writing—review and editing, V.N.V., Y.D., M.F.B., J.M.P. and J.E.S.-G.; visualization, V.N.V. and J.E.S.-G.; supervision of students, J.E.S.-G. All authors have read and agreed to the published version of the manuscript.

Funding

V.N. was supported by a scholarship of the Programa de Fortalecimiento de los Postgrados Nacionales from the National Secretariat for Science, Technology and Innovation (SENACYT-Panama) The Sistema Nacional de Investigación (SNI) of SENACYT-Panama supports research activities by J.M.P. and J.E.S.-G. The APC was covered by J.E.S.-G. SNI funds.

Institutional Review Board Statement

Not applicable. The original study from which the data in this article was generated was approved by the Institutional Research Bioethics Committee for the Gorgas Memorial Institute for Health Studies approved under approval code 019/CBI/ICGES/18 on January 11th 2018.

Informed Consent Statement

Not applicable. The IRB (019/CBI/ICGES/18) covers the use of de-identified epidemiological data collected by the National Department of Epidemiology (NDE) from MINSA.

Data Availability Statement

Incidence of Dengue cases data is of public domain, and are compiled yearly by MINSA and published in the Health Statistical Yearbook http://minsa.gob.pa/informacion-salud/anuarios-estadisticos, last accessed in 15 November 2021. Climate data are available at https://biogeodb.stri.si.edu/physical_monitoring/research/metpark, last accessed on 15 November 2021.

Acknowledgments

The authors acknowledge administrative support provided by Universidad Tecnológica de Panamá (UTP) and the Gorgas Memorial Institute of Health Studies.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Analys of confirmed per-capita Dengue cases (cases per 1000 inhabitants) in rural and urban areas within the metropolitan region of Panama.
Table A1. Analys of confirmed per-capita Dengue cases (cases per 1000 inhabitants) in rural and urban areas within the metropolitan region of Panama.
YearCases in Rural AreasCases in Urban Areas
20040.53.1
20055.915.2
20065.310.7
20074.06.3
20082.94.6
20091.718.9
20100.62.4
201110.69.0
20121.01.2
20133.611.8
201411.416.8
20153.27.8
20163.110.1
20174.212.9
Average58130.8

References

  1. Bhatt, S.; Gething, P.W.; Brady, O.J.; Messina, J.P.; Farlow, A.W.; Moyes, C.L.; Drake, J.M.; Brownstein, J.S.; Hoen, A.G.; Sankoh, O.; et al. The global distribution and burden of dengue. Nature 2013, 496, 504–507. [Google Scholar] [CrossRef]
  2. Messina, J.P.; Brady, O.J.; Golding, N.; Kraemer, M.U.G.; Wint, G.R.W.; Ray, S.E.; Pigott, D.M.; Shearer, F.M.; Johnson, K.; Earl, L.; et al. The current and future global distribution and population at risk of dengue. Nat. Microbiol. 2019, 4, 1508–1515. [Google Scholar] [CrossRef]
  3. Stanaway, J.D.; Shepard, D.S.; Undurraga, E.A.; Halasa, Y.A.; Coffeng, L.E.; Brady, O.J.; Hay, S.I.; Bedi, N.; Bensenor, I.M.; Castañeda-Orjuela, C.A.; et al. The global burden of dengue: An analysis from the Global Burden of Disease Study 2013. Lancet Infect. Dis. 2016, 16, 712–723. [Google Scholar] [CrossRef] [Green Version]
  4. Shepard, D.S.; Undurraga, E.A.; Halasa, Y.A.; Stanaway, J.D. The global economic burden of dengue: A systematic analysis. Lancet Infect. Dis. 2016, 16, 935–941. [Google Scholar] [CrossRef]
  5. Hung, T.M.; Clapham, H.E.; Bettis, A.A.; Cuong, H.Q.; Thwaites, G.E.; Wills, B.A.; Boni, M.F.; Turner, H.C. The Estimates of the Health and Economic Burden of Dengue in Vietnam. Trends Parasitol. 2018, 34, 904–918. [Google Scholar] [CrossRef] [Green Version]
  6. Quiroz, E.; Ortega, M.; Guzmán, M.G.; Vázquez, S.; Pelegrino, J.L.; Campos, C.; Bayard, V.; Vázquez, M.; Kourí, G. Dengue en Panamá, 1993. Rev. Cuba. Med. Trop. 1997, 49, 86–93. [Google Scholar]
  7. Díaz, Y.; Chen-Germán, M.; Quiroz, E.; Carrera, J.P.; Cisneros, J.; Moreno, B.; Cerezo, L.; Martinez-Torres, A.O.; Moreno, L.; Barahona de Mosca, I.; et al. Molecular epidemiology of dengue in Panama: 25 years of circulation. Viruses 2019, 11, 764. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Gorgas, W.C. Sanitation in Panama; D. Appleton: New York, NY, USA, 1915. [Google Scholar]
  9. Gubler, D.; Trent, D. Emergence of epidemic dengue/dengue hemorrhagic fever as a public health problem in the Americas. Infect. Agents Dis. 1993, 2, 383–393. [Google Scholar]
  10. Whiteman, A.; Desjardins, M.R.; Eskildsen, G.A.; Loaiza, J.R. Detecting space-time clusters of dengue fever in Panama after adjusting for vector surveillance data. PLoS Negl. Trop. Dis. 2019, 13, e0007266. [Google Scholar] [CrossRef] [Green Version]
  11. Bennett, K.L.; McMillan, W.O.; Enriquez, V.; Barraza, E.; Díaz, M.; Baca, B.; Whiteman, A.; Medina, J.C.; Ducasa, M.; Martínez, C.G.; et al. The role of heterogenous environmental conditions in shaping the spatiotemporal distribution of competing Aedes mosquitoes in Panama: Implications for the landscape of arboviral disease transmission. Biol. Invasions 2021, 23, 1933–1948. [Google Scholar] [CrossRef]
  12. Bennett, K.L.; Martínez, C.G.; Almanza, A.; Rovira, J.R.; McMillan, W.O.; Enriquez, V.; Barraza, E.; Diaz, M.; Sanchez-Galan, J.E.; Whiteman, A.; et al. High infestation of invasive Aedes mosquitoes in used tires along the local transport network of Panama. Parasites Vectors 2019, 12, 1–10. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Whiteman, A.; Gomez, C.; Rovira, J.; Chen, G.; McMillan, W.O.; Loaiza, J. Aedes mosquito infestation in socioeconomically contrasting neighborhoods of Panama city. EcoHealth 2019, 16, 210–221. [Google Scholar] [CrossRef] [PubMed]
  14. Morin, C.W.; Comrie, A.C.; Ernst, K. Climate and dengue transmission: Evidence and implications. Environ. Health Perspect. 2013, 121, 1264–1272. [Google Scholar] [CrossRef]
  15. Alves, L.D.; Lana, R.M.; Coelho, F.C. A Framework for Weather-Driven Dengue Virus Transmission Dynamics in Different Brazilian Regions. Int. J. Environ. Res. Public Health 2021, 18, 9493. [Google Scholar] [CrossRef]
  16. Dhamodharavadhani, S.; Rathipriya, R. Forecasting dengue incidence rate in Tamil Nadu using ARIMA time series model. In Machine Learning for Healthcare; Chapman and Hall/CRC: Boca Raton, FL, USA, 2020; pp. 187–202. [Google Scholar]
  17. Thiruchelvam, L.; Dass, S.C.; Asirvadam, V.S.; Daud, H.; Gill, B.S. Determine neighboring region spatial effect on dengue cases using ensemble ARIMA models. Sci. Rep. 2021, 11, 1–9. [Google Scholar]
  18. Xavier, L.L.; Honório, N.A.; Pessanha, J.F.M.; Peiter, P.C. Analysis of climate factors and dengue incidence in the metropolitan region of Rio de Janeiro, Brazil. PLoS ONE 2021, 16, e0251403. [Google Scholar] [CrossRef] [PubMed]
  19. Lu, L.; Lin, H.; Tian, L.; Yang, W.; Sun, J.; Liu, Q. Time series analysis of dengue fever and weather in Guangzhou, China. BMC Public Health 2009, 9, 1–5. [Google Scholar] [CrossRef]
  20. Cortes, F.; Martelli, C.M.T.; de Alencar Ximenes, R.A.; Montarroyos, U.R.; Junior, J.B.S.; Cruz, O.G.; Alexander, N.; de Souza, W.V. Time series analysis of dengue surveillance data in two Brazilian cities. Acta Trop. 2018, 182, 190–197. [Google Scholar] [CrossRef]
  21. Gharbi, M.; Quenel, P.; Gustave, J.; Cassadou, S.; La Ruche, G.; Girdary, L.; Marrama, L. Time series analysis of dengue incidence in Guadeloupe, French West Indies: Forecasting models using climate variables as predictors. BMC Infect. Dis. 2011, 11, 1–13. [Google Scholar] [CrossRef] [Green Version]
  22. Bhatnagar, S.; Lal, V.; Gupta, S.D.; Gupta, O.P. Forecasting incidence of dengue in Rajasthan, using time series analyses. Indian J. Public Health 2012, 56, 281. [Google Scholar]
  23. Hoyos, W.; Aguilar, J.; Toro, M. Dengue Models Based on Machine-Learning Techniques: A Systematic Literature Review. Artif. Intell. Med. 2021, 119, 102157. [Google Scholar] [CrossRef]
  24. Xu, J.; Xu, K.; Li, Z.; Meng, F.; Tu, T.; Xu, L.; Liu, Q. Forecast of dengue cases in 20 Chinese cities based on the deep learning method. Int. J. Environ. Res. Public Health 2020, 17, 453. [Google Scholar] [CrossRef] [Green Version]
  25. Khaira, U.; Utomo, P.E.P.; Aryani, R.; Weni, I. A comparison of SARIMA and LSTM in forecasting dengue hemorrhagic fever incidence in Jambi, Indonesia. J. Phys. Conf. Ser. 2020, 1566, 012054. [Google Scholar] [CrossRef]
  26. Colón-González, F.J.; Soares Bastos, L.; Hofmann, B.; Hopkin, A.; Harpham, Q.; Crocker, T.; Amato, R.; Ferrario, I.; Moschini, F.; James, S.; et al. Probabilistic seasonal dengue forecasting in Vietnam: A modelling study using superensembles. PLoS Med. 2021, 18, e1003542. [Google Scholar] [CrossRef]
  27. Mussumeci, E.; Coelho, F.C. Large-scale multivariate forecasting models for Dengue-LSTM versus random forest regression. Spat. Spatio-Temporal Epidemiol. 2020, 35, 100372. [Google Scholar] [CrossRef] [PubMed]
  28. Ministerio de Salud de Panama (MINSA). Decreto Ejecutivo N 1617. In Gaceta OFICIAL; MINSA: Panama City, Panama, 2014. [Google Scholar]
  29. Instituto Nacional de Estadística y Censo. Panama en Cifras; Contraloria General de la Republica: Panama City, Panama, 2020. [Google Scholar]
  30. Rose, N.H.; Sylla, M.; Badolo, A.; Lutomiah, J.; Ayala, D.; Aribodor, O.B.; Ibe, N.; Akorli, J.; Otoo, S.; Mutebi, J.P.; et al. Climate and urbanization drive mosquito preference for humans. Curr. Biol. 2020, 30, 3570–3579. [Google Scholar] [CrossRef] [PubMed]
  31. Paton, S. Parque Metropolitano Crane, 25 m Relative Humidity. Smithsonian Tropical Research Institute. Dataset. Figshare 2021. Available online: https://smithsonian.figshare.com/articles/dataset/Parque_Metropolitano_Crane_25m_Relative_Humidity/10042661 (accessed on 17 November 2021). [CrossRef]
  32. Paton, S. Parque Metropolitano Crane Precipitation. Smithsonian Tropical Research Institute. Dataset. Figshare 2021. Available online: https://smithsonian.figshare.com/articles/dataset/Parque_Metropolitano_Crane_Precipitation/10042676 (accessed on 17 November 2021). [CrossRef]
  33. Paton, S. Parque Metropolitano Crane, 25 m Air Temperature. Smithsonian Tropical Research Institute. Dataset. Figshare 2021. Available online: https://smithsonian.figshare.com/articles/dataset/Parque_Metropolitano_Crane_25m_Air_Temperature/10042652 (accessed on 17 November 2021). [CrossRef]
  34. Bellido-Jiménez, J.A.; Gualda, J.E.; García-Marín, A.P. Assessing Machine Learning Models for Gap Filling Daily Rainfall Series in a Semiarid Region of Spain. Atmosphere 2021, 12, 1158. [Google Scholar] [CrossRef]
  35. Pearson, K. VII. Note on regression and inheritance in the case of two parents. Proc. R. Soc. Lond. 1895, 58, 240–242. [Google Scholar]
  36. Spearman, C. The proof and measurement of association between two things. Am. J. Psychol. 1987, 100, 441–471. [Google Scholar] [CrossRef] [PubMed]
  37. Bengio, Y.; Simard, P.; Frasconi, P. Learning long-term dependencies with gradient descent is difficult. IEEE Trans. Neural Netw. 1994, 5, 57–166. [Google Scholar] [CrossRef]
  38. Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
  39. Pinto, E.; Coelho, M.; Oliver, L.; Massad, E. The influence of climate variables on dengue in Singapore. Int. J. Environ. Health Res. 2011, 21, 415–426. [Google Scholar] [CrossRef] [PubMed]
  40. Carrington, L.B.; Armijos, M.V.; Lambrechts, L.; Scott, T.W. Fluctuations at a low mean temperature accelerate dengue virus transmission by Aedes aegypti. PLoS Negl. Trop. Dis. 2013, 7, e2190. [Google Scholar] [CrossRef]
  41. Carrera, J.P.; Díaz, Y.; Denis, B.; Barahona de Mosca, I.; Rodriguez, D.; Cedeño, I.; Arauz, D.; González, P.; Cerezo, L.; Moreno, L.; et al. Unusual pattern of chikungunya virus epidemic in the Americas, the Panamanian experience. PLoS Negl. Trop. Dis. 2017, 11, e0005338. [Google Scholar] [CrossRef]
  42. Sigfrid, L.; Reusken, C.; Eckerle, I.; Nussenblatt, V.; Lipworth, S.; Messina, J.; Kraemer, M.; Ergonul, O.; Papa, A.; Koopmans, M.; et al. Preparing clinicians for (re-) emerging arbovirus infectious diseases in Europe. Clin. Microbiol. Infect. 2018, 24, 229–239. [Google Scholar] [CrossRef] [Green Version]
  43. Vargas, S.L.; Céspedes, D.C.; Vergel, J.D.; Ruiz, E.P.; Luna, M.C. Co-infección por los virus del dengue y chikungunya. Revisión narrativa. Rev. Chil. Infectol. 2018, 35, 658–668. [Google Scholar] [CrossRef] [Green Version]
  44. Díaz, Y.; Cisneros, J.; Guzmán, H.; Cordoba, P.; Carrera, J.P.; Moreno, B.; Chen, R.; Mewa, J.C.; García, L.; Cerezo, L.; et al. The reintroduction of DENV-2 in 2011 in Panama and subsequent outbreak characteristic. Acta Trop. 2018, 177, 58–65. [Google Scholar] [CrossRef] [PubMed]
  45. Wu, N.; Green, B.; Ben, X.; O’Banion, S. Deep transformer models for time series forecasting: The influenza prevalence case. arXiv 2020, arXiv:2001.08317. [Google Scholar]
  46. Kondo, K.; Ishikawa, A.; Kimura, M. Sequence to sequence with attention for influenza prevalence prediction using google trends. In Proceedings of the 2019 3rd International Conference on Computational Biology and Bioinformatics, Nagoya, Japan, 17–19 October 2019; pp. 1–7. [Google Scholar]
  47. Polwiang, S. The correlation of climate factors on dengue transmission in urban area: Bangkok and Singapore cases. PeerJ Prepr. 2016, 4, e2322v1. [Google Scholar]
  48. Vaux, H., Jr.; Vammen, K.; Bernex, N.; Fabrega, J.; Forde, M.; Roldan, G.; Torregrosa, M.L. The challenges of managing the urban waters of the Americas. Environ. Sci. Policy Sustain. Dev. 2020, 62, 14–29. [Google Scholar] [CrossRef]
  49. Nascimento Filho, A.S.; Murari, T.B.; Ferreira, P.; Saba, H.; Moret, M.A. A spatio-temporal analysis of dengue spread in a Brazilian dry climate region. Sci. Rep. 2021, 11, 1–8. [Google Scholar]
  50. Kong, L.; Wang, J.; Li, Z.; Lai, S.; Liu, Q.; Wu, H.; Yang, W. Modeling the heterogeneity of dengue transmission in a city. Int. J. Environ. Res. Public Health 2018, 15, 1128. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  51. Metelmann, S.; Liu, X.; Lu, L.; Caminade, C.; Liu, K.; Cao, L.; Medlock, J.M.; Baylis, M.; Morse, A.P.; Liu, Q. Assessing the suitability for Aedes albopictus and dengue transmission risk in China with a delay differential equation model. PLoS Negl. Trop. Dis. 2021, 15, e0009153. [Google Scholar] [CrossRef]
  52. Kusunoki, S.; Nakaegawa, T.; Pinzón, R.; Sanchez-Galan, J.E.; Fábrega, J.R. Future precipitation changes over Panama projected with the atmospheric global model MRI-AGCM3. 2. Clim. Dyn. 2019, 53, 5019–5034. [Google Scholar] [CrossRef]
  53. Martínez, M.M.; Nakaegawa, T.; Pinzón, R.; Kusunoki, S.; Gordón, R.; Sanchez-Galan, J.E. Using a Statistical Crop Model to Predict Maize Yield by the End-Of-Century for the Azuero Region in Panama. Atmosphere 2020, 11, 1097. [Google Scholar] [CrossRef]
  54. Pinzón, R.E.; Hibino, K.; Takayabu, I.; Nakaegawa, T. Virtually experiencing future climate changes in Central America with MRI-AGCM: Climate analogues study. Hydrol. Res. Lett. 2017, 11, 106–113. [Google Scholar] [CrossRef] [Green Version]
  55. Nakaegawa, T.; Arakawa, O.; Kamiguchi, K. Investigation of climatological onset and withdrawal of the rainy season in Panama based on a daily gridded precipitation dataset with a high horizontal resolution. J. Clim. 2015, 28, 2745–2763. [Google Scholar] [CrossRef]
  56. Nakaegawa, T.; Pinzon, R.; Fabrega, J.; Cuevas, J.A.; De Lima, H.A.; Cordoba, E.; Nakayama, K.; Batista Lao, J.I.; Lau Melo, A.; Gonzalez, D.A.; et al. Seasonal changes of the diurnal variation of precipitation in the upper Río Chagres basin, Panamá. PLoS ONE 2019, 14, e0224662. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Map with the scale distribution of the accumulated number of reported cases of Dengue in the Metropolitan Region.
Figure 1. Map with the scale distribution of the accumulated number of reported cases of Dengue in the Metropolitan Region.
Ijerph 18 12108 g001
Figure 2. Map with the scale distribution of the accumulated number of reported cases of Dengue in the Metropolitan Region.
Figure 2. Map with the scale distribution of the accumulated number of reported cases of Dengue in the Metropolitan Region.
Ijerph 18 12108 g002
Figure 3. Map with the location of the Metropolitan Natural Park.
Figure 3. Map with the location of the Metropolitan Natural Park.
Ijerph 18 12108 g003
Figure 4. Graphical representation of an RNN of LSTM type.
Figure 4. Graphical representation of an RNN of LSTM type.
Ijerph 18 12108 g004
Figure 5. Portion of the number of cases per-capita in urban and rural areas per year.
Figure 5. Portion of the number of cases per-capita in urban and rural areas per year.
Ijerph 18 12108 g005
Figure 6. Weekly distribution of confirmed dengue cases, grouped by weeks along the study years.
Figure 6. Weekly distribution of confirmed dengue cases, grouped by weeks along the study years.
Ijerph 18 12108 g006
Figure 7. Weekly average air temperature distribution, the weekly average air temperature was grouped by week to visualize changes over the years by week.
Figure 7. Weekly average air temperature distribution, the weekly average air temperature was grouped by week to visualize changes over the years by week.
Ijerph 18 12108 g007
Figure 8. Weekly distribution of average Relative Humidity, grouped by week along the study period.
Figure 8. Weekly distribution of average Relative Humidity, grouped by week along the study period.
Ijerph 18 12108 g008
Figure 9. Weekly distribution of accumulated Precipitation, grouped by week along the study period.
Figure 9. Weekly distribution of accumulated Precipitation, grouped by week along the study period.
Ijerph 18 12108 g009
Figure 10. Variation in Spearman’s correlation coefficient as climatic variables are lagged.
Figure 10. Variation in Spearman’s correlation coefficient as climatic variables are lagged.
Ijerph 18 12108 g010
Figure 11. Variation in Pearson’s correlation coefficient as climatic variables are lagged.
Figure 11. Variation in Pearson’s correlation coefficient as climatic variables are lagged.
Ijerph 18 12108 g011
Figure 12. Prediction of Dengue Incidence for 3 years using a SARIMA model and comparison with the real number of cases.
Figure 12. Prediction of Dengue Incidence for 3 years using a SARIMA model and comparison with the real number of cases.
Ijerph 18 12108 g012
Figure 13. Prediction of Dengue Incidence for 3 years using a SARIMAX model and comparison with the real number of cases.
Figure 13. Prediction of Dengue Incidence for 3 years using a SARIMAX model and comparison with the real number of cases.
Ijerph 18 12108 g013
Figure 14. Fitting of Dengue Incidence for 3 years using a RNN-LSTM in-sample model and comparison with the real number of cases.
Figure 14. Fitting of Dengue Incidence for 3 years using a RNN-LSTM in-sample model and comparison with the real number of cases.
Ijerph 18 12108 g014
Figure 15. Prediction of Dengue Incidence for 3 years using a RNN-LSTM out-sample model and comparison with the real number of cases.
Figure 15. Prediction of Dengue Incidence for 3 years using a RNN-LSTM out-sample model and comparison with the real number of cases.
Ijerph 18 12108 g015
Table 1. Summary of the performance metrics obtained by the 3 models studied.
Table 1. Summary of the performance metrics obtained by the 3 models studied.
ModelRMSEMAPE
SARIMA (3,1,3)(1,1,1)(52)25.83112.70
SARIMAX (3,1,3)(1,1,1)(52)25.76108.44
LSTM 208 H.U.26.1659.68
Table 2. Summary of the prediction year-by-year of the 3 models studied.
Table 2. Summary of the prediction year-by-year of the 3 models studied.
Model201520162017Total
Ground Truth1478173630056119
SARIMA1682 (14%)1970 (20%)1978 (−34%)5631 (−8%)
SARIMAX1558 (5%)1882 (15%)2001 (−33%)5441 (−11%)
LSTM1648 (11%)1915 (17%)1687 (−44%)5241 (−14%)
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Navarro Valencia, V.; Díaz, Y.; Pascale, J.M.; Boni, M.F.; Sanchez-Galan, J.E. Assessing the Effect of Climate Variables on the Incidence of Dengue Cases in the Metropolitan Region of Panama City. Int. J. Environ. Res. Public Health 2021, 18, 12108. https://0-doi-org.brum.beds.ac.uk/10.3390/ijerph182212108

AMA Style

Navarro Valencia V, Díaz Y, Pascale JM, Boni MF, Sanchez-Galan JE. Assessing the Effect of Climate Variables on the Incidence of Dengue Cases in the Metropolitan Region of Panama City. International Journal of Environmental Research and Public Health. 2021; 18(22):12108. https://0-doi-org.brum.beds.ac.uk/10.3390/ijerph182212108

Chicago/Turabian Style

Navarro Valencia, Vicente, Yamilka Díaz, Juan Miguel Pascale, Maciej F. Boni, and Javier E. Sanchez-Galan. 2021. "Assessing the Effect of Climate Variables on the Incidence of Dengue Cases in the Metropolitan Region of Panama City" International Journal of Environmental Research and Public Health 18, no. 22: 12108. https://0-doi-org.brum.beds.ac.uk/10.3390/ijerph182212108

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop