Next Article in Journal
Modeling the Impact of Fiscal Decentralization on Energy Poverty: Do Energy Efficiency and Technological Innovation Matter?
Next Article in Special Issue
Ecogeographic Drivers of the Spatial Spread of Highly Pathogenic Avian Influenza Outbreaks in Europe and the United States, 2016–Early 2022
Previous Article in Journal
Removal of Antibiotic Resistance Genes from Animal Wastewater by Ecological Treatment Technology Based on Plant Absorption
Previous Article in Special Issue
Passive Surveillance of Human-Biting Ixodes scapularis Ticks in Massachusetts from 2015–2019
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Epidemic Diffusion Network of Spain: A Mobility Model to Characterize the Transmission Routes of Disease

by
Javier Del-Águila-Mejía
1,2,3,*,
David García-García
2,4,
Ayelén Rojas-Benedicto
2,4,5,
Nicolás Rosillo
6,
María Guerrero-Vadillo
2,4,
Marina Peñuelas
2,4,
Rebeca Ramis
2,4,
Diana Gómez-Barroso
2,4 and
Juan de Mata Donado-Campos
1,4
1
Departamento de Medicina Preventiva y Salud Pública y Microbiología, Facultad de Medicina, Universidad Autónoma de Madrid. C. Arzobispo Morcillo 4, 28029 Madrid, Spain
2
Centro Nacional de Epidemiología, Instituto de Salud Carlos IIII, Calle de Melchor Fernández Almagro 5, 28029 Madrid, Spain
3
Servicio de Medicina Preventiva, Hospital Universitario de Móstoles, Calle Río Júcar s/n, 28935 Móstoles, Spain
4
Consorcio de Investigación Biomédica en Red de Epidemiología y Salud Pública (CIBERESP), Calle Monforte de Lemos 3-5, 28029 Madrid, Spain
5
Universidad Nacional de Educación a Distancia (UNED), Calle de Bravo Murillo 38, 28015 Madrid, Spain
6
Servicio de Medicina Preventiva, Hospital Universitario 12 de Octubre, Avenida de Córdoba s/n, 28041 Madrid, Spain
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2023, 20(5), 4356; https://0-doi-org.brum.beds.ac.uk/10.3390/ijerph20054356
Submission received: 23 January 2023 / Revised: 23 February 2023 / Accepted: 27 February 2023 / Published: 28 February 2023
(This article belongs to the Special Issue 2nd Edition: Infectious Disease Modeling in the Era of Complex Data)

Abstract

:
Human mobility drives the geographical diffusion of infectious diseases at different scales, but few studies focus on mobility itself. Using publicly available data from Spain, we define a Mobility Matrix that captures constant flows between provinces by using a distance-like measure of effective distance to build a network model with the 52 provinces and 135 relevant edges. Madrid, Valladolid and Araba/Álaba are the most relevant nodes in terms of degree and strength. The shortest routes (most likely path between two points) between all provinces are calculated. A total of 7 mobility communities were found with a modularity of 63%, and a relationship was established with a cumulative incidence of COVID-19 in 14 days (CI14) during the study period. In conclusion, mobility patterns in Spain are governed by a small number of high-flow connections that remain constant in time and seem unaffected by seasonality or restrictions. Most of the travels happen within communities that do not completely represent political borders, and a wave-like spreading pattern with occasional long-distance jumps (small-world properties) can be identified. This information can be incorporated into preparedness and response plans targeting locations that are at risk of contagion preventively, underscoring the importance of coordination between administrations when addressing health emergencies.

1. Introduction

The COVID-19 disease took only a few months from its emergence in the Chinese province of Wuhan to become a global pandemic affecting the world [1]. As with other respiratory viruses, its main route of infection is direct person-to-person contact between an infectious host and a susceptible individual, and therefore, COVID-19 dynamics are shaped mainly by the way both individuals and populations interact with each other [2]. This implies a multi-scale nature of transmission dynamics in which human mobility plays a key role explaining local outbreaks and epidemics [3], geographical diffusion [4,5] and international spread of pandemics [6,7].
We have seen a surge in mobility information availability mainly due to the innovative development of big data techniques [8,9,10,11] that the COVID-19 pandemic catalyzed. Since the early stages of the pandemic, numerous studies have been conducted in countries such as China [12,13,14], Italy [15,16], Spain [17] or the USA [5,9,18], among others [19]. Mobility data have been used to model geographical diffusion [12,13,14,15,17,18,19,20], epidemic growth [5,13,16] or to evaluate non-pharmaceutical interventions in place [5,12], primarily by examining the relationship between epidemiological outcomes and changes in the number of travelers from a particular source location.
Gibbs et al., through exploring mobility trends in mainland China in early 2020, noted that lockdowns were effective in reducing the number of people traveling, but not in changing their destinations, despite restrictions being in place [14]. Mobility here can be understood as a subjacent matrix of connections or a network—a notion long used in epidemic models with coupled metapopulations interchanging individuals (travelers) [21]. Analyzing the structure of mobility networks themselves, rather than considering raw flows as explaining covariates, may provide useful insights on ‘why’ or ‘how’ some phenomena have happened, rather than just describing ‘what’ happened.
Network science methods focus on the study of relationships between elements [22]; the diffusion of diseases in contact or travel networks has been a cornerstone of this field since its inception [22,23]. Two central components can be identified in this context: graph theory, which studies the properties derived from the network's structure and how it determines aspects of disease dynamics [24], and social network, which provides analytical tools for quantifying social behavior (i.e., contact patterns) and relates it to individual and population metrics of centrality, transitivity or modularity; these are useful for statistical and epidemiological models [25,26] that later translate into epidemiological risks, e.g., of infectivity, exposure or transmissibility [22,26,27].
This relationship between network methods, disease transmission and the modelling of large-scale geographical mobility networks is an area that requires further investigation to develop evidence-based frameworks in infectious disease epidemiology [26]. Mobility can introduce a new level of complexity with a multiplicity of possible routes, distances and geographical patterns for disease diffusion [7,22]. Such insights can inform public health authorities for future health emergencies if empirical data from actual case studies and large mobility datasets become available to researchers, something that was not accessible before the COVID-19 pandemic.

Objectives

With this goal in mind, the objective of the present study was to analyze and characterize mobility patterns in Spain, using data gathered by the Spanish Government during 2020–21 [28]. First, raw mobility between the 52 Spanish provinces was turned into a stable matrix of connections, the Mobility Matrix (MM). Second, a directed network was built from the MM after turning raw mobility into a unit of disease transmissibility, the effective distance [7], which enabled us to directly describe both the network and node properties in terms of their epidemiological risk of disease transmission. Third, we used COVID-19 data from the same time period [29] to test how mobility communities within the network coupled the province’s epidemics, thus linking disease dynamics with mobility structure.

2. Materials and Methods

2.1. Data Sources

Mobility data from 14 February 2020 to 9 May 2021 was gathered by the Ministry of Transportation, Mobility and Urban Agenda (MITMA) while state-of-alarm laws to answer the COVID-19 pandemic were in force [28]. We defined Spain´s 52 provinces as the area of study (Figure 1) and data were aggregated in the form of daily unique travels be-tween all provinces. The COVID-19 data for Spain were available from the Instituto de Salud Carlos III public information dashboard [29]. This information was gathered through the National Surveillance Network (RENAVE, in Spanish). Demographic information was provided by the Instituto Nacional de Estadística (INE) [30].

2.2. Mobility Matrix of Spain and Shortest Distance Maps

A description of mobility patterns in Spain was first carried out to characterize behavioral, seasonal and geographical properties. Let F n m be the flux of daily total travelers emanating from a province m to any different province n, and P n m the fraction of people leaving m and going to n among all people leaving m [7]:
P n m = F n m n m F n m
The P n m fractions represent the distribution of the relative importance of each destination province n connected with m. We defined an adjacency matrix containing all P n m pairs for the 52 provinces and named it our Mobility Matrix (MM). We compared a mean MM for the whole 2020 period with subsegments in different 2020 periods by the means of MM ratios and standard deviations (Supplementary Materials, Table S1) to conclude that there were no significant differences in time, season or measures in place between any of them, thus granting stability. To further study transmission phenomena within this MM, all percentages P n m were turned into a distance-like unit: Effective Distance (ED), as proposed and tested by [7]:
d n m = 1 - l o g P n m
ED is a measure of proximity: the larger P n m , the lesser d n m , and therefore, the closer node n is from m in terms of disease diffusion [7]. Complex connection patterns emerge from the matrix P n m and its d n m derivate, since EDs allow to mathematically compute paths along the MM that imply multi-step routes due to the aggregative property of logarithms compared with the conditional probability of percentages [7]. We can find the shortest distances between two provinces D n m as the minimal sum of d n m required to reach node n from m.
For example, if d B A = 6 but d C A = 2 and d B C = 2 , any travelling disease would arrive from A to B faster by transitioning through C ( D B A = 4 ) rather than directly from A to B. The set of shortest paths between pairs of provinces within the MM was obtained using the classical Dijkstra’s algorithm [31] implemented in the igraph R package [32]. For each province, a map of all the shortest incoming and outgoing D n m was created.
An MM built from effective distances informed of the relative connectivity strengths of the nodes of a network in relation to each node's mobility flow, rather than the raw numbers of moving individuals. As shown in [7], this quantity was empirically proven to better reflect the role of human mobility with the spread of infectious diseases than the untransformed flows. Including effective distances in a network is a novel approach that aims to improve the generalizability of our results to epidemiological conclusion.

2.3. Epidemic Diffusion Network and Measures

From the MM, we defined a network [33] to represent and uncover these complex mobility patterns that might drive diffusion events. To model the network, only relevant connections (e.g., those with P n m over 10%, see Supplementary Material) were selected. When applied to the study of disease diffusion dynamics, we named this network the “Epidemic Diffusion Network” (EDN) and studied several network measures with epidemiological relevance to the spread of diseases (Table S2).
In a network, the degree of a node is the number of edges connected to it (incoming and outgoing)—a representation of connectiveness. In weighted networks, the strength of a node is the sum of the weights of each edge connected to the node, i.e., the weighted degree. Degree and strength are local measures that account only for the neighboring properties of nodes. Betweenness, on the other hand, sums up the number of times the node lies within the path between any other two nodes, thus scoring nodes that act as a bridge or intersecting points [27,34]. The three are estimated for each node of the EDN.
Network substructure analysis was performed to detect communities using the Walktrap [35] and Infomap [36] algorithms that were implemented in the igraph R package [32]. These were chosen based on the “Question Alignment” method, considering the nature of the research question, given that Walktrap and Infomap are better suited for community detection for transmission events between nodes [37]. A found community must be understood as a mobility cluster in epidemiological terms. This implies that nodes are connected in such a way that movement (flow) within the community is much more likely. The percentage of total movements happening within communities rather than between them is known as the modularity score. The clustering of nodes happens hierarchically and will be represented with a dendrogram.

2.4. Network Structure and COVID-19 Dynamics

Since mobility drives the geographical diffusion of COVID-19, it would be expected that provinces within the same community, which share the most of their mobility fluxes, also share more similar disease dynamics due to increased multi-seeding processes [17]. These dynamics are summed up in the cumulative incidence in 14-day (CI14) epidemic curves; therefore, we can compare their trajectories, shapes and magnitude throughout the study period.
We adopted an analytical approximation by proposing a measure of curve proximity. We can compute the absolute difference in daily CI14 between any two provinces m , n for a given day t:
d m n , t = CI 14 m , t - CI 14 n , t ,
where CI 14 m , t denotes the CI14 at province m on day t. The mean d m n ¯ of these daily differences over the time under study is the global proximity measure between the disease dynamics of any two provinces. The lower d m n ¯ value, the more similar COVID-19 dynamics that two provinces have experienced.
The 52 × 52 possible d m n ¯ are calculated between each province and (1) the rest of the provinces within its community and (2) the rest of the provinces outside. All obtained means are compared with a T test with the null hypothesis of no differences in d m n ¯ , i.e., being part of a community does not imply a difference in disease dynamic for the studied period.
All analyses were made using R version 4.0.3 [38] and the network package igraph [32].

3. Results

3.1. Mobility Matrix of Spain

The daily aggregated total mobility from the 14 February 2020 to 9 May 2021 is shown in Figure 2. The first week (14–21 February 2020) is considered the reference mobility value before the pandemic began. The first sharp decline in March 2020 happens after the first declaration of the stay-at-home order, which was lifted in June when mobility almost reached pre-pandemic values. By the end of August, it declines again and does not recover until the end of our study period, even after the second state of alarm came into force from 24 October to 9 May.
Using the example of the relationship between two provinces, we see how raw F n m give place to the stable patterns of provinces’ connections: originally, the flux between them is identical in both ways (Figure 3A) and follows the variations depicted in Figure 2. However, once turned into the P n m , we learn that the originally identical F n m now represent very different percentages for each of them: ~80% of total travels for one and only ~25% for the other (Figure 3B). P n m are stable connection patterns that are not affected by seasonality or measures in place and give birth to each province’s set of connections (Figure 3C,D), where major fluxes that will conform the EDN can be clearly discerned from the larger number of less relevant connections existing. A deeper explanation of this rationale with examples can be found in the Supplementary Materials.

3.2. Epidemic Diffusion Network and Shortest Distance Maps

The 52 provinces are the network´s nodes and, between them, 135 weighted edges [33] represent the main MM connections, as defined in the Supplementary Material. An example of the shortest distance maps is shown in Figure 4. The modelled network is presented in Figure 5A. Madrid scores the highest degree (11), followed by Valladolid (9), Araba/Álava (8), Málaga (8), Sevilla (8) and Valencia (8). Madrid, Valladolid and Araba/Álava also record the highest strength (25.7, 25.3 and 20.2). The network flow betweenness, on the other hand, concentrates in provinces of the southernmost community. The mean (SD) values for the entire network are 5.2 (2) degrees, 12.5 (5.5) strength and 318.1 (408.9) flow betweenness (Table S3).
A total of seven communities are detected within the network, not differing between Walktrap and Infomap algorithms (Figure 5 and Figure S6). The biggest community is the southernmost, consisting in a total of twelve provinces and connected to the outside only by two provinces. On the other hand, the smallest communities are the two composed by the Canary Islands (two provinces) and Galicia (four provinces). The first is dependent only on Madrid as the main connection. The other four communities represent different geographical regions. The highest scoring provinces in terms of degree/strength are well divided among communities. The network´s modularity is 63%. The hierarchical clustering dendrogram and qualitative conclusions derived from it are depicted in the Supplementary Materials.

3.3. Network Structure and COVID-19 Dynamics

COVID-19 CI14 epidemic curves by provinces grouped by the community can be seen in Figure 6. Provinces in the same network community seem to share a more similar COVID-19 incidence, represented by their curves’ trajectories, shapes and magnitude. This is confirmed by the fact that all provinces have a statistically significant lower mean daily difference d i j ¯ with other provinces within their same community compared with the outer ones (Figure 7).
Three communities show a smaller d i j ¯ between inner and outer provinces: Central–East (138.4 vs. 150.5), Northeast (131.1 vs. 143) and Central–West (101.7 vs. 133.7) communities. They encompass 28 out of 52 provinces together, including the main cities such as Madrid, Barcelona, Valencia, Valladolid or Zaragoza, with all having experienced a significant epidemic burden (Figure 6). They are followed by the South (101.7 vs. 133.7) and North (111 vs. 152.9) communities, which showed greater mean CI14 differences with the outside provinces. Finally, the greatest differences are found for the Canary Islands (49.9 vs. 175.3) and Northwest (50.3 vs. 130) communities, noting, at the same time, that these two communities were the least impacted in terms of CI14 (Figure 6), and both remain to be peripheral small communities (two and four provinces) within the network (Figure 5).

4. Discussion

We used raw mobility data gathered during the COVID-19 pandemic to curate and develop a Mobility Matrix (MM) in the form of an adjacency matrix between the 52 provinces of Spain; this was used to build a network model to unveil complex properties that drive geographical diffusion phenomena linked to mobility (such as, but not limited to, epidemics). This network has been described, together with its properties, and linked to COVID-19 dynamics between March 2020 and May 2021.
Mobility has been analyzed and described in successive layers of complexity: first, by depicting the raw fluxes of daily travelers. Secondly, by turning them into a distance-like variable and unveiling the constant relationships and patterns emerging from the data. Third, by building up a network model to further understand diffusion phenomena within that network. The latter was mainly achieved through hierarchical clustering for community detection and the shortest distance maps. Each of these layers of study allowed for us to gain a share of insight into the complexity of human mobility in one specific way.

4.1. Mobility Matrix(MM) and its Derived Effective Distance Network (EDN)

The MM reveals that diffusion phenomena are governed by the provinces’ set of major bilateral fluxes that remain constant in time despite natural variations such as weekends, holidays or seasonal (i.e., summer) mobility. Even external variations such as mobility restrictions that were put in place were able to reduce the number of people travelling (Figure 2) but not their usual destinations (Figure 3). Each province has a unique set of main neighbors and ways of interacting with them. These constitute the properties of provinces which will determine their role in diffusion phenomena as the nodes of a network, connected by their relevant effective distances on the edges: the EDN.
The EDN shows a configuration of hubs that connect with smaller provinces rather than between them, acting more like local gravitating poles for adjacent lesser provinces rather than drivers of the whole network (Figure 5). This is the case for the four largest degree-scoring provinces: Madrid, Valladolid, Araba/Álava and Málaga, each belonging to one big community (Table S3). They are followed by other important provinces of the same communities (Sevilla, Valencia…). However, the other big community, Northeast, does not appear until the 12th and 13th position with Zaragoza and Barcelona, the latter being the 2nd largest city of Spain. Contrary to the rest of communities, the Northeast shows a completely different behavior in which no main hub emerges. We are unable to explain these findings and their implications in diffusion dynamics, but we hypothesize that this subnetwork would be capable of self-sustaining a disease loop more effectively; we hypothesize this due to positive infective feedback emanating from the horizontal structure of shared connections of equal importance, with no “secondary” provinces just sending a large flux to the hub; however, a disease-modelling-in-networks approach would be required to confirm this hypothesis.
Out of the seven found communities, only the smallest two (Canary Islands and Northwest) are composed of a single full administrative region while the rest bring together provinces under different administrations at the level of Autonomous Communities—an outcome that similar studies also found when assessing mobility clusters [39]. This is especially true for the second and third largest central communities, with Madrid acting as a main bridge between two mobility clusters and several Autonomous Communities (including the Canary Islands). Hierarchical clustering (Figure S6) shows how the southern community conforms an independent area from the rest of Spain. This also reflects the fact that these provinces score the highest in flow betweenness (Table S3), since this is a metric of global transit, and 12 provinces are connected by only two bridges, thus tarnishing this measure in our network.
A relationship between the EDN-found communities and CI14 evolution at the province level in Spain exists (Figure 7), linking mobility patterns and spatial diffusion of a transmissible disease in a time period when public health measures targeted mobility between provinces. Given that the majority of mobility communities share more than one Autonomous Community, these findings stress the importance of coordinated, data-driven and locally wise measures.

4.2. Network Epidemiology

The network on which a disease spreads highly determines the evolution and dynamics it will experience [22]. We observed how several studies used mobility data to study the early stages of the COVID-19 diffusion [12,13,14,15,16,17,18,19,20] but did not address the subjacent properties of the countries’ mobility networks. By refining mobility data from Spain into a new approach within network modelling, we enabled social network analysis at a large geographical scale, treating each province like an individual relating with its neighbors. We were then able to establish some parallelisms with previous network studies.
Centrality measures such as degree, strength or betweenness have been investigated in COVID-19 contact tracing data. Two examples from India and South Korea [40,41] found that out-degree could identify a small proportion of patients responsible for a large portion of secondary infections. In addition, individuals with high betweenness scores acted as ‘bridges’ between subcomponents in the data [41]. Contact-tracing is always a retrospective approach, but if the mobility structure is stable, we can identify the most relevant locations before any outbreak and prepare for it in advance. Degree and strength identify well-connected individuals that are, at the same time, more exposed to a disease and at a higher risk of transmitting the disease onward, while betweenness identifies bridging locations that can be targeted to geographically contain an infection. All these measures are highly context-dependent [27,42], acquiring different meanings in different kinds of networks, especially where substructure is a relevant network component, such as the EDN [25].
This structure links individual behavior to broad population effects in networks. For example, during the 2014–2016 Sierra Leona Ebola outbreak, changes in the pattern of the case-contacts network forecasted the final epidemic size [43]. During the COVID-19 pandemics, governments implemented measures aiming to alter the social network of contacts by social distancing, travel bans or mandatory mask use. Their effects can be measured in terms of network measures [40] or they can be modelled as intervention scenarios for pandemic control [44].
The network structure has a significant impact on the analysis and interpretation of the entire network [25]. At the population (spatial) level, a mobility network such as the EDN turns an individual set of contacts into a series of infective distances, shortest paths and subcommunities [22]. These can be compared with several idealized types of networks found in natural, social or technological systems [45], among which small-world networks are one of the most relevant [46]. They are characterized by a high clustering degree and short path lengths, similar to those found in the EDN, but they also have random links that connect otherwise distant elements of the network.
Diseases that spread on small-world networks take the form of a spreading wave from an origin source, with the ability to quickly reach distant elements [22]. This was observed in the international air-travel network during the pandemics of H1N1 in 2009 or SARS in 2003, as described by Brockman and Helbing [7]. A similar pattern was seen in the diffusion of COVID-19 during the summer of 2020 in Spain, where it started in a region (northeast Spain) and then quickly spread to Madrid, from where it reached the rest of Spain [47], a path depicted by the effective distance map route in Figure 4B.
High modularity was also found in our network which, according to some authors, means that the disease is more likely to remain trapped within communities [48]. However, other authors consider this effect to be modest, especially in highly contagious diseases which would only be trapped with a modularity of over 80%, which is rare [49]. Despite this, modularity is still an essential factor to consider for control strategies in epidemiology. In the presence of a less transmissible disease, appropriate measures can increase the "structural trapping" of the network, which refers to the ability to prevent the disease from spreading from one community to another [49].

4.3. Limitations

The main limitation of the study comes from the MM and EDN design, which was based on the researchers’ decisions. Should those decisions have been different, the network structure, measures or communities—or its interpretation—might differ [22,24]. When working with networks, there is no common framework for deciding which structure or measures are the most useful [26], with each network yielding different measures’ meaning in its specific context [27]. The same applies to community detection algorithms, as previously stated [37].
The epidemiological implications of our findings in the geographical scale comes from research in social network analysis in individual groups of people. By turning Brockmann and Helbing’s effective distance [7]—an epidemiological measure of disease spreading through mobility—to the network’s edges, we can link the individual and spatial domains. However, more recent developments of the effective distance metric are available that better approximate reality, going beyond the mere shortest paths through multiple random walks at the same time [50]. We decided to continue with the first version, since modelling more complex epidemic dynamics would require much more than only accounting for multi-seeding processes, and we focused on the mobility network itself rather than the disease network.
The chosen administrative unit for this work was the province since public health units usually operate at this level. However, this approach might hide subtle mobility patterns at a smaller level, e.g., municipalities, which could lead to different conclusions or otherwise would be needed when dealing with more localized problems.

5. Conclusions and Recommendations

It is only by addressing together information from network measures, structure and modularity that a complete picture can be drawn for the epidemiological implications of the MM and the EDN in Spain. An analysis of the public health implications should encompass three aspects: (1) the community of the origin province, (2) the nearest destinations and the effective distance maps and (3) the risk of spreading to the entire network.
For example, an outbreak starting in the South community is more likely to remain trapped since only two bridging provinces connect it with the rest of Spain (Figure 5), and therefore, early measures should focus on those provinces to prevent the disease from spreading throughout the country. Similarly, if the source of the outbreak is a small province linked to a main hub (as shown in Figure 3D), a strategy could be to target directly third-order provinces linked to that hub, since the spreading from the hub is much more unlikely to be prevented. Madrid acts as the main bridge between most of the mobility communities, reflecting the importance of the capital city and the socio-economic distributions of travel in Spain, playing a central role in disease diffusion in the country. However, this hub effect is less relevant on the second-largest regions of Spain, Barcelona and its mobility community of the Northeast where no main hub emerges and all provinces relate equally to each other. All these epidemiological correlates could be incorporated into preparedness and response plans at the subnational, national and international levels, since the method is scalable to different levels of aggregation and disease dynamics are multi-layered in nature.
Further research is needed to test the relations between network components and disease epidemiology. Simple mathematical (e.g., SEIR) models are able to fully capture the complexity of these dynamics when paired with the adequate subjacent network [3]. The massive amount of epidemiological and social information made available during the COVID-19 pandemic poses a unique opportunity to address the identified gaps in knowledge and help us advance towards a more generalized theoretical framework in the field.

Supplementary Materials

The following supporting information can be downloaded at: https://0-www-mdpi-com.brum.beds.ac.uk/article/10.3390/ijerph20054356/s1, Figure S1: Travels between Madrid and Guadalajara during 2020; Figure S2: Percentage of daily/weekly travelers between Madrid and Guadalajara; Figure S3: Weekly percentage of travels from Madrid (A) and Guadalajara (B) by epiweek, 2020; Figure S4: Daily percentage of travels between two provinces. Spain, 2020; Figure S5: Weekly percentage of travels from each province; Figure S6: Hierarchical clustering dendrogram from the ED; Table S1: Comparison of Effective Distance Matrixes during 2020; Table S2: Studied network measures and their epidemiological implications; Table S3: Epidemic Diffusion Network measures by province and community.

Author Contributions

Conceptualization, J.D.-Á.-M., D.G.-B. and J.d.M.D.-C.; methodology, J.D.-Á.-M., D.G.-B., J.d.M.D.-C., R.R. and D.G.-G.; software, J.D.-Á.-M., A.R.-B. and N.R.; formal analysis, J.D.-Á.-M. and D.G.-G.; investigation, J.D.-Á.-M., D.G.-B., D.G.-G. and J.d.M.D.-C.; data curation, J.D.-Á.-M.; writing—original draft preparation, J.D.-Á.-M.; writing—review and editing, J.D.-Á.-M., M.G.-V., M.P., R.R., N.R., D.G.-G., D.G.-B. and J.d.M.D.-C.; visualization, J.D.-Á.-M., A.R.-B. and N.R.; supervision, D.G.-B. and J.d.M.D.-C.; funding acquisition, D.G.-B. All authors have read and agreed to the published version of the manuscript.

Funding

This research has been financed by Carlos III Health Institute (ISCIII) under the project COV20-00881 and the project Fundación BBVA.DGVI 256/22 “COVID-19 Urban Atlas Spain”.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets used and/or analyzed during the current study are publicly available from the following sources: Instituto de Salud Carlos III. Situación y evolución de la pandemia de COVID-19 en España. Available from: URL: https://cnecovid.isciii.es/covid19/ (accessed on 22 February 2023). Ministerio de Transportes, Movilidad y Agenda Urbana. Estudio de movilidad con Big Data. Available from: URL: https://www.mitma.gob.es/ministerio/covid-19/evolucion-movilidad-big-data (accessed on 22 February 2023).

Acknowledgments

The authors wish to thank all the colleagues who provided valuable insight and support during the project at its many phases.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. World Health Organization. WHO Coronavirus (COVID-19) Dashboard 2020; WHO: Geneva, Switzerland, 2020.
  2. Brauer, F.; van den Driessche, P.; Wu, J. Mathematical Epidemiology; Springer: Berlin/Heidelberg, Germany, 2008. [Google Scholar]
  3. Chang, S.; Pierson, E.; Koh, P.W.; Gerardin, J.; Redbird, B.; Grusky, D.; Leskovec, J. Mobility network models of COVID-19 explain inequities and inform reopening. Nature 2021, 589, 82–87. [Google Scholar] [CrossRef] [PubMed]
  4. Charu, V.; Zeger, S.; Gog, J.; Bjørnstad, O.N.; Kissler, S.; Simonsen, L.; Viboud, C. Human mobility and the spatial transmission of influenza in the United States. PLoS Comput. Biol. 2017, 13, e1005382. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Badr, H.S.; Du, H.; Marshall, M.; Dong, E.; Squire, M.M.; Gardner, L.M. Association between mobility patterns and COVID-19 transmission in the USA: A mathematical modelling study. Lancet Infect. Dis. 2020, 20, 1247–1254. [Google Scholar] [CrossRef] [PubMed]
  6. Balcan, D.; Gonçalves, B.; Hu, H.; Ramasco, J.J.; Colizza, V.; Vespignani, A. Modeling the spatial spread of infectious diseases: The GLobal Epidemic and Mobility computational model. J. Comput. Sci. 2010, 1, 132–145. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Brockmann, D.; Helbing, D. The hidden geometry of complex, network-driven contagion phenomena. Science 2013, 342, 1337–1342. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Grantz, K.H.; Meredith, H.R.; Cummings, D.A.T.; Metcalf, C.J.E.; Grenfell, B.T.; Giles, J.R.; Mehta, S.; Solomon, S.; Labrique, A.; Kishore, N.; et al. The use of mobile phone data to inform analysis of COVID-19 pandemic epidemiology. Nat. Commun. 2020, 11, 1–8. [Google Scholar] [CrossRef]
  9. Kang, Y.; Gao, S.; Liang, Y.; Li, M.; Rao, J.; Kruse, J. Multiscale dynamic human mobility flow dataset in the U.S. during the COVID-19 epidemic. Sci. Data 2020, 7, 1–13. [Google Scholar] [CrossRef]
  10. Oliver, N.; Lepri, B.; Sterly, H.; Lambiotte, R.; Deletaille, S.; Nadai, M.; Letouzé, E.; Salah, A.A.; Benjamins, R.; Cattuto, C.; et al. Mobile phone data for informing public health actions across the COVID-19 pandemic life cycle. Sci. Adv. 2020, 6, eabc0764. [Google Scholar] [CrossRef]
  11. Pepe, E.; Bajardi, P.; Gauvin, L.; Privitera, F.; Lake, B.; Cattuto, C.; Tizzoni, M. COVID-19 outbreak response, a dataset to assess mobility changes in Italy following national lockdown. Sci. Data 2020, 7, 230. [Google Scholar] [CrossRef]
  12. Jiang, J.; Luo, L. Influence of population mobility on the novel coronavirus disease (COVID-19) epidemic: Based on panel data from Hubei, China. Glob. Health Res. Policy 2020, 5, 1–10. [Google Scholar] [CrossRef]
  13. Kraemer, M.U.G.; Yang, C.-H.; Gutierrez, B.; Wu, C.-H.; Klein, B.; Pigott, D.M.; Open COVID-19 Data Working Group; du Plessis, L.; Faria, N.R.; Li, R.; et al. The effect of human mobility and control measures on the COVID-19 epidemic in China. Science 2020, 368, 493–497. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Gibbs, H.; Liu, Y.; Pearson, C.A.B.; Jarvis, C.I.; Grundy, C.; Quilty, B.J.; Diamond, C.; LSHTM CMMID COVID-19 Working Group; Simons, D.; Gimma, A.; et al. Changing travel patterns in China during the early stages of the COVID-19 pandemic. Nat. Commun. 2020, 11, 5012. [Google Scholar] [CrossRef] [PubMed]
  15. Cartenì, A.; Di Francesco, L.; Martino, M. How mobility habits influenced the spread of the COVID-19 pandemic: Results from the Italian case study. Sci. Total Environ. 2020, 741, 140489. [Google Scholar] [CrossRef] [PubMed]
  16. Cintia, P.; Fadda, D.; Giannotti, F.; Pappalardo, L.; Rinzivillo, S.; Boschi, T.; Chiaromonte, F.; Bonato, P.; Fabbri, F.; Penone, F.; et al. The relationship between human mobility and viral transmissibility during the COVID-19 epidemics in Italy. arXiv 2006, arXiv:2006.03141. [Google Scholar]
  17. Mazzoli, M.; Pepe, E.; Mateo, D.; Cattuto, C.; Gauvin, L.; Bajardi, P.; Ramasco, J.J. Interplay between mobility, multi-seeding and lockdowns shapes COVID-19 local impact. PLoS Comput. Biol. 2021, 17, e1009326. [Google Scholar] [CrossRef]
  18. Kissler, S.M.; Kishore, N.; Prabhu, M.; Goffman, D.; Beilin, Y.; Landau, R.; Gyamfi-Bannerman, C.; Bateman, B.T.; Snyder, J.; Razavi, A.S.; et al. Reductions in commuting mobility correlate with geographic differences in SARS-CoV-2 prevalence in New York City. Nat. Commun. 2020, 11, 4674. [Google Scholar] [CrossRef]
  19. Kishore, N.; Kahn, R.; Martinez, P.P.; Salazar, P.M.; de Mahmud, A.S.; Buckee, C.O. Lockdowns result in changes in human mobility which may impact the epidemiologic dynamics of SARS-CoV-2. Sci. Rep. 2021, 11, 6995. [Google Scholar] [CrossRef]
  20. Jia, J.S.; Lu, X.; Yuan, Y.; Xu, G.; Jia, J.; Christakis, N.A. Population flow drives spatio-temporal distribution of COVID-19 in China. Nature 2020, 582, 389–394. [Google Scholar] [CrossRef]
  21. Li, Z.; Li, H.; Zhang, X.; Zhao, C. Estimation of Human Mobility Patterns for Forecasting the Early Spread of Disease. Healthcare 2021, 9, 1224. [Google Scholar] [CrossRef]
  22. Keeling, M.J.; Eames, K.T.D. Networks and epidemic models. J. R. Soc. Interface. 2005, 2, 295–307. [Google Scholar] [CrossRef] [Green Version]
  23. Pastor-Satorras, R.; Vespignani, A. Epidemic spreading in scale-free networks. Phys. Rev. Lett. 2001, 86, 3200–3203. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Keeling, M. The implications of network structure for epidemic dynamics. Theor. Popul. Biol. 2005, 67, 1–8. [Google Scholar] [CrossRef] [PubMed]
  25. Silk, M.J.; Hodgson, D.J.; Rozins, C.; Croft, D.P.; Delahay, R.J.; Boots, M.; McDonald, R.A. Integrating social behaviour, demography and disease dynamics in network models: Applications to disease management in declining wildlife populations. Philos. Trans. R. Soc. Lond. B. Biol. Sci. 2019, 374, 20180211. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Silk, M.J.; Croft, D.P.; Delahay, R.J.; Hodgson, D.J.; Boots, M.; Weber, N.; McDonald, R.A. Using Social Network Measures in Wildlife Disease Ecology, Epidemiology, and Management. BioScience 2017, 67, 245–257. [Google Scholar] [CrossRef]
  27. Bell, D.C.; Atkinson, J.S.; Carlson, J.W. Centrality measures for disease transmission networks. Soc. Netw. 1999, 21, 1–21. [Google Scholar] [CrossRef]
  28. Ministerio de Transportes, Movilidad y Agenda Urbana. Estudio de Movilidad con Big Data. Available online: https://www.mitma.gob.es/ministerio/covid-19/evolucion-movilidad-big-data (accessed on 26 February 2023).
  29. Instituto de Salud Carlos III. Situación y evolución de la pandemia de COVID-19 en España. Available online: https://cnecovid.isciii.es/covid19/ (accessed on 26 February 2023).
  30. Instituto Nacional de Estadística (INE). INEbase. Available online: https://www.ine.es/dyngs/INEbase/listaoperaciones.htm.
  31. Dijkstra, E.W. A note on two problems in connexion with graphs. Numer. Math. 1959, 1, 269–271. [Google Scholar] [CrossRef] [Green Version]
  32. Csardi, G.; Nepusz, T. The igraph software package for complex network research. InterJournal, Complex Systems 2006, 1695, 1–9. [Google Scholar]
  33. Barrat, A.; Barthélemy, M.; Pastor-Satorras, R.; Vespignani, A. The architecture of complex weighted networks. Proc. Natl. Acad. Sci. USA 2004, 101, 3747–3752. [Google Scholar] [CrossRef] [Green Version]
  34. Takemoto, K.; Oosawa, C. Introduction to Complex Networks: Measures, Statistical Properties, and Models. In Statistical and Machine Learning Approaches for Network Analysis; Dehmer, M., Basak, S.C., Eds.; Wiley: Hoboken, NJ, USA, 2012; pp. 45–75. [Google Scholar] [CrossRef]
  35. Pons, P.; Latapy, M. Computing communities in large networks using random walks (long version). arXiv 2005. [CrossRef]
  36. Rosvall, M.; Bergstrom, C.T. Maps of random walks on complex networks reveal community structure. PNAS 2008, 105, 1118–1123. [Google Scholar] [CrossRef] [Green Version]
  37. Smith, N.R.; Zivich, P.N.; Frerichs, L.M.; Moody, J.; Aiello, A.E. A Guide for Choosing Community Detection Algorithms in Social Network Studies: The Question Alignment Approach. Am. J. Prev. Med. 2020, 59, 597–605. [Google Scholar] [CrossRef] [PubMed]
  38. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2020. [Google Scholar]
  39. Rinzivillo, S.; Mainardi, S.; Pezzoni, F.; Coscia, M.; Pedreschi, D.; Giannotti, F. Discovering the Geographical Borders of Human Mobility. Künstl. Intell. 2012, 26, 253–260. [Google Scholar] [CrossRef]
  40. Jo, W.; Chang, D.; You, M.; Ghim, G.-H. A social network analysis of the spread of COVID-19 in South Korea and policy implications. Sci. Rep. 2021, 11, 8581. [Google Scholar] [CrossRef]
  41. Nagarajan, K.; Muniyandi, M.; Palani, B.; Sellappan, S. Social network analysis methods for exploring SARS-CoV-2 contact tracing data. BMC Med. Res. Methodol. 2020, 20, 233. [Google Scholar] [CrossRef] [PubMed]
  42. Farine, D.R.; Whitehead, H. Constructing, conducting and interpreting animal social network analysis. J. Anim. Ecol. 2015, 84, 1144–1163. [Google Scholar] [CrossRef] [Green Version]
  43. Hébert-Dufresne, L.; Young, J.-G.; Bedson, J.; Skrip, L.A.; Pedi, D.; Jalloh, M.F.; Raulier, B.; Lapointe-Gagné, O.; Jambai, A.; Allard, A.; et al. The network epidemiology of an Ebola epidemic. arXiv 2021. [CrossRef]
  44. Maheshwari, P.; Albert, R. Network model and analysis of the spread of Covid-19 with social distancing. Appl. Netw. Sci. 2020, 5, 100. [Google Scholar] [CrossRef]
  45. Barabási, A.-L. Network Science; Cambridge University Press: Cambridge, UK, 2016. [Google Scholar]
  46. Watts, D.J.; Strogatz, S.H. Collective dynamics of ‘small-world’ networks. Nature 1998, 393, 440–442. [Google Scholar] [CrossRef]
  47. Rosillo, N.; Del-Águila-Mejía, J.; Rojas-Benedicto, A.; Guerrero-Vadillo, M.; Peñuelas, M.; Mazagatos, C.; Segú-Tell, J.; Ramis, R.; Gómez-Barroso, D. Real time surveillance of COVID-19 space and time clusters during the summer 2020 in Spain. BMC Public Health 2021, 21, 961. [Google Scholar] [CrossRef]
  48. Rozins, C.; Silk, M.J.; Croft, D.P.; Delahay, R.J.; Hodgson, D.J.; McDonald, R.A.; Weber, N.; Boots, M. Social structure contains epidemics and regulates individual roles in disease transmission in a group-living mammal. Ecol. Evol. 2018, 8, 12044–12055. [Google Scholar] [CrossRef] [Green Version]
  49. Sah, P.; Leu, S.T.; Cross, P.C.; Hudson, P.J.; Bansal, S. Unraveling the disease consequences and mechanisms of modular structure in animal social networks. Proc. Natl. Acad. Sci. USA 2017, 114, 4165–4170. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  50. Iannelli, F.; Koher, A.; Brockmann, D.; Hövel, P.; Sokolov, I.M. Effective distances for epidemics spreading on complex networks. Phys. Rev. E. 2017, 95, 12313. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. Political map of Spain.
Figure 1. Political map of Spain.
Ijerph 20 04356 g001
Figure 2. Daily total travels between provinces and the 14-day cumulative incidence in Spain, 14 February 2020–9 May 2021. Cumulative incidence in 14 days per 100,000 inhabitants.
Figure 2. Daily total travels between provinces and the 14-day cumulative incidence in Spain, 14 February 2020–9 May 2021. Cumulative incidence in 14 days per 100,000 inhabitants.
Ijerph 20 04356 g002
Figure 3. Rationale for defining the Mobility Matrix of Spain from unique daily travelers. Red line: flux from Guadalajara to Madrid; Blue line: flux from Madrid to Guadalajara. (A) Total unique travelers (Fnm) by 2020 week between Madrid and Guadalajara and (B) Percentage of total travelers (Pnm) by 2020 week between the same provinces. (C) Set of percentages of total travels from Madrid to all other provinces it sends travelers to. (D) Set of percentages of total travels from Guadalajara to all other provinces it sends travelers to.
Figure 3. Rationale for defining the Mobility Matrix of Spain from unique daily travelers. Red line: flux from Guadalajara to Madrid; Blue line: flux from Madrid to Guadalajara. (A) Total unique travelers (Fnm) by 2020 week between Madrid and Guadalajara and (B) Percentage of total travelers (Pnm) by 2020 week between the same provinces. (C) Set of percentages of total travels from Madrid to all other provinces it sends travelers to. (D) Set of percentages of total travels from Guadalajara to all other provinces it sends travelers to.
Ijerph 20 04356 g003
Figure 4. Shortest distance maps from (outgoing) and to (incoming) two provinces, Huesca and Madrid. (A) Shortest distances to Madrid. (B) Shortest distances from Huesca.
Figure 4. Shortest distance maps from (outgoing) and to (incoming) two provinces, Huesca and Madrid. (A) Shortest distances to Madrid. (B) Shortest distances from Huesca.
Ijerph 20 04356 g004
Figure 5. Effective distance network (EDN) representation of network measures and map of communities. (A) Vertex size is proportional to incoming/outgoing connections (degree). Each edge represents a connection. Communities are represented by color. (B) A map of Spain with provinces colored by community belonging.
Figure 5. Effective distance network (EDN) representation of network measures and map of communities. (A) Vertex size is proportional to incoming/outgoing connections (degree). Each edge represents a connection. Communities are represented by color. (B) A map of Spain with provinces colored by community belonging.
Ijerph 20 04356 g005
Figure 6. CI14 per 100,000 inhabitants by province, grouped by community during the study period.
Figure 6. CI14 per 100,000 inhabitants by province, grouped by community during the study period.
Ijerph 20 04356 g006
Figure 7. Mean daily differences between CI14 curves within communities’ vs. outside community for every detected community. All mean differences between groups for all communities were statistically significant. X axis: ‘0’ stands for all the provinces outside the represented community.
Figure 7. Mean daily differences between CI14 curves within communities’ vs. outside community for every detected community. All mean differences between groups for all communities were statistically significant. X axis: ‘0’ stands for all the provinces outside the represented community.
Ijerph 20 04356 g007
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Del-Águila-Mejía, J.; García-García, D.; Rojas-Benedicto, A.; Rosillo, N.; Guerrero-Vadillo, M.; Peñuelas, M.; Ramis, R.; Gómez-Barroso, D.; Donado-Campos, J.d.M. Epidemic Diffusion Network of Spain: A Mobility Model to Characterize the Transmission Routes of Disease. Int. J. Environ. Res. Public Health 2023, 20, 4356. https://0-doi-org.brum.beds.ac.uk/10.3390/ijerph20054356

AMA Style

Del-Águila-Mejía J, García-García D, Rojas-Benedicto A, Rosillo N, Guerrero-Vadillo M, Peñuelas M, Ramis R, Gómez-Barroso D, Donado-Campos JdM. Epidemic Diffusion Network of Spain: A Mobility Model to Characterize the Transmission Routes of Disease. International Journal of Environmental Research and Public Health. 2023; 20(5):4356. https://0-doi-org.brum.beds.ac.uk/10.3390/ijerph20054356

Chicago/Turabian Style

Del-Águila-Mejía, Javier, David García-García, Ayelén Rojas-Benedicto, Nicolás Rosillo, María Guerrero-Vadillo, Marina Peñuelas, Rebeca Ramis, Diana Gómez-Barroso, and Juan de Mata Donado-Campos. 2023. "Epidemic Diffusion Network of Spain: A Mobility Model to Characterize the Transmission Routes of Disease" International Journal of Environmental Research and Public Health 20, no. 5: 4356. https://0-doi-org.brum.beds.ac.uk/10.3390/ijerph20054356

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop