Next Article in Journal
The Potential of Wastewater Reuse and the Role of Economic Valuation in the Pursuit of Sustainability: The Case of the Canal de Isabel II
Previous Article in Journal
E-Learning for Continuing Professional Development of University Librarians: A Systematic Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Spatial Analysis on the Variances of Landslide Factors Using Geographically Weighted Logistic Regression in Penang Island, Malaysia

by
Syaidatul Azwani Zulkafli
1,
Nuriah Abd Majid
1,* and
Ruslan Rainis
2
1
Institute for Environment and Development (LESTARI), Universiti Kebangsaan Malaysia, 43600 Bangi, Malaysia
2
Department of Geography, School of Humanities, Universiti Sains Malaysia, 11800 Penang, Malaysia
*
Author to whom correspondence should be addressed.
Sustainability 2023, 15(1), 852; https://0-doi-org.brum.beds.ac.uk/10.3390/su15010852
Submission received: 29 September 2022 / Revised: 31 October 2022 / Accepted: 2 November 2022 / Published: 3 January 2023
(This article belongs to the Section Sustainability in Geographic Science)

Abstract

:
Landslides are one of the common natural disasters involving mostly movement of soil surfaces associated with gravitational attraction. Their adverse losses and significant damage, which always result in at least 17% of casualties and billions of dollars of financial losses worldwide, have made landslides the third most notorious phenomenon devastating many parts of the world. Malaysia has had multiple landslide occurrences, particularly in highly urbanized areas, such as Penang Island, owing to the declining vegetation cover in hilly terrains. Thus, this study aims to delineate the spatial relationship variances between landslide occurrences and the influencing factors in the area of interest. Ten influencing factors considered, including distance to roads, distance to rivers, distance to faults, slope angle, slope aspect, curvature, rainfall annual average, lithology, soil series, and land use. In this study, we use a software (GWR 4.0) as a medium for the analysis processing, coupled with GIS. A local statistical technique, Geographically Weighted Logistic Regression (GWLR), is primacy in capturing the geographical variation of the model coefficients that considers non-stationary variables and models their relationships, as well as processes regression coefficients over space. Goodness-of-fit criteria were used to evaluate the GWLR model, namely AICc that decrease from 872.202167 to 800.856998. Bayesian Information Criterion (BIC) shows a decrease in value from 925.784185 to 945.196942. Likewise, deviance decreased from 849.931675 to 739.175630, while pdev increased from 0.379457 to 0.460321. These goodness-of-fit criteria values express GWLR as the best model for local measure. The variances in both local parameter estimates and the t-values (negative and positive values) show the level of significance for each landslide factor in influencing landslide occurrences across the study area. The results of the local parameter estimates and the t-values also show that the slope angle and the slope aspect spatially affect landslide occurrences across the study area. Therefore, a proper perspective and a thorough understanding of the certain slope condition must be established for future mitigation actions to support the agenda of SDG 15, which promotes resilience and disaster risk reduction.

1. Introduction

A landslide is a natural disaster involving mostly the movement of soil surfaces, rock falls, or a combination of both movements, and is commonly associated with gravitational attraction [1,2,3,4]. The number of fatalities often depends on the severity of the landslide event, concerning thousands of human lives and the economy. Although landslides have caused a lower death toll compared to other disasters, the destructive effects of the later consequences are rather devastating. Due to their adverse losses and significant damage, which always result in at least 17% of casualties and billions of dollars of financial losses worldwide, landslides are quickly acknowledged as the third most notorious phenomenon devastating many parts of the world [5,6]. For instance, the mountainous areas of Peru experienced the most catastrophic landslides in 1970, when these events killed approximately 18,000–20,000 people [7]. According to NASA, more than 8935 landslides took place around the world, with 1120 cases scattered in Southeast Asia. On top of that, in the year 2010, the highest number of people affected by landslides was recorded, 89% of which were from China (Figure 1).
Being one of the developing countries in Southeast Asia, Malaysia has had multiple landslide occurrences, but particularly in highly urbanized areas such as Kuala Lumpur, Selangor, and Penang Island. This may be due to the landscape continuously shifting, which mostly involves human activities [9,10,11]. According to [12], the level of urbanization in Penang Island is approximately 90.8% and is among the highest, after Kuala Lumpur, Putrajaya, and Selangor. The growth in urbanization resulted in Penang Island being the most populated island in Malaysia. These days, the significant rise in population has indirectly limited the amount of flat land, thus leading to hillside development. The unrestricted development is clearly agonizing, as areas often undergo slope cutting and land clearing for housing, hotels, and apartments [13]. However, different factors can influence landslides and occasionally correspond to one another. Therefore, this study aims to delineate the spatial relationship between landslide occurrences and the influencing factors in the area of interest, Penang Island.
Comprehensive analysis of landslides can be carried out by qualitative and quantitative methods. The qualitative method is a relatively subjective approach that is entirely based on expert knowledge and opinions and uses the heuristic method technique [14,15,16]. On the other hand, quantitative approaches are mostly based on statistical techniques that can be classified as one of two sub-techniques: bivariate or multivariate analysis. Bivariate analysis is the simplest form of statistical analysis, which is commonly used when two variables are observed against each other; one variable is dependent and the other is independent [17,18]. Bivariate analyses such as Weighted Overlay, Weights of Evidence (WoE), and Frequency Ratio (FR) are widely used in landslide studies [19,20,21,22,23]. Meanwhile, multivariate analysis techniques are purposely employed to observe the relationship between the dependent variable and several independent variables. Logistic Regression (LR), Artificial Neural Networks (ANN), and Analytical Hierarchy Process (AHP) are among the most prevalent multivariate analyses in landslide fields [24,25,26,27,28,29].
In this study, Geographically Weighted Logistic Regression (GWLR), coupled with Geography Information System (GIS) software, is used to analyze 10 influencing factors, taking into consideration landslide spatial analysis. GWLR is an extended version of the standard LR, which can be fully utilized on a local scale. Moreover, GWLR has a significant advantage in binary classification, which is fitting, as this study uses the values 0 and 1 to represent the existence of landslide occurrences. In addition, each landslide occurrence is closely associated with the characteristics of the environment, thus the contribution from factors is less likely identical across the whole of Penang Island. GWLR has also been widely used in landslide studies, which resulted in the best accuracy compared to other models. Therefore, using a heterogeneously designed spatial method, such as GWLR, will generate the best-fitting model for each contributed factor in the study area.

2. Materials and Methods

2.1. Study Area

Penang Island was chosen as the study area due to its history of frequent landslide occurrences. Penang is located in Northwest Peninsular Malaysia at 5.4164° N, 100.3327° E and is 1 of the 14 states of Malaysia, see Figure 2. The terrain consists of coastal plains, hills, and mountains. Penang Island covers an area of approximately 293 km2, the fourth largest island in Malaysia, and is separated from the mainland by a channel. It has seen rapid industrialization, rapid technological progress, and the greatest rate of urbanization, resulting in increased demand for land [30]. The Institute of Strategic and International Studies and Penang Development Corporation stated that Penang Island is made up of steep topography above 60 m (exceeding 50% of the land area), which makes half of Penang’s land vulnerable to landslides. Since September 1995, Penang Island has recorded 60 landslide incidents in the Penang Hill area following a freak storm that damaged the pathway near the Penang Botanical Gardens [31]. The changes in land use in Penang Island, which developed along the hillside to accommodate the population growth, are indirectly increasing the frequency of landslide occurrences in the area. This, in the end, will cause an expansion of urbanization. Along with Kuala Lumpur, Penang Island has also experienced rapid urbanization due to industrial activities. For instance, according to the Department of Statistics Malaysia, the level of urbanization in Penang is 90.8% and is among the highest, after Kuala Lumpur, Putrajaya, and Selangor. The saturated urbanization in Penang today is associated with the drastic population increase. Additionally, [32] stated that with the increase in the human population, people have started to dig deeper pits to seek essential minerals. Hence, landslides will be intensified by abrupt and large-scale changes in land use, such as developing new land for neighborhoods, infrastructure, and cities in order to support population demands.

2.2. Flowchart

Figure 3 shows the flowchart of GWLR analysis implemented in the GWR 4.0 software.

2.3. Data Acquisition

The landslide inventory for this study was created based on the databases from previous landslide studies, the database managed by The Public Works Department Malaysia, and field surveys. A total of 988 landslide points (494 landslides, 494 non-landslides) were gathered and used to form this inventory. The dependent variable is binary and is classified with the value 1 (landslide presence) or 0 (landslide absence). The landslide absence points were created randomly using ArcGIS. According to [33], the majority of the landslides that occurred in the center of the island within steep terrain are primarily composed of shallow rotational debris slides and debris flows. Moreover, [17] stated that hill land development in the study area was responsible for most of the landslides over the decades.

2.4. Landslide Influencing Factors

This study collected data representing factors affecting a landslide from different sources (Table 1). One study [16] stated that there are no specific rules for choosing landslide characteristics. Thus, a total of 10 influencing factors were analyzed and generated for data–layer maps using GIS software, including distance to roads, distance to rivers, distance to faults, slope angle, slope aspect, curvature, rainfall annual average, lithology, soil series, and land use (Figure 4). Slopes, aspects, and curvatures were derived from a 30 × 30 digital elevation model (DEM) using a contour map.

2.5. Multicollinearity Analysis

The premise of establishing a regression model is that each explanatory variable is independent of the others [34]. This analysis requires the consideration of a possible strong linear correlation between the variables, which may later lead to multicollinearity issues. These issues cause instability in the calculation of the variables resulting in irrelevant results. Thus, the multicollinearity between these 10 variables was considered using two indicators, namely tolerance (TOL) and Variance Inflation Factor (VIF). If the value of TOL is <0.10 and VIF is >5, the results would indicate strong multicollinearity among the factors [35,36]. The following equations were used to measure the TOL and VIF:
TOL = 1 R j 2
VIF = 1 TOL
where R j 2 is the coefficient of regression of j on the added conditioning factors.

2.6. Geographically Weighted Logistic Regression (GWLR)

GWLR is a local statistical technique that considers non-stationary variables and models their relationships, which processes regression coefficients over space [23,37,38,39]. In other words, it is an outgrowth version of the standard Logistic Regression (LR), which determines the relationships between the dependent and independent variables down to a local scale instead of a global one. It was initially developed to traverse the relations between riverbank erosion and geomorphological variables [34]. However, GWLR is now gradually being applied to other studies in science [40]. GWLR analysis for this study utilizes the GWR 4.0 software [41]. The software prefers a spatial database, thus it is constructed first within a GIS software, which consists of the inventory of landslides and analyzed factors. The regression equation can be defined as:
P ( u i , v i ) = exp   ( β 0 ( u i , v i ) + β 1 ( u i , v i )   X 1 + + β k ( u i , v i )   X k ) 1 + exp   ( β 0 ( u i , v i ) + β 1 ( u i , v i )   X 1 + + β k ( u i , v i )   X k )
where P(ui, vi) is the probability of landslide events occurring at location i with coordinates (ui, vi); β1(ui, vi) is the local parameter coefficient of the landslide locations, (ui, vi); and β0(ui, vi) is the intercept of point (ui, vi) [41].
The stages for the creation of a landslide susceptibility map according to the GWLR model defined above are similar to the global model. The main difference is the application of the logistic regression analysis at a local scale. This is performed by fitting a weighted logistic model for each landslide location using a subset of the original dataset. Thus, a kernel that determines the size of the subset and a weighting function to calculate the weights are required in order to define the GWLR model. In this study, a Gaussian adaptive kernel was adopted. The optimal number of nearest neighbors is estimated by the GWR 4.0 software using an optimization criterion (i.e., the minimization of the corrected Akaike Information Criterion (AICc)). The equation is expressed as:
W i j = = exp ( ( d i j b ) 2 )
where Wij is the weight for unit j in the neighborhood of unit i, dij is the distance between the center point of unit i and j as the measurement of spatial proximity degree, and b is the bandwidth of the Gaussian kernel function. Many approaches are available for determining the bandwidth, including the cross-validation (CV) method and the Akaike Information Criterion (AIC) method. However, [42] stated that the AICc method is easier to avoid over-fitting compared to the CV method, hence this study used the AICc method to determine the bandwidth.

3. Results

The multicollinearity analysis of each independent variable is important to reduce over-fitting in the models. Results in Table 2 below show the lowest TOL value is 0.603 and the highest VIF value is 1.659, which are both laid down on lithology. These values achieved critical values (i.e., TOL < 0.10 or VIF > 5), indicating there is no significant multicollinearity among the 10 landslide factors considered in this study.
Four goodness-of-fit criteria provided by the GWR 4.0 software were used to evaluate the GWLR model. The optimal number of nearest neighbors is estimated using an optimization criterion resulting in the minimization of the corrected AICc. Table 3 summarizes the results of global and GWLR diagnoses. According to [43], if the difference in the AICc index of the GWLR is >4, the model is considered to have improved. The criterion of the AICc for local regression is 872.202167, which has a lower value than the global regression (800.856998), revealing that GWLR is the best model. Furthermore, the percent deviance (pdev) value increased from 0.379457 to 0.460321, showing that GWLR interprets the spatial variability of landslides better than the global parameter estimates.
Meanwhile, Table 4 expresses the descriptive statistics of local parameter coefficients of the landslide factors. The intercept ranges from 1.91 to 17.83 with a mean of 7.08. For the Euclidean distance factors: distance to roads ranges from −0.005 to −0.002 with a mean of −0.003; distance to rivers ranges from −0.003 to 0.0003 with a mean of −0.001; and distance to faults ranges from −0.001 to 0.001 with a mean of −0.005. Meanwhile, for the DEM-derived factors: slope aspect ranges from −0.0003 to 0.003 with a mean of 0.001; slope angle ranges from −0.03 to 0.05 with a mean of 0.01; and curvature ranges from −0.73 to −0.04 with a mean of −0.32. For one of the hydrological factors, namely rainfall annual average, the results show a range from −0.0017 to 0.00059 with a mean of −0.00033. Soil series and lithology factors show ranges of −2.34 to −0.42 with a mean of −0.94 and −0.99 to 0.12 with a mean of −0.41, respectively. Lastly, the land use factor ranges from −0.29 to 0.073 with a mean of −0.071.
GWLR is highly efficient when there is a spatial correlation between both variables. In most GWLR applications using multiple independent variables, parameter estimates and t-values are used to determine the influential regions, in which, the values detect the significance level of the relationship between the dependent variable and the independent variables [44]. For the selected independent variables in the GWLR model, raster maps were generated from the parameter estimates and t-values using an Inverse Distance Weighted (IDW) interpolation to show how the relationships between the landslide and each factor vary across space.

4. Discussion

The coefficient maps of the influencing factors are the stepping-stone for landslide mitigation in the study area. The results portrayed in Figure 5 and Figure 6 indicate a spatial variation in the relationship between landslides and the influencing factors in the study area. Because a landslide is a complex phenomenon, its occurrence may due to one or more factors that are constantly associated with each other. It is crucial to identify the factors that are most likely to influence one particular event. The results from the GWLR model reflect in more detail the contribution of each factor to the landslides. Because the spatial heterogeneity of the influencing variables is completely taken into account, the spatial relationship between the landslides and the influencing factors is no longer a fixed value.
For instance, Figure 5 expresses the estimated parameter values calculated in the GWR software. Evaluated values for a particular independent variable that are negative indicate a negative impact on the dependent variable, whereas a positive sign shows a significant influence on the occurrences of the dependent variable. Figure 5a represents parameter estimate values for distance to roads, which constantly shows negative values in all parts of the study area, ranging from −0.0054 to −0.0025. This shows that distance to roads does not influence the occurrence of landslides, as road networks are concentrated on the east side of the study area where the slope’s steepness is less than 15°.
Distance to rivers values, as shown in Figure 5b, convey various values ranging from −0.0032 to 0.00031, with negative values scattered across Penang Island, except for in the southwest. Meanwhile, distance to faults values remain negative and range from −0.0014 to −0.000016, as described in Figure 5c. Figure 5d represents the variation of slope angle values; negative values were only found in the northwestern part of the study area and range from −0.034 to −0.016, whereas other parts of Penang Island show positive values ranging from 0.00081 to 0.054. Figure 5e shows that the curvature values are constantly negative and range from −0.72 to −0.040, meanwhile slope aspect values range from −0.00033 to 0.003, as described in Figure 5f; slope aspect values are only negative in the southwest. For annual rainfall average, the estimated parameter is not always positive, as values range from −0.0016 to −0.00031 and are scattered in the center of Penang Island and towards the northwest (Figure 5g). Figure 5h depicts the soil series factor, which constantly shows negative values ranging from −2.32 to −0.41. Similarly, the lithology factor in Figure 5i shows that the majority of lithology types deliver negative values ranging from −0.98 to 0.12. Lastly, Figure 5j represents the estimated parameter for land use; values range from −0.29 to 0.073 and the majority of them are negative.
These values represent the significance of each landslide factor across the study area. However, in GWLR, estimate values are not the only indicator of the landslide factors’ significance, and as a result, they should always be associated with the t-values.
Therefore, Figure 6 demonstrates a compilation of 10 integrated maps of the landslide factors considering each t-value. Each variable is distinguished by a 5-color scheme to visualize the significance level in different areas. Figure 6a,c,e,h shows that distance to roads, distance to faults, curvature, and soil series values are negative across the study area, indicating that roads, faults, curvature, and soil series factors have a negative influence on landslide occurrences. Apart from that, negative values for distance to rivers and lithology were scattered across the study area, except for in the southwest and in the northwest, respectively (Figure 6b,i). Meanwhile, Figure 6g describes the annual rainfall average factor, showing that the t-values do not always stay positive, as negative values are found in the center and towards the northwestern part of Penang Island.
Figure 5 and Figure 6 both suggest that both the slope angle and the slope aspect are positively correlated with landslides, as the values increased, thus increasing the probability of landslide occurrences. This indirectly shows that both the slope angle and the slope aspect are the most significant factors influencing landslide occurrences in Penang Island, necessitating comprehensive mitigation actions. Because Penang Island is made up of steep topography above 60 m (exceeding 50 percent of the land area), half of Penang’s land is vulnerable to landslides. As the slope angle increases, shear stress in soil or other unconsolidated material generally increases as well [45]. Even though the slope angle is one of the factors deemed to be significant, it does not always remain a positive influence. The results show that the northwestern part conveys small negative values, indicating that the slope angle is less likely to contribute to landslide occurrences there. Meanwhile, according to [46,47,48], apart from the East-facing slopes, a slope that does not face the sun has a low temperature with a high soil moisture that when it is disturbed, it will tend to influence the landslide occurrences. Therefore, it is necessary to reinforce the slope treatment in other parts of Penang Island, instead of the northwestern part.

5. Conclusions

Several factors of landslides, namely distance to roads, distance to rivers, distance to faults, slope angle, slope aspect, curvature, rainfall annual average, lithology, soil series, and land use have been considered in this study. Based on the results obtained from this study, the landslides in Penang Island are spatially influenced by both the slope angle and the slope aspect. It was mentioned before that half of Penang Island is vulnerable to landslides due to its topography, and thus for urbanization activities, cutting slopes is inevitable. Moreover, a higher inclination of the slope can contribute to a higher gravity force for pulling materials down the slope, thereby increasing the risk of landslides. Therefore, a proper perspective and a thorough understanding of the certain slope conditions have to be established to avoid more landslide occurrences in the future and to ensure the conservation, restoration, and sustainable use of terrestrial and inland freshwater ecosystems, as stated in the 15th SDG. In the meantime, the maps generated can indirectly be useful to authorities in terms of coordinating landslide assessment guidelines, as mentioned by Ministry of Housing and Local Government of Malaysia, as well as by decision makers for selecting suitable locations for future development. We suggest that a future study should highlight an in-depth investigation of the potential use of GWLR at various scales so that the method can be fully utilized in other regions to tackle natural disaster impacts.

Author Contributions

Conceptualization, N.A.M.; methodology, S.A.Z.; R.R.; software, S.A.Z.; R.R.; formal analysis, S.A.Z.; N.A.M.; R.R.; writing and original draft preparation, S.A.Z.; writing, review, and editing, S.A.Z.; N.A.M.; R.R.; visualization, N.A.M.; supervision, N.A.M.; project administration, N.A.M.; funding acquisition, N.A.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by a research project of the Ministry of Higher Education (grant number FRGS/1/2019/SS07/UKM/02/1). This article processing charge (APC) is supported by Tabung Agihan Penyelidikan (grant number TAP-K022256).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

This research was supported by FRGS/1/2019/SS07/UKM/02/1. The authors would like to acknowledge the APC support rendered by TAP-K022256. We also would like to thank the Department of Survey and Mapping, the Department of Irrigation and Drainage Malaysia, the Department of Mineral and Geosciences Malaysia, Public Works Department, PLANMalaysia, and Department of Agriculture for sharing the spatial data.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Cruden, D.M.; Varnes, D.J.; Turner, A. Landslides: Investigation and Mitigation; National Academy Press: Washington, DC, USA, 1996; pp. 36–75. [Google Scholar]
  2. Batterson, M.; Liverman, D.G.E.; Ryan, J.; Taylor, D. The assessment of geological hazards and disasters in Newfoundland: An update, Newfoundland (Report No. 99–1): Department of Mines and Energy. Geol. Surv. 1999, 95–123. [Google Scholar]
  3. Sivakumar Babu, G.L.; Mukesh, M.D. Landslide analysis in Geographic Information Systems. Available online: https://tinyurl.com/5n8pctwf (accessed on 18 September 2022).
  4. Hossain, K.M.D.; Lateh, H.; Tien, T.A.Y.I.; Kamil, A.A. Hazard Mapping & Modelling. In Risk Factor for Landslide in Penang, Malaysian Book: Landslides; Lateh, H., Kamil, A.A., Eds.; Japan International Cooperation Agency: Tokyo, Japan, 2015; pp. 115–121. [Google Scholar]
  5. Mersha, T.; Meten, M. GIS-based landslide susceptibility mapping and assessment using bivariate statistical methods in Simada area, northwestern Ethiopia. Geoenvironmental Disasters 2020, 7, 1–22. [Google Scholar] [CrossRef]
  6. Paryani, S.; Neshat, A.; Pradhan, B. Improvement of landslide spatial modeling using machine learning methods and two Harris hawks and bat algorithms. Egypt. J. Remote Sens. Space Sci. 2021, 24, 845–855. [Google Scholar] [CrossRef]
  7. West, J.A. Global Toll from Landslides Is Heaviest in Developing Countries. Available online: https://tinyurl.com/4uufz3u4 (accessed on 23 August 2022).
  8. Ritchie, H.; Roser, M. Natural Disasters. In Our World in Data; The University of Oxford: Oxford, UK, 2014. [Google Scholar]
  9. Elmahdy, S.I.; Marghany, M.M.; Mohamed, M.M. Application of a weighted spatial probability model in GIS to analyse landslides in Penang Island, Malaysia. Geomat. Nat. Hazards Risk 2014, 7, 345–359. [Google Scholar] [CrossRef] [Green Version]
  10. Abd Majid, N.; Taha, M.R.; Selamat, S.N. Historical landslide events in Malaysia 1993-2019. Indian J. Sci. Tech. 2020, 13, 3387–3399. [Google Scholar] [CrossRef]
  11. Kyriou, A.; Nikolakopoulos, K.; Koukouvelas, I.; Lampropoulou, P. Repeated UAV Campaigns, GNSS Measurements, GIS, and Petrographic Analyses for Landslide Mapping and Monitoring. Minerals 2021, 11, 1–26. [Google Scholar] [CrossRef]
  12. Department of Statistics. Lastest Statistics 2019. Available online: https://www.mycensus.gov.my/ (accessed on 18 September 2022).
  13. Pradhan, B.; Lee, S. Landslide susceptibility assessment and factor effect analysis: Backpropagation artificial neural networks and their comparison with frequency ratio and bivariate logistic regression modelling. Env. Model. Softw. 2010, 25, 747–759. [Google Scholar] [CrossRef]
  14. Saadatkhah, N.; Kassim, A.; Lee, M.L. Spatial patterns of precipitation, altitude and monsoon directions in Hulu Kelang area, Malaysia. Electron. J. Geotech. Eng. 2014, 19, 521–534. [Google Scholar]
  15. Mandaglio, M.C.; Gioffre, D.; Pitasi, A.; Moraci, N. Qualitative Landslide Susceptibility Assessment in Small Areas. Procedia Eng. 2016, 158, 440–445. [Google Scholar] [CrossRef] [Green Version]
  16. Zhang, W.G.; Liu, S.L.; Wang, L.Q.; Samui, P.; Chwala, M.; He, Y.W. Landslide Susceptibility Research Combining Qualitative Analysis and Quantitative Evaluation: A Case Study of Yunyang County in Chongqing, China. Forests 2022, 13, 1055. [Google Scholar] [CrossRef]
  17. Pradhan, A.M.S.; Dawadi, A.; Kim, Y.T. Use of different bivariate statistical landslide susceptibility methods: A case study of Kulekhani watershed, Nepal. J. Nepal. Geo. Soc. 2012, 44, 1–12. [Google Scholar] [CrossRef]
  18. Sarangam, A. Bivariate Analysis: An Easy Guide for Beginners in 2021. Available online: https://www.jigsawacademy.com/ (accessed on 18 September 2022).
  19. Javad, M.; Ahmad, B.; Barat, M.; Farshid, S. Using Frequency Ratio Method for Spatial Landslide Prediction. Res. J. Appl. Sci. En. Tech. 2014, 7, 3174–3180. [Google Scholar]
  20. Khan, H.; Muhammad, S.; Khan, M.A.; Bacha, M.A.; Shah, S.U.; Calligaris, C. Landslide susceptibility assessment using Frequency Ratio, a case study of northern Pakistan. Egypt. J. Remote Sens. Space Sci. 2018, 22, 11–24. [Google Scholar] [CrossRef]
  21. Sadisun, I.A.; Telaumbanua, J.A.; Kartiko1, R.D.; Dinata, I.A.; Pamela. Weight of Evidence Method for Landslide Susceptibility Mapping in Sigi Biromaru, Central Sulawesi. Earth Environ. Sci. 2021, 830, 012029. [Google Scholar] [CrossRef]
  22. Abd Majid, N.; Rainis, R.; Wan Ibrahim, W.M.M. Spatial Modeling of Various Slope Failures in Pulau Pinang using Frequency Ratio Method. Geografi 2017, 5, 13–26. [Google Scholar]
  23. Li, L.H.; Nahayo, L.; Habiyaremyed, G.; Christophe, M. Applicability and performance of statistical index, certain factor and frequency ratio models in mapping landslides susceptibility in Rwanda. Geocarto Int. 2022, 37, 638–656. [Google Scholar] [CrossRef]
  24. Selamat, S.N.; Abd Majid, N.; Taha, M.R.; Osman, A. Landslide Susceptibility Model Using Artificial Neural Network (ANN) Approach in Langat River Basin, Selangor, Malaysia. Land 2022, 11, 833. [Google Scholar] [CrossRef]
  25. Castro-Miguel, R.; Legorreta-Paulin, G.; Bonifaz-Alfonzo, R.; Aceves-Quesada, J.F.; Castillo-Santiago, M.A. Modeling spatial landslide susceptibility in volcanic terrains through continuous neighborhood spatial analysis and multiple logistic regression in La Ciénega watershed, Nevado de Toluca, Mexico. Nat. Hazards 2022, 113, 767–788. [Google Scholar] [CrossRef]
  26. Ali, S.A.; Parvin, F.; Pham, Q.B.; Khedher, K.M.; Dehbozorgi, M.; Rabby, Y.W.; Anh, D.T.; Nguyen, D.H. An ensemble random forest tree with SVM, ANN, NBT, and LMT for landslide susceptibility mapping in the Rangit River watershed, India. Nat. Hazards 2022, 113, 1601–1633. [Google Scholar] [CrossRef]
  27. Roy, P.; Ghosal, K.; Paul, P.K. Landslide susceptibility mapping of Kalimpong in Eastern Himalayan Region using a Rprop ANN approach. Earth Syst. Sci. 2022, 131, 130. [Google Scholar] [CrossRef]
  28. Panchal, S.; Shrivastava, A.K. Landslide hazard assessment using analytic hierarchy process (AHP): A case study of National Highway 5 in India. Ain Shams Eng. J. 2022, 13, 101626. [Google Scholar] [CrossRef]
  29. Das, S.; Sarkar, S.; Kanungo, D.P. GIS-based landslide susceptibility zonation mapping using the analytic hierarchy process (AHP) method in parts of Kalimpong Region of Darjeeling Himalaya. Environ. Monit. Assess. 2022, 194, 234. [Google Scholar] [CrossRef] [PubMed]
  30. Lateh, H.; Govindasamy, V. Residents Awareness Towards Landslide Disaster in Penang. Int. J. Sci. Eng. Res. 2012, 3, 1–7. [Google Scholar]
  31. Abd Majid, N.; Wan Ibrahim, W.M.M. GIS in Studying Slope Failure in Penang: Challenges and Potential. In Proceedings of the International Conference on Development and Socio-Spatial Inequalities, Penang, Malaysia, 18–20 August 2015. [Google Scholar]
  32. Radbruch, D.H.; Varnes, D.J. Landslides—Cause and effect. Bull. Int. Assoc. Eng. Geol. 1976, 13, 205–216. [Google Scholar] [CrossRef]
  33. Tay, L.T.; Alkhasawneh, M.S.H.; Lateh, H.; Hossain, M.K.; Kamil, A.A. Landslide Hazard Mapping of Penang Island Using Poisson Distribution with Dominant Factors. J. Civ. Eng. Res. 2014, 4, 72–77. [Google Scholar]
  34. Zhang, M.; Cao, X.L.; Peng, L.; Niu, R. Landslide susceptibility mapping based on global and local logistic regression models in Three Gorges Reservoir area, China. Environ. Earth Sci. 2016, 75, 958. [Google Scholar] [CrossRef]
  35. Chowdhuri, I.; Pal, S.C.; Chakrabortty, R.; Malik, S.; Das, B.; Roy, P. Torrential rainfall-induced landslide susceptibility assessment using machine learning and statistical methods of eastern Himalaya. Nat. Hazards 2021, 107, 697–722. [Google Scholar] [CrossRef]
  36. Cao, J.; Zhang, Z.; Wang, C.; Liu, J.; Zhang, L. Susceptibility assessment of landslides triggered by earthquakes in the Western Sichuan Plateau. CATENA 2019, 175, 63–76. [Google Scholar] [CrossRef]
  37. Chalkias, C.; Polykretis, C.; Karymbalis, E.; Soldati, M.; Ghinoi, A.; Ferentinou, M. Exploring Spatial non-stationary in the Relationships between Landslide Susceptibility and Conditioning Factors: A Local Modeling Approach using Geographically Weighted Regression. Bull. Eng. Geol. Environ. 2020, 79, 2799–2814. [Google Scholar] [CrossRef]
  38. Pravitasari, A.E.; Rustiadi, E.; Mulya, S.P.; Setiawan, Y.; Fuadina, L.N.; Murtadho, A. Identifying The Driving Forces of Urban Expansion and its Environmental Impact in Jakarta-Bandung Mega Urban Region. IOP Conf. Ser. Earth Environ. Sci. 2018, 149, 012044. [Google Scholar] [CrossRef] [Green Version]
  39. Public Health Columbia. Geographically Weighted Regression. Available online: https://www.publichealth.columbia.edu/research/population-health-methods/geographically-weighted-regression (accessed on 20 September 2022).
  40. Erener, A.; Duzgun, H.S.B. Improvement of Statistical Landslide Susceptibility Mapping by Using Spatial and Global Regression Methods in The Case of More and Romsdal (Norway). Landslides 2010, 7, 55–68. [Google Scholar] [CrossRef]
  41. Fotheringham, A.S.; Brunsdon, C.; Charlton, M. Geographically Weighted Regression: The Analysis of Spatially Varying Relationships; John Wiley & Sons: Chichester, UK, 2002. [Google Scholar]
  42. Li, Y.; Huang, S.; Li, J.; Huang, J.; Wang, W. Spatial Non-Stationarity-Based Landslide Susceptibility Assessment Using PCAMGWR Model. Water 2022, 14, 881. [Google Scholar] [CrossRef]
  43. Park, S.Y.; Kim, J.S. A Comparative Analysis of Landslide Susceptibility Assessment by Using Global and Spatial Regression Methods in Inje Area, Korea. Korean Soc. Surv. Geod. Photogramm. Cartogr. 2015, 33, 579–587. [Google Scholar] [CrossRef] [Green Version]
  44. Mennis, J. Mapping the Results of Geographically Weighted Regression. Cartogr. J. 2006, 43, 171–179. [Google Scholar] [CrossRef] [Green Version]
  45. Varnes, D.J. Landslide Hazard Zonation: A review of principles and practice. Commission on landslides of the IAEG, UNESCO. Nat. Hazards 1984, 3, 61. [Google Scholar]
  46. Lee, S.; Jasmi, A.T. Probabilistic landslide susceptibility and factor effect analysis. Environ. Geol. 2005, 47, 982–990. [Google Scholar] [CrossRef]
  47. Abd Majid, N.; Rainis, R.; Wan Ibrahim, W.M.M. Spatial Modeling Various Types of Slope Failure using Artificial Neural Network (Ann) in Pulau Pinang, Malaysia. Jurnal Teknologi 2018, 80. [Google Scholar] [CrossRef]
  48. Abd Majid, N.; Rainis, R. Application of Geographical Information Systems (GIS) and Discriminant Analysis in Modelling Slope Failure Incidence in Pulau Pinang, Malaysia. Sains Malaysiana 2019, 48, 1367–1381. [Google Scholar] [CrossRef]
Figure 1. Landslide impacts worldwide [8].
Figure 1. Landslide impacts worldwide [8].
Sustainability 15 00852 g001
Figure 2. Map of Penang Island, Malaysia.
Figure 2. Map of Penang Island, Malaysia.
Sustainability 15 00852 g002
Figure 3. The overall flowchart of the study.
Figure 3. The overall flowchart of the study.
Sustainability 15 00852 g003
Figure 4. Thematic layer maps: (a) distance to roads, (b) distance to rivers, (c) distance to faults, (d) slope angle, (e) slope aspect, (f) curvature, (g) annual rainfall average, (h) soil series, (i) lithology, and (j) land use.
Figure 4. Thematic layer maps: (a) distance to roads, (b) distance to rivers, (c) distance to faults, (d) slope angle, (e) slope aspect, (f) curvature, (g) annual rainfall average, (h) soil series, (i) lithology, and (j) land use.
Sustainability 15 00852 g004aSustainability 15 00852 g004bSustainability 15 00852 g004cSustainability 15 00852 g004dSustainability 15 00852 g004e
Figure 5. Estimated parameter values for landslide factors: (a) distance to roads, (b) distance to rivers, (c) distance to faults, (d) slope angle, (e) slope aspect, (f) curvature, (g) annual rainfall average, (h) soil series, (i) lithology, and (j) land use.
Figure 5. Estimated parameter values for landslide factors: (a) distance to roads, (b) distance to rivers, (c) distance to faults, (d) slope angle, (e) slope aspect, (f) curvature, (g) annual rainfall average, (h) soil series, (i) lithology, and (j) land use.
Sustainability 15 00852 g005aSustainability 15 00852 g005bSustainability 15 00852 g005cSustainability 15 00852 g005dSustainability 15 00852 g005eSustainability 15 00852 g005f
Figure 6. t-values of landslide factors: (a) distance to roads, (b) distance to rivers, (c) distance to faults, (d) slope angle, (e) slope aspect, (f) curvature, (g) annual rainfall average, (h) soil series, (i) lithology, and (j) land use.
Figure 6. t-values of landslide factors: (a) distance to roads, (b) distance to rivers, (c) distance to faults, (d) slope angle, (e) slope aspect, (f) curvature, (g) annual rainfall average, (h) soil series, (i) lithology, and (j) land use.
Sustainability 15 00852 g006aSustainability 15 00852 g006bSustainability 15 00852 g006cSustainability 15 00852 g006dSustainability 15 00852 g006e
Table 1. Data sources.
Table 1. Data sources.
DataSources
LandslidesPublic Works Department and field survey
Road networkOpen source
River networkDepartment of Irrigation and Drainage
Contour mapDepartment of Survey and Mapping Malaysia (JUPEM)
Land use mapPLANMalaysia
Geology mapDepartment of Minerals and Geoscience Malaysia
Soil Series mapDepartment of Agriculture
Rainfall daily measurementsDepartment of Irrigation and Drainage
Table 2. Multicollinearity diagnostics between explanatory variables.
Table 2. Multicollinearity diagnostics between explanatory variables.
Explanatory VariablesCollinearity Statistics
ToleranceVIF
Distance to road0.8111.234
Distance to river0.7151.399
Distance to fault0.7201.389
Aspect0.9431.061
Slope0.6131.632
Curvature0.9731.028
Land use0.9211.086
Lithology0.6031.659
Rainfall annual average0.9571.045
Soil0.7551.324
Table 3. Comparison diagnostics of global regression and GWLR models.
Table 3. Comparison diagnostics of global regression and GWLR models.
CriteriaModels
Global RegressionGWLR
Deviance849.931675739.175630
AICc872.202167800.856998
BIC/MDL925.784185945.196942
pdev0.3794570.460321
Table 4. Coefficient values for each variable.
Table 4. Coefficient values for each variable.
VariableCoefficients
MeanSTDMinMaxRange
Intercept7.0780984.3329531.90940317.83289015.923487
DistRoad−0.0039200.000640−0.005496−0.0025980.002898
DistRiver−0.0010080.000776−0.0032400.0003150.003555
DistFault−0.0005750.000249−0.001467−0.0001680.001300
Aspect0.0018560.000979−0.0003310.0030780.003409
Slope0.0131080.029410−0.0347430.0541240.088867
Curvature−0.3197390.167111−0.730562−0.0401480.690413
Rainfall average−0.0003290.000652−0.0016700.0005920.002263
Soil series−0.9427030.484519−2.336661−0.4190981.917564
Lithology−0.4071500.217785−0.9867680.1153801.102149
Land use−0.0705630.085388−0.2939050.0731510.367056
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zulkafli, S.A.; Abd Majid, N.; Rainis, R. Spatial Analysis on the Variances of Landslide Factors Using Geographically Weighted Logistic Regression in Penang Island, Malaysia. Sustainability 2023, 15, 852. https://0-doi-org.brum.beds.ac.uk/10.3390/su15010852

AMA Style

Zulkafli SA, Abd Majid N, Rainis R. Spatial Analysis on the Variances of Landslide Factors Using Geographically Weighted Logistic Regression in Penang Island, Malaysia. Sustainability. 2023; 15(1):852. https://0-doi-org.brum.beds.ac.uk/10.3390/su15010852

Chicago/Turabian Style

Zulkafli, Syaidatul Azwani, Nuriah Abd Majid, and Ruslan Rainis. 2023. "Spatial Analysis on the Variances of Landslide Factors Using Geographically Weighted Logistic Regression in Penang Island, Malaysia" Sustainability 15, no. 1: 852. https://0-doi-org.brum.beds.ac.uk/10.3390/su15010852

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop