1. Introduction
Land cover classification is important in enabling detailed studies of temporal and spatial environmental change, land resource management, and sustainable development [
1,
2]. Changes in land cover can affect the carbon (C) balance; for example, a study in Shandong Province, China, showed that, between 2010 and 2020, land cover change resulted in the loss of 106 × 10
4 t C stored in vegetation [
3].
The classification of land cover is usually based on natural geographic features such as vegetation type, climatic conditions and topographic features that enable the construction of different types of thematic classification, e.g., urban land [
4], biogeoclimatic ecosystems [
5], and forest types [
6]. Land classification methods traditionally rely on the historical data of land classification and field observations than can require a large amount of time and resources to process, as image-based land classification was mainly achieved through the visual interpretation of photogrammetry. Subsequently, the availability of remotely sensed data enabled land classification based on the statistical analysis of spectral features extracted from image pixels [
7]. As the availability and diversity of multi-source remote sensing data has increased there have been opportunities to greatly improve the accuracy of land classification.
Remote sensing has an important role in determining land cover types, as multi-sensor-derived waveband information can be used to classify land use cover quickly and reproducibly at different temporal and spatial scales [
8]. For example, Tadese et al. [
9] used remote sensing data as a basis for analyzing and understanding the long-term dynamics of land use and land cover change in the Awash River Basin. Remote sensing imagery can also be used to generate macro time series land cover datasets for a region, country, or even globally. An example is the Global Land Cover 30 series (GlobeLand30) dataset, which consists of ten primary land cover classes, i.e., water bodies, wetland, artificial surfaces, cultivated land, permanent snow/ice, forest, shrubland, grassland, bareland, and tundra [
10]. The release of GlobeLand30 provided a database for large-scale land cover change studies and has been used for large regional-scale studies [
11]. Whilst the above studies demonstrate the value of land classification at the spatial scale, the datasets are only available for specific years and are not regularly updated, as the spectral characteristics of land cover or landscape features can vary interannually. As a result, the sample points selected for analysis in one year are not optimal for other years, which can create issues related to training datasets and model migratability [
12]. To resolve this limitation, a sample point migration approach was developed, which enables the migration of classification thresholds for a feature from a single chronology to a long time series dataset [
13].
The Google Earth Engine (GEE) has been recognized as a powerful tool for processing large-scale Earth observation data, with the ability to access and process large amounts of multi-source, multi-scale, and time series remote sensing data via a cloud platform [
14]. The GEE provides access to a variety of datasets in an integrated system, including various satellite image sources, geophysical data, climate data, and demographic data that facilitate the use of time series and multi-source datasets for land cover mapping [
15,
16]. For example, Sidhu et al. [
17] made use of the GEE platform’s utility in processing raster and vector image manipulations for the spatio-temporal analysis of urban and wetland land cover types in two subregions of Singapore, affirming the spatio-temporal analysis capabilities of GEE. However, most existing studies focus on one land cover type or generate land cover maps for certain areas at specific times of image collection. As a result, these studies often find it difficult to incorporate long time series datasets. The utility of the GEE for land cover detection using annual Landsat-derived normalized difference vegetation index time series data was demonstrated by Huang et al. [
18] to create a dynamic map of the land cover change in Beijing over a 30-year period with an overall accuracy of 86.61%
The multi-petabyte curated catalogue of the geospatial datasets available in the GEE permits and improves classification results by reducing the likelihood of dataset gaps and uncertainty through the provision of multiple sources of data [
19]. Multi-source remote sensing data is particularly effective at improving the efficiency of land cover classification as the data fusion and integration of spectral, spatio-temporal, and thermal information from multiple sensors can improve the accuracy of classification [
20]. For example, Li et al. [
21] generated a land cover map of the entire African continent at a resolution of 10 m using a combination of Sentinel-2, Landsat-8, Nighttime Light, and MODIS data.
Machine learning algorithms such as maximum likelihood [
22], support vector machines [
23], and random forest (RF) [
24] are recognized as accurate and effective methods for analyzing large dimensional and complex spatio-temporal data when compared to traditional parametric algorithms [
25]. The selection of a good classification method is a key factor in the classification process that is dependent on the analysis objectives; for example, RF is one of the most frequently used supervised machine learning methods due to its high efficiency and accuracy in identifying single-class elements such as urban number spaces [
26] in remotely sensed imagery, as well as its ability to distinguish between multiple land types [
27,
28], time series data [
29], and complex farming areas [
30]. The improvement of machine learning methods to achieve efficient, fast, and accurate land classification for long time series remains a focus of research.
In this study, we implemented the RF classifier in the GEE to perform time series land classification at different spatial scales with Landsat-8 and Sentinel-2 datasets for the vegetation growing season in 2022. Our overarching aim was to use different land classification models constructed using multi-source remote sensing variables to establish an efficient, accurate, and general land classification model for time series datasets, and to identify land classification sample points and migration thresholds based on the differences in the sample point image values without land classification changes. Our objectives were to (1) determine the threshold value of sample point migration based on no change in land class; (2) analyze the accuracy of the land classification model produced using a 36-year time series of Landsat remote sensing imagery and high-precision Sentinel imagery based on those threshold values; and (3) determine the optimal RF land classification model based on different combinations of multi-source remote sensing variables and compare the impact of image resolution on the classification accuracy.
2. Materials and Methods
2.1. Study Area
Shanxi Province is located within the Loess Plateau and the Yellow River Basin (N34°34′–40°44′, E110°14′–114°33′) and occupies a total area of 156,700 km
2. Mountains account for more than 80% of the total surface area of the region, with its topography highest in the northeast and lowest in the southwest, with an average altitude of 1500 m. Shanxi Province is an important coal energy base in China, with its retained reserves of coal resources reaching 270.9 billion metric tons. Additionally, Shanxi Province contains seven national nature reserves and is an important ecological barrier between mining activities and the Yellow River Basin. Within the Jinzhong coal base of Shanxi Province, the Huodong National Coal Planning Area covers a total area of 4110 km
2. The region is widely forested and includes the Taiyue Mountain National Forest Park that is an intimate mix of mining and forestry operations. The study area and land classification sample sites are shown in
Figure 1.
2.2. Data Sources
The Landsat series of satellites collect data at a resolution of 30 m and have been providing fundamental data for long time series scientific research on a global scale since their launch in 1972. In this study, remotely sensed data from 1 June 2022 to 31 August 2022 was used to capture the spectral reflectance of vegetation and assist in the identification and extraction of information on land cover types, such as forests and grasslands, while effectively distinguishing bare ground and other landscape features.
Sentinel-2 satellite data offers 13 spectral bands, which include four 10 m, six 20 m, and three 60 m spatial resolution bands. MultiSpectral Instrument (MSI), Level-1C data is the standard of the Sentinel-2 archive and represents the top of the atmosphere (TOA) reflectance. Sentinel-2 imagery is commonly used to monitor land use and land cover change on a global scale and is designed to provide high-resolution, multispectral remote sensing data for monitoring surface change and environmental conditions.
In addition to the above images, we used the National Aeronautics and Space Administration (NASA) digital elevation model (DEM) and Sentinel-1 synthetic aperture radar (SAR) as multi-source remote sensing images for land classification. All the multi-source remote sensing images involved in land classification are shown in
Table 1.
The workflow of this analysis comprised the four phases described below: (1) pre-processing acquired imagery; (2) sample-point threshold acquisition; (3) land classification; and (4) accuracy assessment (
Figure 2).
2.3. Image Pre-Processing
The pre-processing of optical remote sensing images included image stitching, de-clouding, mosaicking, and cropping. In particular, the image de-clouding methods all remove clouds and cloud shadow elements by labeling the QA quality bands of Landsat and Sentinel-2 data and operating the mask bit by bit. The mosaic processes of the images were fused using the median method, which in turn resulted in the Landsat series of images from 1986–2022 and Sentinel-2 remote sensing images of the vegetation growing seasons from 2019–2022, respectively.
The Sentinel-1 polarized data from the GEE has officially undergone ground range detection (GRD) boundary noise removal, thermal noise removal, radiometric calibration, and radiometric correction processes. In this study, the VV and VH polarization bands in the interferometric wide swath (IW) mode, which are suitable for remote sensing studies of land surfaces, were selected. The DEM data were reprojected and resampled to extract variables such as elevation, slope, and aspect as topographic factors to participate in the construction of the land classification model.
2.4. Sample Point Selection
The land classification of Shanxi Province was divided into six types: forest land, grassland, arable land, bare land, water bodies, and impervious surfaces. Additionally, a mining land type was added to the land classification system to account for the Huodong mining area and to assist in the differentiation of the mining and forest in the Taiyue Mountain National Forest Park complex area.
Fixed sample points for different land classifications were selected by importing the sample points into Google Earth to determine their accuracy by comparing high-resolution remote sensing images. A total of 1507 sample points from the Landsat imagery and 1235 sample points from the Sentinel imagery were selected. In total, 70% of the sample points were used as training sample points and 30% as validation sample points in the classification process; the specific land classification sample points are shown in
Table 2.
2.5. Technical Method
2.5.1. Sample Migration
Spectral features and indices are common methods used to analyze remotely sensed imagery. Spectral features are calculated from ratios or differences between the reflectance or emissivity in different bands of the remotely sensed image. These features and indices can be used to extract feature information, monitor vegetation cover, and monitor water quality, among other things. In this study, the Normalized Difference Vegetation Index (NDVI), Normalized Difference Built-up Index (NDBI), Normalized Difference Water Index (NDWI), and Difference Vegetation Index (DVI) were used to calculate the difference in values between the forest, grassland, and cropland, respectively, from year to year, and NDBI and DVI were used to calculate the difference between built-up (working) land and bare land. The spectral characteristics of unchanged land types are counted over a number of years so that a reasonable range of thresholds can be determined. In the GEE, the ee.spectralDistance function was used for image difference statistics. The main purpose of this function is to compute the per-pixel spectral distance between two images.
2.5.2. Random Forest Algorithm
Random forest was used to train a decision tree with randomly selected samples and features from the dataset, with the results of the decision trees assessed to obtain a combined result. The advantage of using the RF algorithm is that it avoids the problem of overfitting and is reliable for handling data such as missing values and outliers.
2.5.3. Feature Model Construction
Comparison of the single and multi-source remote sensing variables was conducted by combining different dimensions of remote sensing variables to investigate their influence on the land classification results. Four remote sensing feature variables were selected: spectral band, spectral index, Terrain features, and SAR data, with the specific variable factors shown in
Table 3. In the construction of the multi-source remote sensing variables, five combinations of spectral band, spectral band + spectral index, spectral band + SAR, spectral band + spectral index + SAR, and spectral band + spectral index + terrain features + SAR were used, respectively.
2.5.4. Accuracy Assessment
Accuracy of the classification results was determined by calculating the overall accuracy (OA) as a ratio of the number of samples correctly classified to the total number of samples, which is a common measure of classifier performance. The Kappa coefficient is a statistic used to measure the agreement between classifiers or evaluators. It can be used to assess the agreement between two evaluators on a classification task. Kappa coefficient values range from −1 to 1, with higher values indicating a better agreement.
4. Discussion
In this study, the utility of the GEE cloud computing platform for building land cover classification models using multiple sources of Landsat and Sentinel remote sensing imagery at different spatial resolutions over a 36-year time series was assessed. High-accuracy spatiotemporal land cover classification maps can help to reveal the impact of human activities such as coal mining and urban expansion on land use over time, which could enhance our understanding of the impact of population growth and changes in demography, and provide an evidence base to facilitate future government policy decisions; for example, creating accurate assessments of the spatiotemporal changes in forest C stocks in the context of C accounting and net-zero targets [
31].
Cloud computing platforms such as GEE, PIE-Engine, and AI Earth have improved our access to the high-performance computing necessary to process large and complex datasets and facilitated an increase in both the speed and accuracy of land cover classification. The approach used in this study was to use the GEE platform to conduct classification based on sample point migration and determine the sample point threshold value required to detect a land cover classification change. The selection of the sample points’ migration method has the advantage of not requiring new sample points to be chosen for each time period image, thereby improving the efficiency of the classification process [
13].
The fusion of multi-source remote sensing data into composite data products has been shown to improve the accuracy of land cover classification [
32]. In this study, when assessing the classification of both Landsat and Sentinel multispectral images, differences in the classification for crops and grassland were apparent because the imagery obtained for the vegetative growing season did not have substantial differences in the image spectra between grassland and crops. This finding supports the need for multiple sources of remotely sensed images obtained with different sensors (e.g., the SAR and multispectral data available in the Landsat and Sentinel series of images) to accurately classify land cover.
In our comparison of publicly available land classification products (
Table 8), the land classification results for forested land ranged between 1136.74 and 1418.27 km
2 for the Google and ESA products, respectively, which is broadly consistent with our own classification of the forest land area. The higher estimation of forested land area by the ESA product is likely due to its inclusion of sparse forest land in the classification of forested land, whereas the variance between the other four products is only 50.88 km
2 despite their differences in image resolution.
Land cover classification based on the non-parametric RF algorithm is able to handle multi-dimensional and non-linear data sources whilst also removing the requirement for a balanced number of individual sample points [
27], unlike the non-parametric minimum distance, maximum likelihood, and Bayesian classification methods. The combination of multi-source remote sensing data and the RF method has been shown to perform land cover classification effectively and accurately. Random forest methods of land cover classification have generated higher accuracy outputs compared to other non-parametric machine learning methods such as support vector machines and artificial neural networks [
33,
34].
The generation of accurate land cover maps over a 36-year period has several challenges relating to the detection of land cover change and technological advances. Sensor technology is continually evolving, which has improved the diversity, quality, and quantity of the remote sensing datasets available for analysis. The difference in the satellite sensors between the Landsat-8 Operational Land Imager (OLI) and Sentinel-2 MultiSpectral Instrument (MSI) did not have a large impact on the land cover classification results of the same area, despite the higher resolution of the Sentinel-2-acquired datasets, which, in theory, should facilitate a more accurate land cover classification and reduce the misclassification of features and the necessity of filtering imagery [
35,
36]. However, the fusion of multi-source remote sensing datasets that incorporate textural features [
37] has resulted in greater improvements in classification than relying on the increased resolution of images. For example, the fusion of datasets from different sensors has been shown to improve the accuracy of land classification [
38], forest biomass estimation [
39], and natural disaster monitoring [
40]. The complex topography and forest species’ composition and density in the typical mountainous mining area used in this study demonstrated that the effective integration of topographic features such as elevation and slope can be more conducive to distinguishing forests from buildings and crops.