Next Article in Journal
Forest Road Detection Using LiDAR Data and Hybrid Classification
Next Article in Special Issue
Using Time Series Sentinel-1 Images for Object-Oriented Crop Classification in Google Earth Engine
Previous Article in Journal
Foliage Biophysical Trait Prediction from Laboratory Spectra in Norway Spruce Is More Affected by Needle Age Than by Site Soil Conditions
Previous Article in Special Issue
Rice-Field Mapping with Sentinel-1A SAR Time-Series Data
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Crop Height Estimation of Corn from Multi-Year RADARSAT-2 Polarimetric Observables Using Machine Learning

1
School of Geography and Information Engineering, China University of Geosciences (Wuhan), Wuhan 430074, China
2
Institute for Computer Research (IUII), University of Alicante, E-03080 Alicante, Spain
3
Department of Geography and Environment, The University of Western Ontario, London, ON N6A 5C2, Canada
4
Institute for Earth and Space Exploration, The University of Western Ontario, London, ON N6A 3K7, Canada
5
Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, ON K1A 0C6, Canada
6
School of Geosciences and Info-Physics, Central South University, Changsha 410083, China
*
Author to whom correspondence should be addressed.
Submission received: 31 December 2020 / Revised: 18 January 2021 / Accepted: 19 January 2021 / Published: 23 January 2021

Abstract

:
This study presents a demonstration of the applicability of machine learning techniques for the retrieval of crop height in corn fields using space-borne PolSAR (Polarimetric Synthetic Aperture Radar) data. Multi-year RADARSAT-2 C-band data acquired over agricultural areas in Canada, covering the whole corn growing period, are exploited. Two popular machine learning regression methods, i.e., Random Forest Regression (RFR) and Support Vector Regression (SVR) are adopted and evaluated. A set of 27 representative polarimetric parameters are extracted from the PolSAR data and used as input features in the regression models for height estimation. Furthermore, based on the unique capability of the RFR method to determine variable importance contributing to the regression, a smaller number of polarimetric features (6 out of 27 in our study) are selected in the final regression models. Results of our study demonstrate that PolSAR observables can produce corn height estimates with root mean square error (RMSE) around 40–50 cm throughout the growth cycle. The RFR approach shows better overall accuracy in corn height estimation than the SVR method in all tests. The six selected polarimetric features by variable importance ranking can generate better results. This study provides a new perspective on the use of PolSAR data in retrieving agricultural crop height from space.

Graphical Abstract

1. Introduction

Crop height is an important agronomic descriptor related to crop type, biomass estimation, phenological stage, potential yield, detection of growth anomalies (e.g., diseases, pests, weather disasters, and cereal lodging), and precision fertilization [1,2,3]. Traditional methods to monitor crop height by visual inspection require a huge workforce over large areas [4]. Synthetic Aperture Radar (SAR), with its capability of imaging in day and night and all weather conditions and its sensitivity to the geometric and physical properties of the target, has shown to be an effective remote sensing technique in crop biophysical parameter retrieval at regional and global scales. For crop height estimation with SAR observations, the approaches can be generally categorized into three types: backscattering model methods, interferometric SAR (InSAR) methods, and data-driven empirical model methods [1].
The backscattering models for crop height retrieval include both physical models and semi-empirical models. The physical model developed for crop height usually adopts a discrete approach, such as the Radiative Transfer Theory Model (RTM), which is able to simulate the backscattering coefficient for crop targets from the perspective of fine electromagnetic scattering as a function of various geometric and physical parameters of the plant, such as canopy height, dielectric constant, number of layers and leaves, leaf angle and size, stem width, and so on [5,6,7,8,9,10,11,12]. The physical scattering model usually depends on the polarization and crop type. Due to the complexity of the physical scattering model, crop height estimation may require computationally expensive Monte Carlo simulations to relate the SAR measurements to parameters describing the entire canopy’s physical characteristics [1]. Moreover, the inversion process of model parameters often leads to ill-posed problems due to a high-dimensional parameter space [1,4,13,14]. Although the merging of a metamodel (e.g., the polynomial chaos expansion (PCE)) with the backscattering model enables a significant reduction of the computational cost and the complexity involved in the inversion scheme, the growth stage needs to be identified in advance to narrow the solution space [4,13,14]. For vegetation, the most popular semi-empirical model is the Water Cloud Model (WCM) proposed by Attema and Ulaby [15]. Due to its simplicity and practicability, the WCM has been extensively applied to soil moisture estimation and to the retrieval of various vegetation biophysical variables, such as leaf area index (LAI), aboveground biomass, and vegetation height [16,17,18]. However, the retrieval results from the original WCM often show low accuracies attributed to many assumptions and simplifications involved in the model. In past studies, many modifications of the WCM have been developed by considering more complex scattering mechanisms or more vegetation geometrical properties. Moreover, there were some studies reporting calibration of the model coefficients of the WCM for specific areas, vegetation types, or SAR sensors based on some ground measurements [19].
The InSAR method exploits the interferometric phase between two co-registered SAR images acquired in the same polarization to capture the height of the scattering phase center [20], which is approximately considered as the crop surface height in agricultural areas, like a digital surface model. Then, an external digital terrain model (DTM), also called vegetation-free digital elevation model (DEM), is required to derive the crop height itself [1]. In order to obtain accurate crop surface height, it needs some strict conditions in general. For example, the available SAR data is expected to work with short wavelengths and appropriate polarization to enable the scattering phase center to be located as close as possible to the top of the canopy. For the same purpose, a structurally dense crop is required as well. Since the crop height is usually very low with respect to forest height, a relatively large spatial baseline is required to reduce the height of ambiguity. Moreover, a short enough revisit time is expected to obtain high coherence since crop height is assumed unchanged within this time interval. In addition, the quality of the DTM data used to remove the underlying topography from crop surface height is also an unignorable factor. Although recently, single-polarization InSAR data at the L- and P-band have proven its capability to generate a comparable performance in DTM inversion in forest areas with respect to the traditional PolInSAR method with fully polarimetric SAR data [21], currently, the accurate DTM product over a vegetation area is mostly generated from other measurement technologies, such as light detection and ranging(LIDAR), polarimetric SAR interferometry (PolInSAR) [22,23,24], SAR Tomography (TomoSAR) [25], and field topographic mapping. The PolInSAR method combines the interferometric and polarimetric information to better separate the different scattering phase centers in the vegetation volume, which has been demonstrated to be useful for estimating vegetation structural parameters [26,27]. For vegetation height estimation, the PolInSAR technique has been validated in a variety of forest types with many airborne and few spaceborne datasets at different radar frequencies [22,23,28,29,30,31,32,33,34,35,36,37,38,39]. The performance is constrained by two key aspects: temporal decorrelation and spatial baseline. A shorter revisit time is expected to provide higher coherence, which is related to the quality of the interferometric phase [40]. A relatively large baseline is expected to provide enough sensitivity of height measurement. Since crops grow faster and have shorter heights than forests, the availability of PolInSAR data for crop height estimation is more constrained than for forest height inversion [1,2,3]. Until now, few successful examples of crop height retrieval with the PolInSAR method are restricted to data acquired in indoor experiments [41,42] and airborne campaigns [43,44]. With regard to spaceborne datasets, several authors have reported successful results with a dataset over paddy rice fields acquired from the science phase of the TanDEM-X mission (bistatic configuration) from April to September 2015, being the baselines especially adjusted to ten times the regular configuration, i.e., around 2–3 km [1,3]. Although the PolInSAR method shows the capability to produce accurate estimates of crop height, currently the available PolInSAR datasets for crop height monitoring are very limited.
Another available option to estimate crop height is the data-driven empirical model method, which in this study refers to the use of a regression approach to train an empirical model between some PolSAR observables and crop height. The unknown crop heights in a scene are predicted by the trained regression model and the corresponding PolSAR observables. A large number of previous studies have been reported to investigate the correlations between PolSAR observables (e.g., backscattering coefficients, polarimetric decomposition parameters, radar vegetation index, and correlation coefficients), and crop parameters (e.g., LAI, PAI, biomass, phenological stages, canopy coverage, and crop height) over different crop fields at different radar frequencies [45,46,47,48,49,50,51,52,53,54]. These research results have shown the potential of PolSAR observables for crop parameter retrieval. However, to date, there are few studies reporting crop height retrieval based on regression with PolSAR observables [55,56,57]. In these studies, due to the limitation of images available and field data collected, the volume of available observed samples for training is in general relatively small. The crop types studied are also limited, such as sunflower, wheat, and canola. Moreover, the number and types of selected PolSAR observables are limited, and an empirical relationship model or linear/polynomial regression model is usually chosen [55,56,57].
Due to the aforementioned limitations of the backscattering model methods and interferometric SAR methods in practice, this study is focused on data-driven empirical model methods. More specifically, the present study aims at providing a comprehensive demonstration and validation of crop height retrieval of corn by exploiting a large number of PolSAR observables and ground measurements with machine learning regression methods. A large dataset formed by multi-temporal C-band RADARSAT-2 (launched by the Canadian Space Agency) images and quasi-synchronous in situ measurements of crop height along three years over two geographically close study sites in Canada, with similar agricultural practices and climatic conditions, are exploited here. Corn has been chosen as the target crop for this experiment because it has significant socio-economic interest for humans’ staple food, a raw material of ethanol and animal feed, and it is the cereal with the highest production worldwide. Moreover, the height range of corn during the whole growing period is relatively large, reaching over 3 m in our study at the final stage, which results in more radar signature differences within the time-series SAR acquisitions. In addition, radar response to corn with RADARSAT-2 data have been well studied in previous literature [42,45,46,48,50,58,59,60]. Two typical machine learning regression methods, i.e., Random Forest Regression (RFR) and Support Vector Regression (SVR), are adopted for model training instead of linear regression. Moreover, the unique ability of the RFR method to provide variable importance contributing to the regression can help us understand the results and further investigate the performance after filtering the selection of input PolSAR observables.

2. Materials and Methods

2.1. Study Site and Dataset

As shown in Figure 1, two geographically close sites both located in Southwestern Ontario, Canada, were selected. One site was located in the west of London, and the other one was near Stratford. Both study sites were agricultural areas including mainly crop fields, a few buildings, and forests. Corn, wheat, and soybean were the dominant crop types in both study sites. There was also some alfalfa, hay, and grass growing in these areas. Both study sites were suitable for cultivating crops because of abundant precipitation, mild weather, and productive soil, with relatively flat topography. In both study areas, corn and soybean were seeded in May and harvested in October of the same year. In contrast, the winter wheat growing period crossed over two years, seeded in October and harvested in July of the following year. It should be noted that in both study areas, crop rotation did not need to be done in the same field. The farmers’ practice was to retain residuals for soil conservation. Therefore, one crop field may harvest residuals from another crop field from a previous year. For example, the cornfields may have residuals of wheat or soybean. All cornfields employed for height inversion and the locations of the sample points for collecting ground measurements are marked in green and red in Figure 1, respectively.
In total, 19 scenes of Fine Quad-Pol Wide (FQW) RADARSAT-2 data acquired in 2013, 2015, and 2018 were employed in this study. More specifically, eight scenes were acquired over the Stratford site in 2013 covering the whole corn growing period. Four scenes in 2015 and seven scenes in 2018 were acquired over the London site, which mainly covered the late growth stages. Table 1 shows the acquisition dates, beam modes, incidence angles, resolutions, and orbits of all RADARSAT-2 images.
For each year, ground measurements including crop height, crop type, ground photos, soil moisture, and crop phenological stage were recorded. Due to the limitations of weather conditions, human resources, and other reasons, the in situ fieldwork was sometimes not conducted on the exact dates that the RADARSAT-2 satellite overpassed. Since the maximum offset of time was only three days (which occurred on 1 July 2018), it was reasonable to assume that the corn heights did not change from the dates of satellite acquisitions to the ground campaign dates. As it is shown in Table 1, the final number of sample points for each image was different because of limitations due to weather conditions, human resources, and other logistical reasons. For the field campaign in 2013, 17 sample points per day within a maximum of five corn fields were selected for collecting ground measurements. Finally, 112 corn sample points over the Stratford site were collected. For the ground campaign in 2015, 25 sample points per day at a maximum of four cornfields were selected to conduct measurements. In total, 43 corn sample points over the London site were collected. For the ground campaign in 2018, 32 sample points per day in a maximum of four cornfields were exploited. In total, 192 corn sample points over the London site were collected. In summary, a total of 347 corn sample points were collected from the multi-year fieldwork campaigns. Three replicate height readings per sample point were carried out in 2013 and 2015, while twelve readings were conducted in 2018. The measured corn height had a wide range, with values between 3.5 cm and 333.75 cm. The average value of corn height of the ground samples on each fieldwork date ranged from 5.75 cm to 276.72 cm.

2.2. Polarimetric Observables

For a fully polarimetric SAR system, the acquired single look complex (SLC) data in H-V polarization basis can be represented by a 2 × 2 scattering matrix, i.e., [61,62],
S = [ S H H S H V S V H S V V ]
where S i j ( i , j = H   o r   V ) represents the scattering coefficient from transmitted polarization i and received polarization j . The scattering matrix S is used to describe a “pure single target” or deterministic target. For distributed targets, typical in natural media, the second-order statistics (covariance matrix or coherency matrix) are usually exploited to conduct polarimetric analysis. Under the assumption of reciprocal scattering ( S H V = S V H ), the lexicographic basis vector and the Pauli basis vector, respectively, can be expressed as [61,62]
k l = [ S H H 2 S H V S V V ] T
k P = 1 2 [ S H H + S V V S H H S V V 2 S H V ] T
Then, the corresponding covariance matrix C and the coherency matrix T are given as [61,62]
C = k l k l * T = 1 2 [ | S H H | 2 2 S H H S H V * S H H S V V * 2 S H V S H H * 2 | S H V | 2 2 S H V S V V * S V V S H H * 2 S V V S H V * | S V V | 2 ]
T = k P k P * T = 1 2 [ | S H H + S V V | 2 ( S H H + S V V ) ( S H H S V V ) * 2 ( S H H + S V V ) S H V * ( S H H S V V ) ( S H H + S V V ) * | S H H S V V | 2 2 ( S H H S V V ) S H V * 2 S H V ( S H H + S V V ) * 2 S H V ( S H H S V V ) * 4 | S H V | 2 ]
Based on these polarimetric observation matrices, a large number of polarimetric parameters can be extracted for crop monitoring applications [47,48,63,64]. According to their wide usage in crop monitoring studies, a total of 27 polarimetric observables were chosen in this study, as listed in Table 2. First, as the basic products provided by a fully polarimetric SAR system, radar backscattering coefficients in co-polar channels (HH, VV) and cross-polar channel (HV) were chosen, which corresponded to the diagonal elements (C11, C22, C33) in the covariance matrix. Due to their clear physical interpretation in terms of scattering mechanisms, radar backscattering coefficients in the Pauli channels were considered, which could be extracted from the coherency matrix (T11, T22). The widely used total backscattering power, SPAN, was also selected, which was extracted from the sum of the diagonal elements in either the covariance matrix or the coherency matrix (C11 + C22 + C33 or T11 + T22 + T33). The correlation and phase between polarimetric channels (in both the linear and the Pauli basis) were also exploited, which provided additional information about the scattering from the scene. In past studies, these observables have proven very useful for crop phenology monitoring and crop state detection based on multi-temporal analysis of the radar polarimetric response [48,63,65,66,67]. They provided four complex correlation coefficients, which resulted in eight real observables (amplitudes and phases). Moreover, three backscattering ratios between different linear channels (HH/VV, HV/HH, and HV/VV) were also considered, which had proven to be sensitive to target characteristics. A widely used approach for generating polarimetric features is polarimetric target decomposition, which can be generally categorized as either coherent polarimetric decomposition or incoherent decomposition [61,62]. Due to the capacity to describe distributed targets, incoherent polarimetric decomposition approaches are more suitable for interpreting most nature targets [61,62]. They can be further divided into model-based decomposition and eigenvector-eigenvalue based decomposition. As the pioneering and one of the most popular model-based decomposition methods, the Freeman-Durden three-component decomposition generates three scattering power parameters representing surface, double-bounce, and volume scattering mechanisms, respectively (Freeman and Durden, 1998), which were selected in this study. Additionally, three polarimetric parameters with clear physical meanings from the representative eigenvector-eigenvalue based decomposition proposed by Cloude and Pottier were also used, including the polarimetric scattering entropy H (the degree of scattering randomness in the scattering medium), the alpha angle α (the average dominant scattering mechanism), and the polarimetric anisotropy A (the relative importance between the second and the third scattering mechanism) [68].
Another model-based decomposition, Neumann decomposition, is aimed at describing vegetation scattering by considering the morphological characteristics of vegetation in scattering modeling [36,69,70]. It has shown the potential advantage of identifying more types of volume scenes [71]. Moreover, two of its output parameters have proven to provide physical meanings similar to Cloude-Pottier decomposition outputs [36,69,70]. A recent study has shown that the third parameter in the Neumann decomposition, the phase of the particle scattering anisotropy, is more effective in improving the classification accuracy with respect to the Cloude-Pottier decomposition [72]. Therefore, the three output parameters from the Neumann decomposition were employed. In addition, the radar vegetation index (RVI) [73] has shown high sensitivity to crop morphological features and thus was also considered in this study.
For each RADARSAT-2 image, a series of preprocessing steps including calibration, speckle filter, and geocoding was conducted. Sigma naught values were obtained by the calibration. A 9 × 9 boxcar filter was applied to reduce the speckle noise. Then, a coherency matrix was generated at each pixel. A Digital Elevation Model (PDEM) of the Ontario province, Canada, with a spatial resolution of 30 m was used to geocode the coherency matrix of each image in the Universal Transverse Mercator (UTM) geographic reference. Afterward, 27 polarimetric features listed in Table 2 were extracted for each image. The spatial resolution of all the polarimetric features was the same because they followed the same processing. The final pixel spacing in the geocoded products was 10 m. Furthermore, the corresponding feature vectors, for a total of 347 sample points during the whole RADARSAT-2 acquisition period, were obtained based on the geolocation records of sample points.

2.3. Machine Learning Method Used in This Study

Due to the high capacity of prediction, machine learning methods are frequently used for classification and regression problems in remote sensing studies [74,75]. In particular, support vector regression (SVR) and random forest regression (RFR) are two representative examples in this domain and thus were considered for crop height retrieval in this study.

2.3.1. Support Vector Regression (SVR)

SVR is the application of a well-known support vector machine model in regression, which has been used in agricultural biophysical parameter estimation with remote sensing data [76,77,78]. The core idea of SVR is to find an optimal approximating hyperplane to distinguish the input vectors and the predictor variables based on training data, which could be determined by solving a convex optimization problem. Theoretically, it is designed to establish an optimal linear separator and hence is suitable for linear data distribution. However, SVR can also handle nonlinear data distributions after embedded into a kernel framework. With a kernel function, the training data are nonlinearly transformed from the original space to a higher dimensional feature space. In this new space, the new data are better conditioned to make use of a linear separator.
In this study, an IDL-based tool, named imageSVM developed as a non-commercial product at the Geomatics Lab of Humboldt-Universität zu Berlin, specifically designed for support vector machine classification and regression analysis of remote sensing data, was used to carry out the SVR analysis. The radial basis function kernel, called the RBF kernel or the Gaussian kernel, was adopted in the regression model. The kernel parameter, the regularization parameter, and the Epsilon loss function parameter were required to be set to parameterize the SVR. A cross-validation strategy to tune these three parameters was used to reduce model overfitting.

2.3.2. Random Forest Regression (RFR)

Unlike SVR, RFR is an ensemble learning method, which uses the subset of the training data to construct a set of decision trees and adopts various non-parametric predictive models [79]. Like the morphological structure of a real tree, a decision tree includes a root node, multiple internal nodes (splits), and various terminal nodes (leaves). Starting from the root to a leaf, a set of decision rules is applied to subdivide the training data into smaller subsets. The predictor variable is assigned as the leaf node. A strategy of bootstrap sampling with replacement is used in RFR to create each individual decision tree. The excluded samples, called out-of-bag samples, are used for model validation. The final prediction is generated by averaging the results from individual decision trees to obtain better prediction performance. In addition, a beneficial property of RFR is that it can also provide estimates of variable importance in the regression, which allows for a better understanding of the sensitivity of the input features to the predictor variable.
In this study, an IDL-based tool, imageRF [80], particularly designed for random forest classification and regression analysis of remote sensing image data, was used to carry out the RFR analysis. The number of decision trees was set to 200 based on the overall consideration of the prediction accuracy and computation time. In the bootstrap sampling for each decision tree, one-third of the training data were set as out-of-bag samples for independent validation. The number of randomly selected features at each split node was determined by the square root of all features.

2.3.3. Experimental Design

From Table 1, it is evident that the corn was very short on 23 May, 2 June, and 16 June in 2013. The main backscattering contribution came from the soil, influenced by soil residual and tillage on these dates. The ground photos corresponding to these dates are shown in Figure 2. For this reason, we first carried out the tests by excluding these three observations. Consequently, 16 RADARSAT-2 images and 310 corn sample points were used. In addition, we performed tests involving these three images and all sample points (i.e., 19 RADARSAT-2 images and 347 sample points) and compared the results. In order to construct a regression model, training samples were required for model calibration. For this purpose, the sample points collected from multi-year fieldwork campaigns were randomly divided into two parts. A portion of the samples was dedicated to training samples for model calibration, while the remaining samples were assigned as testing samples for model validation. In our study, 80% of samples were used for calibration and 20% for validation. It is a common fact that the calibration and prediction accuracies of the regression model can be affected by features of training and testing samples including their distribution and numbers. In order to simulate more random scenarios and investigate the uncertainty of the accuracy, a strategy of bootstrap sampling with replacement was used. As some training samples may have been recycled using this strategy, samples were generated multiple times by random sampling to reduce bias in height estimation [81]. The entire sample points including both short and tall corn heights were considered in the bootstrap sampling method. In total, 10 realizations of random sampling (hereafter named as scenarios) were carried out for generating different datasets for training and testing.

3. Results.

3.1. Comparison between SVR and RFR

For each dataset, SVR and RFR were conducted. The statistical analysis of model calibration and validation for each dataset with both regression algorithms is shown in Table 3. As expected, results appear varied in different scenarios, which can be attributed to the dependence of the regression model accuracies on the training and testing sets. The results of RFR were generally better than the ones of SVR. RFR obtained overall lower values of root mean square error (RMSE) and higher values of Pearson correlation coefficient (R), despite an opposite behavior for scenario 8. For model calibration, the differences between the performances of both regression algorithms were notable (average RMSE = 22.36 cm for RFR and RMSE = 44.12 cm for SVR), whereas small differences were observed for model validation (average RMSE = 50.40 cm for RFR and RMSE = 54.69 cm for SVR).
In addition to the statistical indices shown in Table 3, Figure 3 illustrates the scatter plots of measured and predicted corn height obtained with both SVR and RFR methods in scenario 2, in which the regression produced the overall best accuracies. The RFR results exhibited a higher correlation than the results from SVR in both model calibration (see Figure 3a,c) and model validation (see Figure 3b,d). In detail, the model of RFR, in general, yielded overestimated values for lower corn height, while underestimation was observed for higher corn height (taller than around 225 cm). The SVR model generated overestimation and underestimation results for either lower corn height or for higher corn height, while a larger underestimation appeared for higher corn height.

3.2. Normalized Variable Importance of RFR

As explained above, the Random Forest algorithm can provide the relative importance of different input variables to the classification or regression accuracy. Therefore, this interesting property of RFR was been also used for further analysis.
Since RFR was carried out under 10 different scenarios and 27 input polarimetric observables were selected in our study, the output values of normalized variable importance of RFR regression for each scenario were variable. It is difficult and unfair to analyze the variable importance using values from one specific scenario. For this reason, Figure 4 shows the ranking of the importance of each polarimetric observables in the regression based on the average values of output normalized variable importance for the 10 scenarios. For better visualization, the parameters belonging to the same or similar group are marked with the same color. The contribution of the double-bounce component (Pd) from the Freeman-Durden decomposition was the most important variable among the selected 27 polarimetric observables. It demonstrated a strong correlation between double-bounce scattering and crop height. Moreover, considering the polarimetric decompositions used in this study, parameters from the Freeman-Durden decomposition appeared to be more relevant than parameters from the Neumann and Cloude-Pottier decompositions. The reasons may be attributed to the nature of the Freeman-Durden decomposition models, which are physically based rather than purely mathematical as in the Cloude-Pottier decomposition. Hence, its applicability is more general than the Neumann decomposition that assumes the dominance of volume scattering, which is not always the case for crops. In particular, the contribution of the volume scattering component (Pv) from the Freeman-Durden decomposition took second place in the variable importance ranking, just after Pd. Notably, the polarimetric anisotropy contributed more entropy and alpha angle from Cloude-Pottier decomposition.
The magnitude of the degree of orientation randomness of the particle scattering anisotropy was the most important variable from the Neumann decomposition. Among the backscattering coefficients, C22 played the most important role, which was highly related to the volume scattering component Pv, and took third place in the ranking. C33, which represented the VV polarization, showed less contribution and was placed in the last position among the linear backscattering coefficients. T22 was the second most important, and was highly related to the double-bounce scattering component Pd, which took fourth place. The total scattering power SPAN ranked behind T22 but was still at the front position. Among the backscattering ratios, HV/HH was the most important parameter, and the corresponding contribution was significantly larger than the other two linear ratios. Although co-polar correlation magnitude | ρ H H V V | took up a middle position similar to RVI in the ranking, most correlation magnitudes and polarimetric phases contributed less to the regression and were found at the end of the importance list. From the aforementioned analysis, it was clear to see that SAR parameters related to the double bounce and volume scattering components (e.g., Pd, Pv, C22, T22) showed high sensitivity to crop height and strongly drove height estimation for the RFR method. The reasons may be attributed to the nature of the scattering mechanisms which interact differently with different plant structures (e.g., stem, flower, leaf, tassel) as the crop development advances. For example, double-bounce and volume scattering components vary significantly with crop growth stages, which generally are low in the early stage and high in later development stages (e.g., stem elongation, tassel, and stigmata emergence).

4. Discussion

4.1. Tests with Fewer Polarimetric Observables

It is well known that using more features does not always generate better regression accuracies because of redundant or correlated information. Therefore, it is interesting to further check the regression accuracies in case fewer polarimetric observables can be used. From the diagram of variable importance ranking shown in Figure 3, the contributions of the first six polarimetric variables are obviously larger than the rest of the variables. Therefore, additional RFR tests were conducted by using only the first six polarimetric variables (i.e., Pd, Pv, C22, T22, HV/HH, and SPAN). At the same time, corresponding SVR tests were also carried out for comparison purposes. The statistical analyses of model calibration and validation for 10 different scenarios with SVR and RFR are presented in Table 4. It is apparent that regression estimation accuracies of SVR and RFR improve with respect to the previous results for each scenario in Table 3, even though sometimes model calibration accuracies decrease slightly. As in the previous results, RFR provides better accuracies than SVR in most scenarios. The differences between the two machine learning regression methods are smaller in the case of using fewer polarimetric variables. The average values of RMSE and R with RFR prediction are 47.76 cm and 0.79, while the corresponding values are 47.90 cm and 0.78 with SVR prediction, respectively. The best regression estimation results were produced in scenario 1 with RFR. The corresponding values of RMSE and R reach 42.69 cm and 0.84, respectively. To further analyze the results in scenario 1, the scatterplots of measured and estimated corn height with SVR and RFR methods are presented in Figure 5. The patterns of model calibration are similar to the ones shown in Figure 3. The overall distributions of RFR and SVR estimation results show similar patterns, in which overestimation is mostly observed for lower corn height, and underestimation is observed for taller corn height. To analyze the SVR and RFR results at different stages of corn growth and maintain sufficient samples for statistics, the simple division method for growth stages of corn proposed in [48] was adopted. The corn height of less than 150 cm is addressed as the early stage, and height that is taller than 150 cm is defined as the later stage [48]. After calculation of values of RMSE and R, statistics of model validation at an early stage and later stage for 10 different scenarios with SVR and RFR are presented in Table 5. It is clear to see that the results of SVR and RFR are better at the later stage than the ones at the early stage.
Based on the regression model, a corn height map can be generated on each date. For example, the map of estimated corn height on 15 September 2018 is presented in Figure 6. Locations of four cornfields and eight sample points in each field are marked on the map. The estimated corn height ranges between 80 and 295 cm. Considering most cornfields are at the late crop growing stage on that date, the results of estimated height are underestimated which is consistent with the previous analysis. In addition, the values of estimated height somehow show variation among different fields, which can be attributed to the diversity of cornfield conditions, such as soil moisture and roughness, topography, precipitation, and fertilization. Table 6 shows the measured and estimated corn heights in 32 sample points. The difference is very small at some points and large at other points. The calculated RMSE and R values are 32.61 cm and 0.59, respectively.

4.2. Tests with All Images Including Very Short Corn Height

In previous tests, the first three images in 2013 are excluded due to their very short corn height. It is interesting to check the performance when all images, including very short corn height, are used. Ten realizations of SVR and RFR tests were carried out again, respectively. As in previous tests, 80% of samples were used for calibration and 20% for validation. However, 19 RADARSAT-2 images and 347 corn samples were used this time. The statistical analysis of model calibration and validation for each dataset with both regression algorithms is shown in Table 7. As expected, the results still show variation in different scenarios and the results of RFR are better than SVR. However, the values of RMSE and R are worse than the results in Table 3 (using 16 RADARSAT-2 images with 310 corn samples). The average values of RMSE and R for RFR in model validation are 54.55 cm and 0.83, and the values for SVR are 56.75 cm and 0.80, respectively. It somehow indicates the limitation of this kind of regression methods, i.e., very short height will affect the accuracy of estimation.

4.3. Limitations and Future Research

In this study, two common machine learning techniques used for scientific purposes, i.e., RFR and SVR, were evaluated for crop height estimation of corn from multi-year RADARSAT-2 polarimetric observables. There are some limitations to these methods. Firstly, the methods depend on the availability of a large number of sample data and a good sample distribution. In our case, although the number of samples is large, corn samples have more values of heights taller than 2 m and fewer values between 1 m and 2 m. This might affect the accuracy of model calibration and estimation. Secondly, the output results somehow show overestimation and underestimation. The results show worse performance at the early growth stages, especially in the case of estimating very short crop height. Applying piecewise regression methods at different growing stages may improve the results. Third, the PolSAR data are acquired in different modes with different incidence angles, which might also influence the accuracy of estimation. Additionally, field conditions such as soil moisture, irrigation, and fertilization, may also affect the estimation results.
Future work will focus on testing these techniques for height estimation of other crop types, such as wheat, soybean, and rice. Moreover, it is worth investigating how changes in crop physiology associated with crop development and precipitation (or irrigation regimes) could contribute to changes in the priority of different PolSAR parameters as the crop matures along the full season. Tests and analysis with PolSAR data at other radar frequencies (such as TerraSAR-X at X band, ALOS-2 at L band) over different sites will also be investigated in future research.

5. Conclusions

This study presents a demonstration of crop height retrieval based on space-borne PolSAR data with machine learning techniques. The techniques have been tested with RADARSAT-2 data in cornfields covering the whole corn growing period. The potential of two popular machine learning regression algorithms (SVR and RFR) was investigated, including an identification of the relevant features by means of the normalized variable importance of RFR. A set of 27 representative PolSAR observables was initially selected and analyzed in this work. The results show a satisfactory prediction performance for corn height mapping at a large scale, with RMSE around 40–50 cm considering the whole growth cycle, with corn height over 3 m at late stages. The RFR approach overall outperforms the SVR method in all tests. The best result is generated by the RFR method when selecting a subset of six polarimetric features, with an RMSE of 42.8 cm, which indicates that fewer selected polarimetric features can generate better results than using all features. Regarding the analysis of the relative importance of all polarimetric features considered, results highlight that parameters related to double-bounce and volume scattering are the most important polarimetric features for corn height estimation. In addition, the HV/HH ratio appears to be the most representative among all three backscattering ratios. Compared with past studies on crop height retrieval with SAR data, this research provides a potentially efficient method and a new perspective on the use of PolSAR data.

Author Contributions

Conceptualization, Q.X. and J.W.; methodology, Q.X.; software, Q.X.; validation, Q.X. and C.L.; formal analysis, Q.X., J.W. and X.P.; investigation, Q.X. and J.W.; resources, J.W.; data curation, Q.X., C.L., J.W. and J.S.; writing—original draft preparation, Q.X.; writing—review and editing, J.W., J.M.L.-S., J.Z., J.S., X.P. and J.D.B.-B.; visualization, Q.X.; supervision, J.W.; project administration, Q.X. and J.W.; funding acquisition, Q.X., J.W., J.Z., H.F. and J.M.L.-S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded in part by the National Natural Science Foundation of China (Grant No. 41804004, 41820104005, 41531068, 41904004), the Canadian Space Agency SOAR-E program (Grant No. SOAR-E-5489), the Fundamental Research Funds for the Central Universities, China University of Geosciences (Wuhan) (Grant No. CUG190633), and the Spanish Ministry of Science, Innovation and Universities, State Research Agency (AEI) and the European Regional Development Fund under project TEC2017-85244-C2-1-P.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Acknowledgments

RADARSAT-2 Data and Products © MacDonald, Dettwiler and Associates Ltd. (2013, 2015, 2018)—All Rights Reserved. RADARSAT is an official trademark of the Canadian Space Agency. The authors would like to thank the Canadian Space Agency and Agriculture and Agri- Food Canada for providing the RADARSAT-2 data, and Yang Song, Yu Bo, Xiaodong Huang from GITA lab, UWO for their help with field work. In addition, the authors acknowledge A&L Canada Inc. for the access to the corn fields.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Erten, E.; Lopez-Sanchez, J.M.; Yuzugullu, O.; Hajnsek, I. Retrieval of agricultural crop height from space: A comparison of SAR techniques. Remote Sens. Environ. 2016, 187, 130–144. [Google Scholar] [CrossRef] [Green Version]
  2. Lopez-Sanchez, J.M.; Ballester-Berman, J.D. Potentials of polarimetric SAR interferometry for agriculture monitoring. Radio Sci. 2009, 44, 1–20. [Google Scholar] [CrossRef] [Green Version]
  3. Lopez-Sanchez, J.M.; Vicente-Guijalba, F.; Erten, E.; Campos-Taberner, M.; Garcia-Haro, F.J. Retrieval of vegetation height in rice fields using polarimetric SAR interferometry with TanDEM-X data. Remote Sens. Environ. 2017, 192, 30–44. [Google Scholar] [CrossRef] [Green Version]
  4. Yuzugullu, O.; Erten, E.; Hajnsek, I. Assessment of Paddy Rice Height: Sequential Inversion of Coherent and Incoherent Models. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2018, 11, 3001–3013. [Google Scholar] [CrossRef]
  5. Karam, M.; Amar, F.; Fung, A.K.; Mougin, E.; Lopes, A.; Le Vine, D.M.; Beaudoin, A. A microwave polarimetric scattering model for forest canopies based on vector radiative transfer theory. Remote Sens. Environ. 1995, 53, 16–30. [Google Scholar] [CrossRef]
  6. Karam, M.; Fung, A.; Lang, R.; Chauhan, N. A microwave scattering model for layered vegetation. IEEE Trans. Geosci. Remote Sens. 1992, 30, 767–784. [Google Scholar] [CrossRef] [Green Version]
  7. Karam, M.; Fung, A.; Antar, Y. Electromagnetic wave scattering from some vegetation samples. IEEE Trans. Geosci. Remote Sens. 1988, 26, 799–808. [Google Scholar] [CrossRef] [Green Version]
  8. Liu, Y.; Chen, K.-S.; Xu, P.; Li, Z.-L. Modeling and Characteristics of Microwave Backscattering From Rice Canopy over Growth Stages. IEEE Trans. Geosci. Remote Sens. 2016, 54, 6757–6770. [Google Scholar] [CrossRef]
  9. Toure, A.; Thomson, K.; Edwards, G.; Brown, R.; Brisco, B. Adaptation of the MIMICS backscattering model to the agricultural context-wheat and canola at L and C bands. IEEE Trans. Geosci. Remote Sens. 1994, 32, 47–61. [Google Scholar] [CrossRef]
  10. Ulaby, F.T.; Sarabandi, K.; McDonald, K.; Whitt, M.; Dobson, M.C. Michigan microwave canopy scattering model. Int. J. Remote Sens. 1990, 11, 1223–1253. [Google Scholar] [CrossRef]
  11. Ni, W.; Sun, G.; Ranson, K.J.; Zhang, Z.; He, Y.; Huang, W.; Guo, Z. Model-Based Analysis of the Influence of Forest Structures on the Scattering Phase Center at L-Band. IEEE Trans. Geosci. Remote Sens. 2013, 52, 3937–3946. [Google Scholar] [CrossRef]
  12. Zhang, Y.; Liu, X.; Su, S.; Wang, C. Retrieving canopy height and density of paddy rice from Radarsat-2 images with a canopy scattering model. Int. J. Appl. Earth Obs. Geoinf. 2014, 28, 170–180. [Google Scholar] [CrossRef]
  13. Yuzugullu, O.; Marelli, S.; Erten, E.; Sudret, B.; Hajnsek, I. Determining Rice Growth Stage with X-Band SAR: A Metamodel Based Inversion. Remote Sens. 2017, 9, 460. [Google Scholar] [CrossRef] [Green Version]
  14. Yuzugullu, O.; Erten, E.; Hajnsek, I. Estimation of Rice Crop Height from X- and C-Band PolSAR by Metamodel-Based Optimization. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2016, 10, 194–204. [Google Scholar] [CrossRef] [Green Version]
  15. Attema, E.P.W.; Ulaby, F.T. Vegetation modeled as a water cloud. Radio Sci. 1978, 13, 357–364. [Google Scholar] [CrossRef]
  16. Yang, Z.; Li, K.; Shao, Y.; Brisco, B.; Liu, L. Estimation of Paddy Rice Variables with a Modified Water Cloud Model and Improved Polarimetric Decomposition Using Multi-Temporal RADARSAT-2 Images. Remote Sens. 2016, 8, 878. [Google Scholar] [CrossRef] [Green Version]
  17. Homayouni, S.; McNairn, H.; Hosseini, M.; Jiao, X.; Powers, J. Quad and compact multitemporal C-band PolSAR observations for crop characterization and monitoring. Int. J. Appl. Earth Obs. Geoinf. 2019, 74, 78–87. [Google Scholar] [CrossRef]
  18. Mandal, D.; Kumar, V.; McNairn, H.; Bhattacharya, A.; Rao, Y. Joint estimation of Plant Area Index (PAI) and wet biomass in wheat and soybean from C-band polarimetric SAR data. Int. J. Appl. Earth Obs. Geoinf. 2019, 79, 24–34. [Google Scholar] [CrossRef]
  19. Baghdadi, N.; El Hajj, M.; Zribi, M.; Bousbih, S. Calibration of the Water Cloud Model at C-Band for Winter Crop Fields and Grasslands. Remote Sens. 2017, 9, 969. [Google Scholar] [CrossRef] [Green Version]
  20. Bamler, R.; Hartl, P. Synthetic aperture radar interferometry. Inverse Probl. 1998, 14, R1–R54. [Google Scholar] [CrossRef]
  21. Fu, H.; Zhu, J.; Wang, C.; Zhao, R.; Xie, Q. Underlying Topography Estimation Over Forest Areas Using Single-Baseline InSAR Data. IEEE Trans. Geosci. Remote Sens. 2018, 57, 2876–2888. [Google Scholar] [CrossRef]
  22. Fu, H.; Wang, C.; Zhu, J.; Xie, Q.; Zhang, B. Estimation of Pine Forest Height and Underlying DEM Using Multi-Baseline P-Band PolInSAR Data. Remote Sens. 2016, 8, 820. [Google Scholar] [CrossRef] [Green Version]
  23. Liao, Z.; He, B.; Van Dijk, A.I.; Bai, X.; Quan, X. The impacts of spatial baseline on forest canopy height model and digital terrain model retrieval using P-band PolInSAR data. Remote Sens. Environ. 2018, 210, 403–421. [Google Scholar] [CrossRef]
  24. Fu, H.; Zhu, J.; Wang, C.; Li, Z. Underlying topography extraction over forest areas from multi-baseline PolInSAR data. J. Geod. 2018, 92, 727–741. [Google Scholar] [CrossRef]
  25. Peng, X.; Li, X.; Wang, C.; Fu, H.; Du, Y. A Maximum Likelihood Based Nonparametric Iterative Adaptive Method of Synthetic Aperture Radar Tomography and Its Application for Estimating Underlying Topography and Forest Height. Sensors 2018, 18, 2459. [Google Scholar] [CrossRef] [Green Version]
  26. Papathanassiou, K.; Cloude, S. Single-baseline polarimetric SAR interferometry. IEEE Trans. Geosci. Remote Sens. 2001, 39, 2352–2363. [Google Scholar] [CrossRef] [Green Version]
  27. Cloude, S.R.; Papathanassiou, K.P. Polarimetric SAR interferometry. IEEE Trans. Geosci. Remote Sens. 1998, 36, 1551–1565. [Google Scholar] [CrossRef]
  28. Xie, Q.; Zhu, J.; Wang, C.; Fu, H.; Lopez-Sanchez, J.M.; Ballester-Berman, J.D. A Modified Dual-Baseline PolInSAR Method for Forest Height Estimation. Remote Sens. 2017, 9, 819. [Google Scholar] [CrossRef] [Green Version]
  29. Fu, H.; Wang, C.; Zhu, J.; Xie, Q.; Zhao, R. Inversion of vegetation height from PolInSAR using complex least squares adjustment method. Sci. China Earth Sci. 2015, 58, 1018–1031. [Google Scholar] [CrossRef]
  30. Garestier, F.; Dubois-Fernandez, P.C.; Papathanassiou, K.P. Pine Forest Height Inversion Using Single-Pass X-Band PolInSAR Data. IEEE Trans. Geosci. Remote Sens. 2007, 46, 59–68. [Google Scholar] [CrossRef]
  31. Garestier, F.; Dubois-Fernandez, P.; Champion, I. Forest Height Inversion Using High-Resolution P-Band Pol-InSAR Data. IEEE Trans. Geosci. Remote Sens. 2008, 46, 3544–3559. [Google Scholar] [CrossRef]
  32. Hajnsek, I.; Kugler, F.; Lee, S.-K.; Papathanassiou, K.P. Tropical-Forest-Parameter Estimation by Means of Pol-InSAR: The INDREX-II Campaign. IEEE Trans. Geosci. Remote Sens. 2009, 47, 481–493. [Google Scholar] [CrossRef] [Green Version]
  33. Garestier, F.; Dubois-Fernandez, P.; Champion, I.; Le Toan, T. Pine forest investigation using high resolution P-band Pol-InSAR data. Remote Sens. Environ. 2011, 115, 2897–2905. [Google Scholar] [CrossRef]
  34. Kugler, F.; Schulze, D.; Hajnsek, I.; Pretzsch, H.; Papathanassiou, K.P. TanDEM-X Pol-InSAR Performance for Forest Height Estimation. IEEE Trans. Geosci. Remote Sens. 2014, 52, 6404–6422. [Google Scholar] [CrossRef]
  35. Praks, J.; Kugler, F.; Papathanassiou, K.P.; Hajnsek, I.; Hallikainen, M. Height Estimation of Boreal Forest: Interferometric Model-Based Inversion at L- and X-Band versus HUTSCAT Profiling Scatterometer. IEEE Geosci. Remote Sens. Lett. 2007, 4, 466–470. [Google Scholar] [CrossRef]
  36. Neumann, M.; Ferro-Famil, L.; Reigber, A. Estimation of Forest Structure, Ground, and Canopy Layer Characteristics from Multibaseline Polarimetric Interferometric SAR Data. IEEE Trans. Geosci. Remote Sens. 2009, 48, 1086–1104. [Google Scholar] [CrossRef] [Green Version]
  37. Zhang, B.; Fu, H.; Zhu, J.; Peng, X.; Xie, Q.; Lin, D.; Liu, Z. A Multibaseline PolInSAR Forest Height Inversion Model Based on Fourier-Legendre Polynomials. IEEE Geosci. Remote Sens. Lett. 2020, 1–5. [Google Scholar] [CrossRef]
  38. Xie, Y.; Fu, H.; Zhu, J.; Wang, C.; Xie, Q. A LiDAR-Aided Multibaseline PolInSAR Method for Forest Height Estimation: With Emphasis on Dual-Baseline Selection. IEEE Geosci. Remote Sens. Lett. 2020, 17, 1807–1811. [Google Scholar] [CrossRef]
  39. Kumar, S.; Govil, H.; Srivastava, P.K.; Thakur, P.K.; Kushwaha, S.P.S. Spaceborne Multifrequency PolInSAR-Based Inversion Modelling for Forest Height Retrieval. Remote Sens. 2020, 12, 4042. [Google Scholar] [CrossRef]
  40. Zhang, Z.; Wang, C.; Zhang, H.; Tang, Y.; Liu, X. Analysis of Permafrost Region Coherence Variation in the Qinghai–Tibet Plateau with a High-Resolution TerraSAR-X Image. Remote Sens. 2018, 10, 298. [Google Scholar] [CrossRef] [Green Version]
  41. Sagues, L.; Lopez-Sanchez, J.M.; Fortuny, J.; Fabregas, X.; Broquetas, A.; Sieber, A. Indoor experiments on polarimetric SAR interferometry. IEEE Trans. Geosci. Remote Sens. 2000, 38, 671–684. [Google Scholar] [CrossRef]
  42. Ballester-Berman, J.; Lopez-Sanchez, J.M.; Fortuny-Guasch, J. Retrieval of biophysical parameters of agricultural crops using polarimetric SAR interferometry. IEEE Trans. Geosci. Remote Sens. 2005, 43, 683–694. [Google Scholar] [CrossRef]
  43. Lopez-Sanchez, J.M.; Hajnsek, I.; Ballester-Berman, J.D. First Demonstration of Agriculture Height Retrieval with PolInSAR Airborne Data. IEEE Geosci. Remote Sens. Lett. 2012, 9, 242–246. [Google Scholar] [CrossRef]
  44. Pichierri, M.; Hajnsek, I.; Papathanassiou, K.P. A Multibaseline Pol-InSAR Inversion Scheme for Crop Parameter Estimation at Different Frequencies. IEEE Trans. Geosci. Remote Sens. 2016, 54, 4952–4970. [Google Scholar] [CrossRef] [Green Version]
  45. Nasirzadehdizaji, R.; Balik Sanli, F.; Abdikan, S.; Cakir, Z.; Sekertekin, A.; Ustuner, M. Sensitivity Analysis of Multi-Temporal Sentinel-1 SAR Parameters to Crop Height and Canopy Coverage. Appl. Sci. 2019, 9, 655. [Google Scholar] [CrossRef] [Green Version]
  46. Jiao, X.; McNairn, H.; Shang, J.; Pattey, E.; Liu, J.; Champagne, C. The sensitivity of RADARSAT-2 polarimetric SAR data to corn and soybean leaf area index. Can. J. Remote Sens. 2011, 37, 69–81. [Google Scholar] [CrossRef]
  47. Erten, E.; Taskin, G.; Lopez-Sanchez, J.M. Selection of PolSAR Observables for Crop Biophysical Variable Estimation with Global Sensitivity Analysis. IEEE Geosci. Remote Sens. Lett. 2019, 16, 766–770. [Google Scholar] [CrossRef]
  48. Liao, C.; Wang, J.; Shang, J.; Huang, X.; Liu, J.; Huffman, T. Sensitivity study of Radarsat-2 polarimetric SAR to crop height and fractional vegetation cover of corn and wheat. Int. J. Remote Sens. 2017, 39, 1475–1490. [Google Scholar] [CrossRef]
  49. Sarti, M.; Migliaccio, M.; Nunziata, F.; Mascolo, L.; Brugnoli, E. On the sensitivity of polarimetric SAR measurements to vegetation cover: The Coiba National Park, Panama. Int. J. Remote Sens. 2017, 38, 6755–6768. [Google Scholar] [CrossRef]
  50. Hosseini, M.; McNairn, H.; Merzouki, A.; Pacheco, A. Estimation of Leaf Area Index (LAI) in corn and soybeans using multi-polarization C- and L-band radar data. Remote Sens. Environ. 2015, 170, 77–89. [Google Scholar] [CrossRef]
  51. Wiseman, G.; McNairn, H.; Homayouni, S.; Shang, J. RADARSAT-2 Polarimetric SAR Response to Crop Biomass for Agricultural Production Monitoring. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2014, 7, 4461–4471. [Google Scholar] [CrossRef]
  52. Reisi-Gahrouei, O.; Homayouni, S.; McNairn, H.; Hosseini, M.; Safari, A. Crop biomass estimation using multi regression analysis and neural networks from multitemporal L-band polarimetric synthetic aperture radar data. Int. J. Remote Sens. 2019, 40, 6822–6840. [Google Scholar] [CrossRef]
  53. Mandal, D.; Kumar, V.; Lopez-Sanchez, J.M.; Bhattacharya, A.; McNairn, H.; Rao, Y.S. Crop biophysical parameter retrieval from Sentinel-1 SAR data with a multi-target inversion of Water Cloud Model. Int. J. Remote Sens. 2020, 41, 5503–5524. [Google Scholar] [CrossRef]
  54. Mandal, D.; Hosseini, M.; McNairn, H.; Kumar, V.; Bhattacharya, A.; Rao, Y.; Mitchell, S.; Robertson, L.D.; Davidson, A.; Dabrowska-Zielinska, K. An investigation of inversion methodologies to retrieve the leaf area index of corn from C-band SAR data. Int. J. Appl. Earth Obs. Geoinf. 2019, 82, 101893. [Google Scholar] [CrossRef]
  55. Fieuzal, R.; Baup, F. Estimation of leaf area index and crop height of sunflowers using multi-temporal optical and SAR satellite data. Int. J. Remote Sens. 2016, 37, 2780–2809. [Google Scholar] [CrossRef]
  56. Yang, H.; Chen, E.; Li, Z.; Yang, G.; Xu, X.; Yuan, L.; Feng, Q.; Zhao, L. Capability of multi-temporal Radarsat-2 data in monitoring canola crop and its plant height. In Proceedings of the 2014 the Third International Conference on Agro-Geoinformatics, Beijing, China, 11–14 August 2014; pp. 1–5. [Google Scholar]
  57. Canisius, F.; Shang, J.; Liu, J.; Huang, X.; Ma, B.; Jiao, X.; Geng, X.; Kovacs, J.M.; Walters, D. Tracking crop phenological development using multi-temporal polarimetric Radarsat-2 data. Remote Sens. Environ. 2018, 210, 508–518. [Google Scholar] [CrossRef]
  58. El Hajj, M.; Baghdadi, N.; Bazzi, H.; Zribi, M. Penetration Analysis of SAR Signals in the C and L Bands for Wheat, Maize, and Grasslands. Remote Sens. 2018, 11, 31. [Google Scholar] [CrossRef] [Green Version]
  59. Ballester-Berman, J.D.; Lopez-Sanchez, J.M.; Fortuny-Guasch, J. Retrieval of height and topography of corn fields by polarimetric SAR interferometry. In Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, IGARSS’04, Anchorage, AK, USA, 20–24 September 2004; Volume 2, pp. 1228–1231. [Google Scholar]
  60. Liu, C.; Shang, J.; Vachon, P.W.; McNairn, H. Multiyear Crop Monitoring Using Polarimetric RADARSAT-2 Data. IEEE Trans. Geosci. Remote Sens. 2012, 51, 2227–2240. [Google Scholar] [CrossRef]
  61. Cloude, S.R. Polarisation: Applications in Remote Sensing Polarisation; Oxford University Press: New York, NY, USA, 2010; ISBN 978-0-19-956973-1. [Google Scholar]
  62. Lee, J.; Pottier, E. Polarimetric Radar Imaging: From Basics to Applications; CRC Press: Boca Raton, FL, USA, 2009; ISBN 142005497X. [Google Scholar]
  63. Vicente-Guijalba, F.; Martinez-Marin, T.; Lopez-Sanchez, J.M. Dynamical Approach for Real-Time Monitoring of Agricultural Crops. IEEE Trans. Geosci. Remote Sens. 2014, 53, 3278–3293. [Google Scholar] [CrossRef] [Green Version]
  64. Huapeng, L.; Zhang, C.; Zhang, S.; Atkinson, P.M. Full year crop monitoring and separability assessment with fully-polarimetric L-band UAVSAR: A case study in the Sacramento Valley, California. Int. J. Appl. Earth Obs. Geoinf. 2019, 74, 45–56. [Google Scholar] [CrossRef] [Green Version]
  65. Lopez-Sanchez, J.M.; Ballester-Berman, J.D.; Hajnsek, I. First Results of Rice Monitoring Practices in Spain by Means of Time Series of TerraSAR-X Dual-Pol Images. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2010, 4, 412–422. [Google Scholar] [CrossRef]
  66. Lopez-Sanchez, J.M.; Cloude, S.R.; Ballester-Berman, J.D. Rice Phenology Monitoring by Means of SAR Polarimetry at X-Band. IEEE Trans. Geosci. Remote Sens. 2012, 50, 2695–2709. [Google Scholar] [CrossRef]
  67. Lopez-Sanchez, J.M.; Vicente-Guijalba, F.; Ballester-Berman, J.D.; Cloude, S.R. Influence of Incidence Angle on the Coherent Copolar Polarimetric Response of Rice at X-Band. IEEE Geosci. Remote Sens. Lett. 2014, 12, 249–253. [Google Scholar] [CrossRef] [Green Version]
  68. Cloude, S.; Pottier, E. An entropy based classification scheme for land applications of polarimetric SAR. IEEE Trans. Geosci. Remote Sens. 1997, 35, 68–78. [Google Scholar] [CrossRef]
  69. Neumann, M.; Ferro-Famil, L.; Jager, M.; Reigber, A.; Pottier, E. A polarimetric vegetation model to retrieve particle and orientation distribution characteristics. In Proceedings of the 2009 IEEE International Geoscience and Remote Sensing Symposium, Cape Town, South Africa, 12–17 July 2009; Volume 4, p. IV-145. [Google Scholar]
  70. Neumann, M.; Saatchi, S.S.; Ulander, L.M.H.; Fransson, J.E.S. Assessing Performance of L- and P-Band Polarimetric Interferometric SAR Data in Estimating Boreal Forest Above-Ground Biomass. IEEE Trans. Geosci. Remote Sens. 2012, 50, 714–726. [Google Scholar] [CrossRef]
  71. Xie, Q.; Zhu, J.; Lopez-Sanchez, J.M.; Wang, C.; Fu, H. A Modified General Polarimetric Model-Based Decomposition Method with the Simplified Neumann Volume Scattering Model. IEEE Geosci. Remote Sens. Lett. 2018, 15, 1229–1233. [Google Scholar] [CrossRef] [Green Version]
  72. Xie, Q.; Wang, J.; Liao, C.; Shang, J.; Lopez-Sanchez, J.M.; Fu, H.; Liu, X. On the Use of Neumann Decomposition for Crop Classification Using Multi-Temporal RADARSAT-2 Polarimetric SAR Data. Remote Sens. 2019, 11, 776. [Google Scholar] [CrossRef] [Green Version]
  73. Kim, Y.; Van Zyl, J. Comparison of forest parameter estimation techniques using SAR data. In Proceedings of the IGARSS 2001, Scanning the Present and Resolving the Future, IEEE 2001 International Geoscience and Remote Sensing Symposium (Cat. No.01CH37217), Sydney, NSW, Australia, 9–13 July 2001; Volume 3, pp. 1395–1397. [Google Scholar]
  74. Zhu, X.X.; Tuia, D.; Mou, L.; Xia, G.-S.; Zhang, L.; Xu, F.; Fraundorfer, F. Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources. IEEE Geosci. Remote Sens. Mag. 2017, 5, 8–36. [Google Scholar] [CrossRef] [Green Version]
  75. Zhang, L.; Zhang, L.; Du, B. Deep Learning for Remote Sensing Data: A Technical Tutorial on the State of the Art. IEEE Geosci. Remote Sens. Mag. 2016, 4, 22–40. [Google Scholar] [CrossRef]
  76. Tuia, D.; Verrelst, J.; Alonso, L.; Perez-Cruz, F.; Camps-Valls, G. Multioutput Support Vector Regression for Remote Sensing Biophysical Parameter Estimation. IEEE Geosci. Remote Sens. Lett. 2011, 8, 804–808. [Google Scholar] [CrossRef]
  77. Karimi, Y.; Prasher, S.O.; Madani, A.; Kim, S. Application of support vector machine technology for the estimation of crop biophysical parameters using aerial hyperspectral observations. Can. Biosyst. Eng. 2008, 50, 713–720. [Google Scholar]
  78. Siegmann, B.; Jarmer, T. Comparison of different regression models and validation techniques for the assessment of wheat leaf area index from hyperspectral data. Int. J. Remote Sens. 2015, 36, 4519–4534. [Google Scholar] [CrossRef]
  79. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
  80. Waske, B.; Van Der Linden, S.; Oldenburg, C.; Jakimow, B.; Rabe, A.; Hostert, P. Imager—A user-oriented implementation for remote sensing image analysis with Random Forests. Environ. Model. Softw. 2012, 35, 192–193. [Google Scholar] [CrossRef]
  81. Austin, P.C.; Tu, J.V. Bootstrap Methods for Developing Predictive Models. Am. Stat. 2004, 58, 131–137. [Google Scholar] [CrossRef]
Figure 1. Locations and Pauli RGB images of the two study sites (Stratford and London). (a) Geographical locations of the two study sites; (b) Pauli RGB image acquired on 26 June 2013 at the Stratford site; (c) Pauli RGB image acquired on 23 June 2015 at the London site; (d) Pauli RGB image acquired on 1 July 2018 at the London site. All boundaries of cornfields and sample points for collecting ground measurements are highlighted in green and red, respectively. (RADARSAT-2 Data and Products © MacDonald, Dettwiler and Associates Ltd. (2013, 2015, 2018)—All Rights Reserved. RADARSAT is an official trademark of the Canadian Space Agency).
Figure 1. Locations and Pauli RGB images of the two study sites (Stratford and London). (a) Geographical locations of the two study sites; (b) Pauli RGB image acquired on 26 June 2013 at the Stratford site; (c) Pauli RGB image acquired on 23 June 2015 at the London site; (d) Pauli RGB image acquired on 1 July 2018 at the London site. All boundaries of cornfields and sample points for collecting ground measurements are highlighted in green and red, respectively. (RADARSAT-2 Data and Products © MacDonald, Dettwiler and Associates Ltd. (2013, 2015, 2018)—All Rights Reserved. RADARSAT is an official trademark of the Canadian Space Agency).
Remotesensing 13 00392 g001
Figure 2. Field photos of corn in 2013. (a) 25 May; (b) 4 June; (c) 16 June.
Figure 2. Field photos of corn in 2013. (a) 25 May; (b) 4 June; (c) 16 June.
Remotesensing 13 00392 g002
Figure 3. Comparison between measured and estimated corn height with scenario 2 in Table 3: (a) calibration of the Random Forest Regression (RFR) model; (b) validation of the RFR model; (c) calibration of the Support Vector Regression (SVR) model; (d) validation of the SVR model.
Figure 3. Comparison between measured and estimated corn height with scenario 2 in Table 3: (a) calibration of the Random Forest Regression (RFR) model; (b) validation of the RFR model; (c) calibration of the Support Vector Regression (SVR) model; (d) validation of the SVR model.
Remotesensing 13 00392 g003
Figure 4. Normalized variable importance of RFR in regression for corn height based on averaged output values of 10 scenarios. Different types of polarimetric variables are represented as follows: Backscattering coefficients and SPAN (red), Correlation magnitude (light green), Polarimetric phase (blue), Backscattering ratios (orange), Freeman-Durden decomposition (purple), Cloude-Pottier decomposition (black), Neumann Decomposition (yellow) and RVI (green). A description of how variables were named can be found in Table 2.
Figure 4. Normalized variable importance of RFR in regression for corn height based on averaged output values of 10 scenarios. Different types of polarimetric variables are represented as follows: Backscattering coefficients and SPAN (red), Correlation magnitude (light green), Polarimetric phase (blue), Backscattering ratios (orange), Freeman-Durden decomposition (purple), Cloude-Pottier decomposition (black), Neumann Decomposition (yellow) and RVI (green). A description of how variables were named can be found in Table 2.
Remotesensing 13 00392 g004
Figure 5. Comparison between measured and estimated corn height with scenario 1 in Table 4: (a) calibration of the RFR model; (b) validation of the RFR model; (c) calibration of the SVR model; (d) validation of the SVR model.
Figure 5. Comparison between measured and estimated corn height with scenario 1 in Table 4: (a) calibration of the RFR model; (b) validation of the RFR model; (c) calibration of the SVR model; (d) validation of the SVR model.
Remotesensing 13 00392 g005
Figure 6. Map of corn height on 15 September 2018 based on RFR model prediction.
Figure 6. Map of corn height on 15 September 2018 based on RFR model prediction.
Remotesensing 13 00392 g006
Table 1. RADARSAT-2 Images and Ground Data Acquired for 2013, 2015, and 2018.
Table 1. RADARSAT-2 Images and Ground Data Acquired for 2013, 2015, and 2018.
DateModeIncidenceResolutionOrbitFieldwork DateNumber of Corn Sample Points Average Corn Height
(cm)
Study Site
23 May 2013FQ9W27.2 ~ 30.55.1 × 4.7Ascending24 May 201345.75Stratford
2 June 2013FQ19W37.7 ~ 40.44.7 × 4.7Ascending4 June 20131610.06
16 June 2013FQ9W27.2 ~ 30.55.1 × 4.7Ascending16 June 20131725.13
26 June 2013FQ19W37.7 ~ 40.44.7 × 4.7Ascending24 June 2013/
25 June 2013
1759.87
10 July 2013FQ9W27.2 ~ 30.55.1 × 4.7Ascending10 July 201317142.29
20 July 2013FQ19W37.7 ~ 40.44.7 × 4.7Ascending21 July 201311214.35
3 August 2013FQ9W27.2 ~ 30.55.1 × 4.7Ascending3 August 201313254.84
13 August 2013FQ19W37.7 ~ 40.44.7 × 4.7Ascending13 August 2013/
14 August 2013
17260.78
23 June 2015FQ10W28.4 ~ 31.65.5 × 4.7Ascending23 June 20152588.44London
10 August 2015FQ10W28.4 ~ 31.65.5 × 4.7Ascending11 August 20156266.61
3 September 2015FQ10W28.4 ~ 31.65.5 × 4.7Ascending3 September 20156265.61
13 September 2015FQ20W38.6 ~ 41.35.1 × 4.7Ascending13 September 20156276.72
1 July 2018FQ10W28.4 ~ 31.65.5 × 4.7Ascending4 July 201824182.07
25 July 2018FQ10W28.4 ~ 31.65.5 × 4.7Ascending25 July 201832252.76
1 August 2018FQ5W22.5 ~ 26.05.0 × 4.7Ascending2 August 201832275.22
18 August 2018FQ10W28.4 ~ 31.65.5 × 4.7Ascending18 August 201832267.77
25 August 2018FQ5W22.5 ~ 26.05.0 × 4.7Ascending25 August 20188214.99
1 September 2018FQ1W17.2 ~ 21.24.8 × 4.7Ascending1 September 201832267.04
15 September 2018FQ9W27.3 ~ 30.55.1 × 4.7Descending11 September 201832267.22
Table 2. List of 27 polarimetric observables selected in this study.
Table 2. List of 27 polarimetric observables selected in this study.
Polarimetric ObservableDescription
C11, C22, C33,Backscattering coefficients in the linear polarization channels
T11, T22Backscattering coefficients in the Pauli polarization channels
SPANTotal backscattering power
HHVV|, |ρHVVV|, |ρHHHV|, |ρHH+VV,HHVV|Correlation between polarimetric channels
ϕHHVV, ϕHVVV, ϕHHHV, ϕHH+VV,HHVVPhase difference between polarimetric channels
HH/VV, HV/HH, HV/VVBackscattering ratios
Ps, Pd, PvScattering Power from different scattering mechanisms derived from Freeman-Durden decomposition
H, A, αEntropy, anisotropy, alpha angle from Cloude-Pottier decomposition
|δ|, ϕδ, τMagnitude and phase of the particle scattering anisotropy, the degree of orientation randomness derived from Neumann decomposition
RVIRadar Vegetation Index
Table 3. Statistics of model calibration and validation for 10 different datasets with Support Vector Regression (SVR) and Random Forest Regression (RFR) in case of using all 27 polarimetric variables.
Table 3. Statistics of model calibration and validation for 10 different datasets with Support Vector Regression (SVR) and Random Forest Regression (RFR) in case of using all 27 polarimetric variables.
ScenarioModel CalibrationModel Validation
SVRRFRSVRRFR
RMSE (cm)RRMSE (cm)RRMSE (cm)RRMSE (cm)R
142.050.8422.150.9856.610.6452.810.74
243.040.8223.010.9849.640.7648.730.85
351.140.7522.650.9851.350.7549.270.82
443.100.8222.530.9849.620.7649.830.82
541.270.8421.950.9858.490.6451.730.78
641.280.8421.980.9858.490.6451.820.78
750.370.7522.190.9854.170.7550.930.80
842.590.8322.350.9848.920.8151.100.80
943.080.8222.280.9849.630.7648.820.84
1043.300.8222.510.9849.560.7648.990.84
Average44.120.8122.360.9854.690.7350.400.81
Table 4. Statistics of model calibration and validation for 10 different datasets with SVR and RFR in case of using the top six polarimetric variables in the variable importance ranking.
Table 4. Statistics of model calibration and validation for 10 different datasets with SVR and RFR in case of using the top six polarimetric variables in the variable importance ranking.
ScenarioModel CalibrationModel Validation
SVRRFRSVRRFR
RMSE (cm)RRMSE (cm)RRMSE (cm)RRMSE (cm)R
146.420.7922.330.9745.450.7942.690.84
249.140.7622.690.9748.040.7847.810.79
345.840.7921.910.9750.750.7448.070.78
445.730.7922.040.9746.200.8048.120.79
545.340.8121.530.9753.890.7250.680.74
648.690.7721.280.9751.570.7351.290.73
745.810.8022.420.9745.380.8146.140.82
845.820.8022.440.9745.380.8147.070.81
945.710.7922.310.9746.140.8047.720.79
1045.740.7922.360.9746.220.8048.010.79
Average46.420.7922.130.9747.900.7847.760.79
Table 5. Statistics of model validation at two different growing stages for 10 different datasets with SVR and RFR in case of using the top six polarimetric variables in the variable importance ranking.
Table 5. Statistics of model validation at two different growing stages for 10 different datasets with SVR and RFR in case of using the top six polarimetric variables in the variable importance ranking.
ScenarioModel Validation
SVR (Height < 150 cm)SVR (Height > 150 cm)RFR (Height < 150 cm)RFR (Height > 150 cm)
RMSE (cm)RRMSE (cm)RRMSE (cm)RRMSE (cm)R
153.910.5243.410.2762.020.5137.230.40
259.610.1645.160.3249.190.6447.510.26
344.460.8052.010.1048.470.9247.980.10
451.750.3444.910.3550.710.5847.550.26
555.130.7653.490.3168.370.7243.550.30
660.290.6748.460.3566.060.7145.580.26
759.990.3741.990.4764.460.4841.710.41
859.990.3741.990.4764.590.4942.890.38
951.760.3444.840.3549.000.6047.440.28
1051.750.3444.940.3551.060.6347.330.28
Average54.860.4746.120.3357.390.6344.880.29
Table 6. The measured and estimated corn heights of 32 sample points on 15 September 2018 based on RFR model prediction.
Table 6. The measured and estimated corn heights of 32 sample points on 15 September 2018 based on RFR model prediction.
Field NameCorn Height (cm)12345678
C1Measured274.83251.5295.17286.83290.67285.5235.75271.41
Estimated252.55228.26283.05275.69279.13274.72244.21263.61
C2Measured273.92301371.58260.42241.08283.17298.75283.92
Estimated255.66259.90255.99255.32244.74273.18282.48227.11
C3Measured255.75251.67219.83248.08162.5220.67253.5204.33
Estimated258.35261.41228232.20199.91254.53248.51261.60
C4Measured276.25288.50290.75288.08296.58298.92306284.25
Estimated270.46280.46246.22252.87255.87284.98268.49262.39
RMSE (cm)32.61
R0.59
Table 7. Statistics of model calibration and validation for 10 different datasets with SVR and RFR in case of using all 27 polarimetric variables and all 19 RADARSAT-2 images.
Table 7. Statistics of model calibration and validation for 10 different datasets with SVR and RFR in case of using all 27 polarimetric variables and all 19 RADARSAT-2 images.
ScenarioModel CalibrationModel Validation
SVRRFRSVRRFR
RMSE (cm)RRMSE (cm)RRMSE (cm)RRMSE (cm)R
148.630.8722.380.9958.810.8054.410.84
248.530.8622.630.9958.750.8255.620.86
350.050.8622.220.9959.870.7554.560.83
450.450.8722.780.9954.370.7652.980.78
549.090.8722.200.9955.160.7954.730.84
648.270.8722.180.9957.640.8357.620.84
749.130.8623.050.9953.440.8553.150.88
849.950.8622.430.9954.860.7650.650.82
948.510.8722.140.9956.530.8155.590.82
1049.230.8622.470.9958.050.7856.200.83
Average49.180.8722.450.9956.750.8054.550.83
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Xie, Q.; Wang, J.; Lopez-Sanchez, J.M.; Peng, X.; Liao, C.; Shang, J.; Zhu, J.; Fu, H.; Ballester-Berman, J.D. Crop Height Estimation of Corn from Multi-Year RADARSAT-2 Polarimetric Observables Using Machine Learning. Remote Sens. 2021, 13, 392. https://0-doi-org.brum.beds.ac.uk/10.3390/rs13030392

AMA Style

Xie Q, Wang J, Lopez-Sanchez JM, Peng X, Liao C, Shang J, Zhu J, Fu H, Ballester-Berman JD. Crop Height Estimation of Corn from Multi-Year RADARSAT-2 Polarimetric Observables Using Machine Learning. Remote Sensing. 2021; 13(3):392. https://0-doi-org.brum.beds.ac.uk/10.3390/rs13030392

Chicago/Turabian Style

Xie, Qinghua, Jinfei Wang, Juan M. Lopez-Sanchez, Xing Peng, Chunhua Liao, Jiali Shang, Jianjun Zhu, Haiqiang Fu, and J. David Ballester-Berman. 2021. "Crop Height Estimation of Corn from Multi-Year RADARSAT-2 Polarimetric Observables Using Machine Learning" Remote Sensing 13, no. 3: 392. https://0-doi-org.brum.beds.ac.uk/10.3390/rs13030392

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop