Next Article in Journal
The Ground-Based Absolute Radiometric Calibration of Landsat 8 OLI
Previous Article in Journal
A One Year Landsat 8 Conterminous United States Study of Cirrus and Non-Cirrus Clouds
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Impact of Positional Errors on Soft Classification Accuracy Assessment: A Simulation Analysis

1
State Key Laboratory of Earth Surface Processes and Resource Ecology, College of Resources Science and Technology, Beijing Normal University, Beijing 100875, China
2
Department of Natural Resources and the Environment, University of New Hampshire, 56 College Road, Durham, NH 03824, USA
*
Author to whom correspondence should be addressed.
Remote Sens. 2015, 7(1), 579-599; https://0-doi-org.brum.beds.ac.uk/10.3390/rs70100579
Submission received: 27 October 2014 / Accepted: 25 December 2014 / Published: 7 January 2015

Abstract

:
Validating or accessing the accuracy of soft classification maps has rapidly developed over the past few years. This assessment employs a soft error matrix as generalized from the traditional, hard classification error matrix. However, the impact of positional error on the soft classification is uncertain and whether the well-accepted half-pixel registration accuracy is suitable for the soft classification accuracy assessment is unknown. In this paper, a simulation analysis was conducted to examine the influence of positional error on the overall accuracy (OA) and kappa in soft classification accuracy assessment under different landscape conditions (i.e., spatial characteristics and spatial resolutions). Results showed that with positional error ranging from 0 to 3 soft pixels, the OA-error varied from 0 to 44.6 percent while the kappa-error varied from 0 to 93.7 percent. Landscape conditions with smaller mean patch size (MPS) and greater fragmentation produced greater positional error impact on the accuracy measures at spatial resolutions of 1 and 2 unit distances. However, this trend did not hold for spatial resolutions of 5 and 10 unit distances. A half of a pixel was not sufficient to keep the overall accuracy error and kappa error under 10 percent. The results indicate that for soft classification accuracy assessment the requirement for registration accuracy is higher and depends greatly on the landscape characteristics. There is a great need to consider positional error for validating soft classification maps of different spatial resolutions.

Graphical Abstract

1. Introduction

In the past few decades, earth observation satellites (EOS) have provided large amounts of remotely sensed data depicting the earth’s surface at a variety of spatial and temporal scales. Global or regional land cover maps from these remotely sensed data are typically based on image classification and a great number of classification methods have been developed, ranging from classical ones such as minimum distance [1] to more advanced ones such as support vector machine [2]. Generally, two groups of classification methods exist: per-pixel (hard) and sub-pixel (soft) classification [3]. The hard classification methods classify the remotely sensed data into maps where each pixel belongs to a single land cover/vegetation/thematic class, whereas the soft classification methods categorize each pixel into several classes simultaneously [3]. Soft classification methods have recently gained wider use because of their ability to continuously represent and estimate the area of land cover at the sub-pixel level, especially for land cover mapping using coarse resolution data where mixed pixels dominate.
Accordingly, as an indispensable part of the classification process, there is a great need to develop methods for validating or accessing the accuracy of soft classification maps because the error matrix developed for hard classification accuracy assessment [4] may not be appropriate for use in representing the soft classification accuracy [5]. The major issue is that accuracy information will be lost if the Boolean operator typically used in a conventional error matrix to record the attribute in the reference sample unit as 1 for “correct” and 0 for “wrong” is used when determining the accuracy of a soft classification map [5]. In other words, the entire pixel is no longer either right or wrong, but rather soft classification labels parts of the pixel into different classes. Given this issue, much effort has been made to generalize the conventional error matrix and adapt it for use in soft classification. Until now, various approaches have been advanced for the sub-pixel error matrix based on a number of mathematical theories such as minimum operator [5], multiplication operator [6] and composite operator [7,8]. From the soft error matrix, a number of descriptive and analytical statistical measures can be calculated including overall accuracy (oa), kappa coefficient (kappa), user’s accuracy (ua) and producer’s accuracy (pa) [4]. It is worth noting that all these methods for constructing the soft error matrix were developed under the assumption that the classified map is perfectly co-registered between the reference data and the ground. However, in practice, no map is free of positional errors and the assessment of thematic accuracy must include or incorporate some measure of positional accuracy [9,10,11].
Positional error is regarded as one of the great sources of uncertainty in accuracy assessment [12,13,14]. Positional errors mainly arise from the geometric distortion of the image, including earth rotation effects, scanning system, and the random spacecraft movement [15], which take on different forms of distortion such as scaling, rotation, translation and scan skewing [16]. Image registration includes two steps to correct these issues. The first is called systematic correction using ephemeris data from the spacecraft and results in approximate image coordinates. The second step is precision correction which registers the image to a series of ground features by using ground control points. For accuracy assessment, the positional error between the classification map and the reference data is not limited only to image registration, but is also subject to GPS error [17] because the ground reference data are often collected using a GPS receiver. The synchronization error between the satellite and the code-phased GPS receiver results in a locational error ranging from 5 to 20 meters [18]. The assumption therefore, is that the difference between the classification map and reference data results only from the classification error not being satisfied. However, the non-thematic errors arising from misregistration can be a significant contribution to the error matrix that may then mask the real thematic accuracy information. Powell et al. [19] determined that more than 30% of thematic errors for Landsat TM could be attributed to the positional errors. This issue is more severe in heterogeneous landscapes where several map classes are located together [11]. Intuitively, for soft classification, coarser spatial resolution increases the number of mixed pixels in the classification maps [20], making the soft error matrix more sensitive to positional errors because even a slight location error between the classification and the reference map can substantially change the proportion of map classes within that mixed pixel.
Higher resolution images are often widely accepted as a substitute for the ground reference data used in validating the classification map. A half-pixel registration accuracy between the classification map and the reference map has often been reported [21] and accepted as common, especially for moderate resolution imagery such as Landsat Thematic Mapper imagery. However, whether this requirement holds for soft classification accuracy assessment and whether it changes with the varied spatial resolutions is still unknown.
Many projects have been conducted in the area of land cover dynamics, e.g., map comparison for change detection analysis. It has been shown that a slight location error can result in a great impact on the accuracy of change detection [16,22,23] and that a registration accuracy of 0.2 pixels is required to guarantee the error of change detection is less than 10% [16,24]. It was also found that the effect of positional error is influenced by the heterogeneity of the land cover. The more fragmented land cover maps produce the greater effect on the change detection error owing to positional error [25]. Aggregation-based or object-based change detection methods have been suggested to reduce the impact of the position on the change detection accuracy [26,27,28,29]. Studies of misresgistration issues in change detection analysis and soft classification have therefore provided the impetus for this paper. These issues include modeling the analysis of positional error and the study of the spatial characteristics of the land cover.
In this paper, therefore, the spatial characteristics of land cover and spatial resolution were simulated to test the sensitivity of thematic errors caused by the positional errors on soft classification accuracy assessment. The purpose of this study was to: (1) examine the impact of positional accuracy on the measures derived from the soft error matrix; (2) provide insight into how this effect changes with varied landscape structure and spatial resolution; and (3) facilitate the consideration of registration for future soft classification accuracy assessment.

2. Methods

2.1. Data Simulation

This paper explores the impact of the positional error on the soft classification using simulated soft maps. Many factors, such as interpreters’ skills, sampling errors and positional errors may simultaneously affect the accuracy assessment of the soft classification using real images. Therefore, using real images makes it nearly impossible to systematically analyze the effect of misregistration. The use of controlled, simulated maps could facilitate the study of thematic errors caused only by positional error without considering the thematic errors caused by other factors. Furthermore, a simulated map is independent of labor-intensive and time-consuming fieldwork necessary to create the classified map and assess the map accuracy. Simulating soft maps can be problematic; we thus followed the strategy developed by [30] in which the soft classification map and its reference map are generated by transforming the initially simulated hard categorical map using a scaling method. The generation of soft maps and modeling positional errors are detailed in following steps.

2.1.1. Simulate Hard Categorical Map with Different Spatial Characteristics

A series of hard categorical maps representing various landscape characteristics were simulated in this study using the SIMMAP software. This software uses the Modified Random Clusters (MRC) method to simulate landscapes whose spatial characteristics are similar to those widely found in real landscapes [31]. SIMMAP has been widely applied for creating controlled, artificial landscapes to study landscape patterns [32,33,34]. Therefore, these simulated maps were used here in place of real landscapes in order to control the factors influencing registration error. The spatial characteristics of various artificial landscapes were controlled using five main parameters: number (NUM) and abundance (A) of the classes, initial probability (P), linear dimension of the pattern (L), minimum mapping unit (MMU) and neighborhood criteria (NC) [31]. Most of these parameters are easily understandable from their names. Initial probability (P) denotes the degree of fragmentation of landscape pattern, and the neighborhood criterion (NC) is the pattern with anisotropy that patches can be placed in a certain dominant direction. The smaller the initial probability, the more fragmented the landscape pattern. The simulated maps will be in a simple random pattern when P is equal to 0, an aggregated pattern with larger patches when P increases, and a single patch filling the entire pattern when P exceeds 0.59 [31]. The MMU adjusts the size of the smallest patch in the simulated map, ranging from 1 (single pixel) to its maximum value 99. Each pixel on the hard categorical map was assigned to a single value ranging from 1 to NUM, representing the class legend (i.e., number of land cover classes). We defined the spatial resolution of the hard pixel as one unit-pixel and the length of pixel as one unit distance in this paper because there is no definition of spatial resolution for a map simulated in SIMMAP.
We simulated three groups of hard categorical maps representing various landscapes with MMU equal to 1, 10 and 99 respectively within which P varied from 0.35 to 0.58 resulting in 24 different maps The spatial characteristics of the simulated maps are listed in Table 1 and are characterized by the following indices: number of patches (NP), mean patch size (MPS), edge length (EL) and mean shape index (MSI). NP and MPS are expressed in pixels and EL is expressed in unit pixel-distance. MSI denotes the irregularity or elongation of the shapes in the pattern with minimum value of 1 and the higher values indicating more convoluted and elongated shapes. The calculation of these indices can be found in the SIMMAP manual [35]. Generally, with higher MMU and P, the NP and EL decreases while the MPS and MSI increases. The MPS is lower in the first group of eight maps, medium in the second group of eight maps, and large in the third group of eight maps. Within each group of eight maps, the fragmentation level changes the NP, EL, MPS and MSI. To simplify the analysis, only two land cover classes (NUM = 2) were considered. The dimensions of each map were 2000 lines by 2000 columns and the abundances of each map class were 60% and 40%, respectively. The simulated landscapes for P equal to 0.35, 0.54 and 0.58 and MMU equal 1, 10 and 99 are presented in Figure 1. These maps are representative of a variety of landscape conditions found on typical land cover maps.
Figure 1. Selectively simulated landscapes (thematic maps) with P being 0.35, 0.54 and 0.58 respectively and MMU being 1, 10 and 99 respectively.
Figure 1. Selectively simulated landscapes (thematic maps) with P being 0.35, 0.54 and 0.58 respectively and MMU being 1, 10 and 99 respectively.
Remotesensing 07 00579 g001
Table 1. The spatial characteristics of the simulated hard categorical maps and the percent of mixed pixels after aggregation using different scales.
Table 1. The spatial characteristics of the simulated hard categorical maps and the percent of mixed pixels after aggregation using different scales.
MMUPNPMPSELMSIPercent of Mixed Pixels (%)
1 × 12 × 25 × 510 × 10
10.3551,44377.761.46 × 1061.33035.8392.2799.98
0.4043,50291.951.30 × 1061.32032.2788.8899.91
0.4536,796108.181.13 × 1061.29028.3083.3699.60
0.5031,887125.44938,7071.24023.3174.3198.17
0.5230,088132.09846,4901.21021.1769.4596.97
0.5428,996137.95751,6051.19018.8163.3994.79
0.5628,433140.63647,4671.16015.9155.7190.24
0.5828,599140.06532,0191.14013.0647.3284.47
100.3517,096233.971.23 × 1061.82030.5283.9599.25
0.4014,269280.331.11 × 1061.78027.7380.0798.59
0.4511,493348.04970,8601.74024.1173.7097.00
0.508479457.20789,2031.67019.7363.6892.52
0.527918505.18707,1031.62017.5457.7888.62
0.546896580.05606,5291.56015.2351.3583.13
0.566496615.76506,5301.50012.7143.5574.84
0.586205644.76383,2761.4409.6033.4161.54
990.3535671121.39780,2432.89019.5157.9181.74
0.4032041234.57756,0502.81018.8157.5782.51
0.4526052145.92710,1762.75017.4555.7382.50
0.5018642312.14594,0922.64014.8949.4477.98
0.5216022496.88530,2332.58013.2344.9373.61
0.5411983338.90456,3932.50011.5139.5767.50
0.5610303883.50358,1732.3408.7130.6054.08
0.587495340.45241,0952.0805.9421.0939.07

2.1.2. Generate Soft Classification Map by Aggregation

For hard classification accuracy assessment, whether the impact of positional error is severe or not can depend on the heterogeneity of landscape. Fragmented landscapes increase class boundaries. Pixels located near or at the map class boundary could create thematic errors due to the positional error while the pixels inside the patch would be insensitive to the positional errors [11]. In contrast, for soft classification accuracy assessment, the impact of positional errors not only depends on the heterogeneity of landscape but also may depend on the spatial resolution, which determines the heterogeneity within the pixel. Images/pixels with lower spatial resolution have more mixed pixels and accordingly even slight location errors could largely change the proportion of each class on the reference map. In this paper, the soft classification map was generated by scaling up the hard categorical maps generated using SIMMAP. We aggregated each cluster of pixels on the hard categorical map into a new soft pixel containing a vector denoting the proportion of each class (Figure 2). We varied the cluster size as 1 × 1, 2 × 2, 5 × 5, and 10 × 10 pixels respectively to model different spatial resolutions of the soft map. Accordingly, the spatial resolution of the generated soft maps is 1, 2, 5 and 10 unit distances, respectively, representing from higher resolution to lower resolution. The 1 × 1 cluster size would make all the soft pixels pure since this is the same resolution of the original hard map. Although this situation does not exist in reality, we still included it here for comparison. The 10 × 10 cluster size would make almost all the soft pixels mixed. The percentage of mixed pixels in the soft maps using different scales for all simulated hard categorical maps is presented in Table 1. The greater the fragmentation and the smaller the mean patch size of the landscape, the higher will be the percentage of the mixed pixels. We regarded these aggregated maps as the soft classification maps. If we compared these soft classification maps with the hard categorical maps using a soft classification matrix [7], their thematic accuracy would be one hundred percent.
Figure 2. Example of a simulated soft classification and reference pixel (spatial resolution: 5 unit distance) at levels of 0.0, 0.1, 0.2 and 0.3 soft pixel’s positional error respectively in in both x and y direction. The proportion of two classes was greatly changed due to the different levels of positional errors.
Figure 2. Example of a simulated soft classification and reference pixel (spatial resolution: 5 unit distance) at levels of 0.0, 0.1, 0.2 and 0.3 soft pixel’s positional error respectively in in both x and y direction. The proportion of two classes was greatly changed due to the different levels of positional errors.
Remotesensing 07 00579 g002

2.1.3. Simulating Positional Errors

Once the soft classification maps were created, it was necessary to simulate positional errors to evaluate their impact on the thematic accuracy of the map. Positional errors typically vary spatially over the map [12]. In this study, we simulated the positional errors using a uniform model as it is very complicated to model these positional errors practically. The uniform model assumed that the positional errors were equally distributed between the classification map and the reference map and were modeled by translation using the methods developed by [16,17] in research on quantifying the effect of misregistration in change detection analysis. Therefore, for a soft pixel at location (x, y) on the soft classification map generated as previously described, we used the cluster at location (x + i, y + i) on the hard categorical map as the reference unit, where i is the number of soft pixels off its original position and i varied from 0 to 3 soft pixels with an addition of 0.1 soft pixels. After translation, the reference unit is 2 i from the corresponding soft pixel. This factor is used because registration accuracy is commonly reported as “the registration accuracy between the classification map and reference map is within a half of a pixel.” However, it is important to note that a half pixel for different spatial resolutions implies a different absolute distance. For example, a half of a pixel of Landsat TM 5 is 15 m while half of a pixel of MODIS imagery is 500 m. In this paper, a half-soft pixel’s misregistration in a soft map with a spatial resolution of 10 unit distance means 5 unit distance, while for a soft map with a spatial resolution of 5 unit distance means 2.5 unit distance. Sub-pixel misregistration is performed as shown in Figure 2. We apply this translation model to every soft classification pixel on the soft maps (except the soft pixels located at the four borders of the map), then record the proportion of change for every soft pixel by shift in order to create the soft reference map with positional errors. Figure 2 shows an example of the generation of the soft classification pixel and its reference pixel with positional errors of 0.1, 0.2 and 0.3 soft pixels.

2.2. Accuracy Assessment

Once the soft classification and reference maps were generated and the maps were subjected to registration error, an assessment could then be conducted to evaluate the errors. In this study, the composite operator developed by [7] was used for constructing the soft classification matrix (SCM) because of its three practical and reasonable characteristics: sum of entries being 1, diagonal matrix for identical maps, and constant marginal totals. The composite operator consists of different calculations for the diagonal and off-diagonal elements (Equation (1)). The diagonal elements (where i equals j) employ the minimum operator, the ruling principle of which is that the agreement of elements cannot exceed the minimum value of two proportions. The off-diagonal elements employ the mathematical formula in Equation (1) when ij. The core idea of this formula is that the proportion of class i incorrectly classified as class j, denoted as pnij, is computed by distributing the commission proportion of class i c n i p n i i according to the ratio of omission proportion of class j c n i p n i i to the total omission proportion of all classes j = 1 k ( r n j p n j j ) [7]. Table 2 shows the sub-pixel error matrix constructed for the nth soft pixel. The sum of each row and column is equal to the proportion of each class classified and the true proportion of reference data, respectively. The sum of the diagonal and off-diagonal elements is equal to one unit in the SCM for each sampled pixel. The SCM was first constructed for each soft pixel and then the final SCM was created by averaging the SCMs for all pixels between the soft classification map and corresponding reference map (Table 3). From the SCM, we derived overall accuracy and kappa which are the most commonly used accuracy measures at the map level in remote sensing.
p n i j = { M I N ( c n i , r n j ) , i = j ( c n i p n i i ) × [ ( r n j p n j j ) / j = 1 k ( r n j p n j j ) ] , i j
Overall accuracy and kappa will fluctuate given the varied positional errors introduced in the classification map and reference data. When the soft classification map and reference data are perfectly registered, the overall accuracy and kappa represent the actual thematic accuracy. However, in this study, we are particularly interested in the component of the thematic error that is introduced by the positional errors only. This part of the thematic error equals the accuracy resulting from the soft error matrix without positional errors minus its counterpart with positional errors. In this paper, the thematic error for overall accuracy is denoted as the OA-error and for kappa the kappa-error. In this study, the overall accuracy is 100% and kappa is 1 when there is no positional error because the reference data were created from the land cover maps and there is no error.
Table 2. Sub-pixel error matrix for the nth pixel.
Table 2. Sub-pixel error matrix for the nth pixel.
ClassificationReferenceTotal
Class 1Class 2Class k
Class 1pn11pn12pn1kcn1
Class 2pn21pn22pn2kcn2
Class kpnk1pnk2pnkkcn3
Totalrn1rn2rnk
Table 3. Error matrix for soft classification.
Table 3. Error matrix for soft classification.
ClassificationReferenceTotal
Class 1Class 2Class k
Class 1P11P12P1kP1+
Class 2P21P22P2kP2+
Class kPk1Pk2PkkPk+
TotalP+1P+2P+k
Accuracy measures O a = P i i n × 100 % ,   K a p p a = n i k P i i i k P i + P + i n 2 i k P i + P + i
U a = P i i P i +   P a = P j j P + j

3. Results

Figure 3, Figure 4, Figure 5 and Figure 6 show the relationship between OA-errors and the positional errors while Figure 7, Figure 8, Figure 9 and Figure 10 show the link between kappa-errors and the positional errors where the spatial resolution is 1, 2, 5, and 10 unit distances, respectively. Each figure presents the relationship divided into three groups by MPS: large MPS or mean patch size (1121.39–5340.05), middle MPS (233.97–644.76, and small MPS (77.76–140.06) from the left to the right. Within each group, the P value (representing the fragmentation) varies from 0.35 to 0.58 and the MPS also varies accordingly. The bottom abscissa value is the relative distance based on the soft pixel associated with a given spatial resolution while the top abscissa value is the absolute distance based on the unit distance corresponding to the relative distance. The same relative positional error appears to have different absolute distance because of the spatial resolution.
Figure 3 reveals the impact of a positional error ranging from 0 to 3 pixels on the OA-error with varying fragmentation and MPS at a spatial resolution of 1 unit distance. It was determined that as a positional error increases, so do the OA-errors. When there was no positional error, there was no OA-error. The rate of growth is different depending on the amount of fragmentation and MPS. For example, the rate is very high when the P is 0.35 and MPS is 77.76, whereas the rate is low when P is 0.58 and MPS is 5340.45. Generally, smaller MPS and higher fragmentation (lower P values) produced a greater positional error effect on OA-error. As can be seen in Figure 3, the corresponding lines in the left group (MPS of larger size) are lower than the middle group (MPS of medium size) which is lower than the right group (MPS of small size). At the positional error of three soft pixels, the largest impact results in an OA-error of 44.6% while the least impact produces only 10.4%.
Figure 3. Impact of positional error on OA-error at different landscape characteristics using a spatial resolution of 1 unit distance.
Figure 3. Impact of positional error on OA-error at different landscape characteristics using a spatial resolution of 1 unit distance.
Remotesensing 07 00579 g003
Figure 4. Impact of positional error on OA-error at different landscape characteristics using a spatial resolution of 2 unit distance.
Figure 4. Impact of positional error on OA-error at different landscape characteristics using a spatial resolution of 2 unit distance.
Remotesensing 07 00579 g004
Figure 5. Impact of positional error on OA-error at different landscape characteristics using a spatial resolution of 5 unit distance.
Figure 5. Impact of positional error on OA-error at different landscape characteristics using a spatial resolution of 5 unit distance.
Remotesensing 07 00579 g005
Figure 6. Impact of positional error on OA-error at different landscape characteristics using a spatial resolution of 10 unit distance.
Figure 6. Impact of positional error on OA-error at different landscape characteristics using a spatial resolution of 10 unit distance.
Remotesensing 07 00579 g006
Figure 7. Impact of positional error on kappa-error at different landscape characteristics using a spatial resolution of 1 unit distance.
Figure 7. Impact of positional error on kappa-error at different landscape characteristics using a spatial resolution of 1 unit distance.
Remotesensing 07 00579 g007
Figure 8. Impact of positional error on kappa-error at different landscape characteristics using a spatial resolution of 2 unit distance.
Figure 8. Impact of positional error on kappa-error at different landscape characteristics using a spatial resolution of 2 unit distance.
Remotesensing 07 00579 g008
Figure 9. Impact of positional error on kappa-error at different landscape characteristics using a spatial resolution of 5 unit distance.
Figure 9. Impact of positional error on kappa-error at different landscape characteristics using a spatial resolution of 5 unit distance.
Remotesensing 07 00579 g009
Figure 10. Impact of positional error on kappa-error at different landscape characteristics using a spatial resolution of 10 unit distance.
Figure 10. Impact of positional error on kappa-error at different landscape characteristics using a spatial resolution of 10 unit distance.
Remotesensing 07 00579 g010
The results of Figure 3 also hold in Figure 4 where the spatial resolution is 2 unit distance. The only difference is that the 2 unit distance spatial resolution creates less impact of positional error on the OA-error than the 1 unit distance with the same amount of absolute distance but creates greater impact with the same amount of relative distance. For instance, the 3 unit distance positional error created nearly 28.8% OA-error when P equals 0.35 with MPS being 1121.39 at the spatial resolution of 1 unit distance, whereas its counterpart created 27.2% OA-error at the spatial resolution of 2 unit distance. However, the 3 soft pixel positional error created nearly 28.8% OA-error when P equals 0.35 with MPS being 1121.39 at the spatial resolution of 1 unit distance, whereas its counterpart creates 35.5% OA-error at the spatial resolution of 2 unit distance. It was also found that some of the lines with P being 0.35 and 0.40 show a trend of becoming stable in the middle and small MPS groups when positional errors are greater than 1.5 soft pixels.
These results change when the spatial resolution is increased to 5 unit distance (Figure 5). Compared with Figure 3 and Figure 4, the lowest impact remained the same while the highest impact is not when P equals to 0.35 but when P is equal to 0.45 or 0.50. The lines at P equals 0.35, 0.40 and 0.45 show a trend becoming stable after the positional errors reach 1 soft pixel. Compared to what was shown in Figure 3 and Figure 4, the corresponding lines with the same fragmentation lie in different groups of MPS and do not have much difference.
Compared to the spatial resolutions of 1, 2, and 5 unit distances, the pattern at a spatial resolution of 10 unit distance are clearly different (Figure 6). The highest impact is shown on the line with P being 0.54 or 0.56 for all groupings of MPS. However, the lowest impact is shown on the line with P being 0.58 when the positional error is small, but changes to P being 0.35 when the positional error is large. The corresponding lines in the left group (MPS of larger size) are higher than the middle group (MPS of medium size) which is higher than the right group (MPS of small size), which is the complete opposite of the results from Figure 3 and Figure 4.
A comparison of Figure 3 to Figure 6 by MPS grouping shows that under the same spatial characteristics, the spatial resolution alters the impact of positional error on the OA-error. For example, with P being 0.58 and MPS being 5340.45, the OA-error ranges from 0% to 10.4% at the spatial resolution of 1 unit distance while it varies from 0% to 34.2% at the spatial resolution of 10 unit distance. As the spatial resolution becomes coarser, the group of lines becomes denser, meaning that the degree of influence of fragmentation is less. It also shows as the spatial resolution becomes coarser, the three groups of MPS do not change the shape of the lines as much with the same fragmentation.
The same analysis that was performed with OA-error can also be shown using kappa-error. Compared to the impact of positional error on the OA-error, the effect on the kappa-error is higher (Figure 7, Figure 8, Figure 9 and Figure 10). With three soft pixel positional errors, the kappa-error ranges from 0% to 93.7%. The patterns of fragmentation and MPS are the same as was found in OA-error analysis.
Table 4 shows the required registration accuracy (# of soft pixels) to keep the OA and kappa error under 10% for different landscape patterns at a spatial resolution of 1, 2, 5, and 10 unit distance respectively. A value of 10% was selected here as a reasonable amount of error contribution from positional error in a typical thematic accuracy assessment. It is clear that for most landscape patterns simulated here, a half of a soft-pixel registration accuracy is not sufficient to guarantee the OA-error is less than 10% except when the spatial resolution is 1 unit distance. To retain the OA-error at less than 10%, the required registration accuracy for a spatial resolution of 1 unit distance ranges from 0.3 to 2.8 soft pixels with an average of 0.85 pixels. This requirement then translates to a range of from 0.2 to 1.4 soft pixels, 0.1 to 0.6 soft pixels, and 0.2 to 0.4 soft pixels at spatial resolutions of 2, 5, and 10 unit distances, respectively. To retain the kappa-error at less than 10%, the required registration accuracy for spatial resolution of 1, 2, 5, and 10 unit distances range from 0.1 to 1.1 soft pixels, 0.1 to 0.6 soft pixels, 0.1 to 0.3 soft pixels and less than 0.1 to 0.1soft pixels, respectively. As the spatial resolution gets coarser, the required average positional accuracy varied from 0.86 to 0.22 for overall accuracy and from 0.29 to less than 0.1 for kappa.
Table 4. The required registration accuracy (# soft pixels) to keep the OA- and kappa-errors under 10% for different landscape characteristics at the spatial resolution of 1, 2, 5, 10 unit distances respectively. (“<” represents “less than”).
Table 4. The required registration accuracy (# soft pixels) to keep the OA- and kappa-errors under 10% for different landscape characteristics at the spatial resolution of 1, 2, 5, 10 unit distances respectively. (“<” represents “less than”).
PMPSLess than 10% OA-ErrorLess than 10% Kappa-Error
1251012510
0.3577.760.30.20.10.20.10.10.10.1
0.4091.950.30.20.10.20.10.1<0.10.1
0.45108.180.40.20.10.20.10.1<0.1<0.1
0.50125.440.50.30.20.20.20.10.1<0.1
0.52132.090.50.30.20.20.20.10.1<0.1
0.54137.950.60.40.20.20.20.10.10.1
0.56140.630.80.50.30.20.30.20.10.1
0.58140.061.10.60.30.30.40.20.10.1
0.35233.970.30.20.10.20.10.1<0.10.1
0.40280.330.40.20.10.20.10.1<0.10.1
0.45348.040.40.30.20.20.10.1<0.1<0.1
0.50457.200.60.30.20.20.20.10.1<0.1
0.52505.180.70.40.20.20.20.10.1<0.1
0.54580.050.90.50.20.20.30.20.10.1
0.56615.761.10.60.30.20.40.20.10.1
0.58644.761.50.80.40.30.60.30.20.1
0.351121.390.60.30.20.20.20.10.1<0.1
0.401234.570.60.40.20.20.20.10.1<0.1
0.452145.920.70.40.20.20.20.10.1<0.1
0.502312.140.90.50.20.20.30.20.1<0.1
0.522496.881.00.50.30.20.40.20.1<0.1
0.543338.901.20.60.30.20.40.20.1<0.1
0.563883.501.70.90.40.30.60.30.10.1
0.585340.452.81.40.60.41.10.60.30.1
Average0.850.460.230.220.290.170.12<0.1

4. Discussion

Quantifying the accuracy of sub-pixel land cover maps derived from the soft classification of remote sensing images is crucial for the future development of fields such as ecological and environmental modeling which incorporate land cover information as an important variable. The error matrix has been widely applied for accuracy assessment because of its practicality and effectiveness. Recent research has shown that much effort has been made to generalize a soft classification error matrix from the traditional one. The soft error matrix has the same accuracy measures as the traditional one. However, the impact of positional error on the soft classification thematic accuracy is uncertain and has not been addressed explicitly. In this paper, the impact of positional errors on thematic accuracy of soft classification was examined including the effects introduced by landscape structure and spatial resolution. A series of soft classifications with varied landscape patterns and the corresponding reference data were simulated and a sub-pixel misregistration model was implemented. The thematic errors caused by positional errors were reported using overall accuracy and kappa.
Our results showed that the effects of positional error on thematic accuracy as computed from a soft error matrix cannot be ignored. Most previous research in this field on the impact of positional accuracy on thematic accuracy used Landsat Thematic Mapper (TM) imagery and did not examine soft classification. Our work employed artificially generated maps representing various landscape patterns and simulation of soft classifications making our results difficult to directly compare to the previous work. However, there are still some important similarities between our work and this previous work. Patterson and Williams [36] reported that positional error could reach up to one pixel and could be located anywhere in a 3 × 3 pixel grouping [37]. Given this amount of positional error, the results of our simulations showed errors of overall accuracy ranging from 4.12% to 34.3%, whereas the error of kappa ranged from 8.47% to 71.76%. Kappa is more sensitive to the positional error largely because kappa uses the off-diagonal elements of the error matrix in the calculation of the statistic, whereas overall accuracy does not. These results are consistent with [19,38] who indicated that 30% percent of thematic errors were attributed to misregistration for Landsat pixels. Therefore, the results of this research provide further confirmation of previous work.
This paper determined that a half-pixel (as determined for Landsat Thematic Mapper imagery) is not appropriate for all spatial resolutions when performing a soft classification accuracy assessment. In this paper, a half-pixel was found to be sufficient only when the map used was of 1 unit distance spatial resolution. As the spatial resolution becomes coarser, the positional requirement increases. Therefore, the positional accuracy standard for the soft classification accuracy assessment should be updated and the future requirement for registration accuracy must consider the spatial resolution of the imagery. In reality, the half-pixel standard was defined from the application of SPOT and Landsat TM imagery [14,39]. Whether this is suitable for the MODIS and other remotely sensed images with lower spatial resolution also needs to be examined. From our research, we can deduce that the required positional accuracy for MODIS is greater than the half-pixel required for SPOT and Landsat TM.
In the past few years, much effort has been made to develop complicated algorithms for generalizing the conventional error matrix to the soft error matrix [7,8]. However, requirements suitable for the hard classification accuracy may not be suitable for the soft one. We suggest examining the assumptions, procedures, and accuracy measures created for the hard error matrix and applying these for the soft classification accuracy assessment including sampling scheme and explanation of accuracy measures. This research has also provided some insights when considering the sampling location for collecting the reference data for comparison with the map in creating the error matrix. It has shown that the chance of misclassifying a pixel increased in places where the patch size was smaller and heterogeneity was higher [20,40]. The aim of any sampling design for accuracy assessment is to present these thematic errors proportionally in the error matrix and, therefore, a great deal of effort had been made to develop the most efficient sampling schemes to minimize the sampling error [41,42]. However, the existence of positional error introduces false information into the error matrix, thereby causing more error than just the error due to sampling. In addition, the influence of the positional error was also higher where the landscape was more heterogeneous with smaller patch size, especially when the spatial resolution was higher. Therefore, sampling in only the homogeneous areas will result in overestimating the accuracy [43] but sampling that includes the heterogeneous areas risks increasing the false accuracy information. How to design sampling schemes considering positional errors requires further research.
Our research has complemented the efforts to assess the impact of positional error on the thematic accuracy under different spatial resolutions and spatial characteristics of landscape for soft classification. This impact was a function of landscape variables, degree of registration, and spatial resolution. At high spatial resolutions we noted that the more fragmented and the smaller patch sizes of the landscape, the greater the effect of positional error. However, at lower spatial resolutions, the trend noted for higher spatial resolutions changed. This trend reversal for lower spatial resolutions can be explained in that the larger pixel sizes have more chance to shift to a new location, although the proportion of class membership within the pixel remains similar when the landscape is heterogeneous. Therefore, it appears as if the pixel has not moved even though it has.
This research has also provided some implications for soft change detection analysis. If the reference map and classification map in this paper are treated as two maps from different dates, all the results and conclusions are suitable for the change detection using soft classification. These results then complement the work of [16] who examined the requirement of positional accuracy for change detection using hard classification for Landsat TM data and concluded that less than one-fifth of a pixel is required to achieve a change detection error of less than 10%.
A number of limitations also remain. First, this research only employs artificial images to create landscape patterns, and the differences between these simulated images and real satellite images need consideration. Second, this research only took into account two classes with fixed class proportions as dictated by the Modified Random Clusters model for simulation. Simulating more than two classes with different spatial characteristics is difficult to keep the proportion of classes the same. We can speculate that more classes will increase the impact of the positional errors because it increases the fragmentation and decreases the patch size. We can also deduce that one dominating class with other minor classes will decrease the impact because it increases the mean patch size. Finally, this paper ignored the distribution of the positional errors. Positional errors are not equally distributed over the spatial map [12] and the modeling of the distribution of the positional errors will be a task for future research.

5. Conclusions

This paper conducted an analysis of the impact of positional error on thematic accuracy for soft accuracy assessment using simulated maps of various landscape patterns at different spatial resolutions. Some of the results of this work were quite predictable, while others were not, thus having specific implications on the impact of positional error on thematic error when assessing map accuracy. Not surprisingly, increased fragmentation of the landscape increased the impact of positional error on thematic accuracy. Also not surprisingly, initially larger pixels increased the impact of positional error on thematic accuracy as the larger pixels cause increased fragmentation. However, what is surprising is that this trend did not continue as the pixel size increased. Instead, the larger pixels reached a point in which the proportion of map classes within a large, soft pixel reached an equilibrium regardless of the positional error. This result has special significance when considering low spatial resolution imagery such as AVHRR or MODIS data. Another very interesting and practical result of our work showed that a half-pixel is not sufficient to keep the thematic error under 10% for all spatial resolutions for soft classification accuracy assessment. A half-pixel registration error is commonly accepted for moderate resolution imagery such as Landsat Thematic Mapper and SPOT. A 10% error contribution from positional error is a reasonable amount that might be accepted in a thematic map accuracy assessment. However, at various spatial resolutions tested in this study, it was determined that a half-pixel registration error does not keep the thematic error under 10%. Detailed figures were provided to allow the reader to determine the impact of positional error at varying landscape patterns and spatial resolutions. Finally, the impact of positional error on the thematic accuracy changes with variation in landscape variables, degree of registration, and spatial resolution. There is a great need to update the positional requirements for soft accuracy assessment according to the map spatial resolution and landscape structure. In addition, the results and conclusions in this paper are also suitable for soft change detection analysis. Careful consideration of the issues and analysis described in this paper will result in improved soft accuracy assessment in the future.

Acknowledgments

This study was supported by both the Major Project of High Resolution Earth Observation Systems and the National Natural Science Foundation of China (No. 41301444). The authors are grateful for this support. All work on this study was conducted while the first author was a visiting researcher in the Basic and Applied Spatial Analysis Lab (BASAL), Department of Natural Resources and the Environment, University of New Hampshire under the direction of Russell G. Congalton. Use of these facilities including support from the New Hampshire Agricultural Experiment Station is gratefully acknowledged.

Author Contributions

The concept of this research was initially developed by Jianyu Gu in discussion with Russell G. Congalton. Jianyu Gu also discussed this idea with Yaozhong Pan and received his input. Jianyu Gu and Russell G. Congalton then developed a flow chart of the entire process. Jianyu Gu conducted the analysis with significant feedback from Russell G. Congalton. Jianyu Gu wrote the initial draft of the paper which was edited significantly by Russell G. Congalton until the final draft was produced. This draft was shared with Yaozhong Pan for additional comments and suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Erbek, F.S.; Özkan, C.; Taberner, M. Comparison of maximum likelihood classification method with supervised artificial neural network algorithms for land use activities. Int. J. Remote Sens. 2004, 25, 1733–1748. [Google Scholar] [CrossRef]
  2. Mountrakis, G.; Im, J.; Ogole, C. Support vector machines in remote sensing: A review. ISPRS J. Photogramm. Remote Sens. 2011, 66, 247–259. [Google Scholar] [CrossRef]
  3. Lu, D.; Weng, Q. A survey of image classification methods and techniques for improving classification performance. Int. J. Remote Sens. 2007, 28, 823–870. [Google Scholar] [CrossRef]
  4. Congalton, R.G. A review of assessing the accuracy of classifications of remotely sensed data. Remote Sens. Environ. 1991, 37, 35–46. [Google Scholar] [CrossRef]
  5. Binaghi, E.; Brivio, P.A.; Ghezzi, P.; Rampini, A. A fuzzy set-based accuracy assessment of soft classification. Pattern Recognit. Lett. 1999, 20, 935–948. [Google Scholar] [CrossRef]
  6. Lewis, H.G.; Brown, M. A generalized confusion matrix for assessing area estimates from remotely sensed data. Int. J. Remote Sens. 2001, 22, 3223–3235. [Google Scholar] [CrossRef]
  7. Pontius, R.G.; Cheuk, M.L. A generalized cross-tabulation matrix to compare soft-classified maps at multiple resolutions. Int. J. Geogr. Inf. Sci. 2006, 20, 1–30. [Google Scholar] [CrossRef]
  8. Silvan-Cardenas, J.L.; Wang, L. Sub-pixel confusion-uncertainty matrix for assessing soft classifications. Remote Sens. Environ. 2008, 112, 1081–1095. [Google Scholar] [CrossRef]
  9. Husak, G.J.; Hadley, B.C.; McGwire, K.C. Landsat thematic mapper registration accuracy and its effects on the IGBP validation. Photogramm. Eng. Remote Sens. 1999, 65, 1033–1039. [Google Scholar]
  10. Loveland, T.; Reed, B.; Brown, J.; Ohlen, D.; Zhu, Z.; Yang, L.; Merchant, J. Development of a global land cover characteristics database and IGBP discover from 1 km AVHRR data. Int. J. Remote Sens. 2000, 21, 1303–1330. [Google Scholar] [CrossRef]
  11. Foody, G.M. Status of land cover classification accuracy assessment. Remote Sens. Environ. 2002, 80, 185–201. [Google Scholar] [CrossRef]
  12. Brown, K.M.; Foody, G.M.; Atkinson, P.M. Modelling geometric and misregistration error in airborne sensor data to enhance change detection. Int. J. Remote Sens. 2007, 28, 2857–2879. [Google Scholar] [CrossRef]
  13. Eastman, R.D.; Le Moigne, J.; Netanyahu, N.S.; IEEE. Research issues in image registration for remote sensing. In Proceedings of the 2007 IEEE Conference on Computer Vision and Pattern Recognition (CVPR'07), Minneapolis, MN, USA, 17–22 June 2007; Volume 1–8, pp. 3233–3240.
  14. Congalton, R.G.; Green, K. Assessing the Accuracy of Remotely Sensed Data: Principles and Practices; CRC Press/Taylor & Francis: New York, NY, USA, 2009. [Google Scholar]
  15. Jensen, J.R. Introductory Digital Image Processing: A Remote Sensing Perspective; Prentice-Hall Inc.: Upper Saddle River, NJ, USA, 1996. [Google Scholar]
  16. Dai, X.L.; Khorram, S. The effects of image misregistration on the accuracy of remotely sensed change detection. IEEE Trans. Geosci. Remote Sens. 1998, 36, 1566–1577. [Google Scholar] [CrossRef]
  17. McRoberts, R.E. The effects of rectification and global positioning system errors on satellite image-based estimates of forest area. Remote Sens. Environ. 2010, 114, 1710–1717. [Google Scholar] [CrossRef]
  18. Cooke, W.H. Forest/non-forest stratification in Georgia with Landsat Thematic Mapper data. In Proceedings of the First Annual Forest Inventory and Analysis Symposium, San Antonio, TX, USA, 2–3 November 1999; McRoberts, R.E., Reams, G.A., van Deusen, P.C., Eds.; Gen. Tech. Rep. NC-213. U.S. Department of Agriculture, Forest Service, North Central Research Station: St. Paul, MN, USA, 2000; pp. 28–30. [Google Scholar]
  19. Powell, R.L.; Matzke, N.; de Souza, C.; Clark, M.; Numata, I.; Hess, L.L.; Roberts, D.A. Sources of error in accuracy assessment of thematic land-cover maps in the Brazilian Amazon. Remote Sens. Environ. 2004, 90, 221–234. [Google Scholar] [CrossRef]
  20. Smith, J.H.; Stehman, S.V.; Wickham, J.D.; Yang, L.M. Effects of landscape characteristics on land-cover class accuracy. Remote Sens. Environ. 2003, 84, 342–349. [Google Scholar] [CrossRef]
  21. Le Moigne, J.; Netanyahu, N.S.; Eastman, R.D. Image Registration for Remote Sensing; Cambridge University Press: New York, NY, USA, 2011. [Google Scholar]
  22. Roy, D.P. The impact of misregistration upon composited wide field of view satellite data and implications for change detection. IEEE Trans. Geosci. Remote Sens. 2000, 38, 2017–2032. [Google Scholar] [CrossRef]
  23. Hua-Mei, C.; Varshney, P.K.; Arora, M.K. Performance of mutual information similarity measure for registration of multitemporal remote sensing images. IEEE Trans. Geosci. Remote Sens. 2003, 41, 2445–2454. [Google Scholar] [CrossRef]
  24. Townshend, J.R.G.; Justice, C.O.; Gurney, C.; McManus, J. The impact of misregistration on change detection. IEEE Trans. Geosci. Remote Sens. 1992, 30, 1054–1060. [Google Scholar] [CrossRef]
  25. Carmel, Y.; Dean, D.J.; Flather, C.H. Combining location and classification error sources for estimating multi-temporal database accuracy. Photogramm. Eng. Remote Sens. 2001, 67, 865–872. [Google Scholar]
  26. Zhang, M.; Lin, H.; Zeng, S.; Li, J.; Shi, J.; Wang, G. Impacts of plot location errors on accuracy of mapping and scaling up aboveground forest carbon using sample plot and Landsat TM data. IEEE Geosci. Remote Sens. 2013, 10, 1483. [Google Scholar] [CrossRef]
  27. Chen, G.; Zhao, K.; Powers, R. Assessment of the image misregistration effects on object-based change detection. ISPRS J. Photogramm. Remote Sens. 2014, 87, 19–27. [Google Scholar] [CrossRef]
  28. Carmel, Y. Controlling data uncertainty via aggregation in remotely sensed data. IEEE Geosci. Remote Sens. 2004, 1, 39–41. [Google Scholar] [CrossRef]
  29. Carmel, Y. Aggregation as a means of increasing Thematic map accuracy. In 7th International Conference on GeoComputation, Southampton, UK, 8–10 September 2005; pp. 29–38.
  30. Chen, J.; Zhu, X.; Imura, H.; Chen, X. Consistency of accuracy assessment indices for soft classification: Simulation analysis. ISPRS J. Photogramm. Remote Sens. 2010, 65, 156–164. [Google Scholar] [CrossRef]
  31. Saura, S.; Martínez-Millán, J. Landscape patterns simulation with a modified random clusters method. Landsc. Ecol. 2000, 15, 661–678. [Google Scholar] [CrossRef]
  32. Li, X.; He, H.S.; Wang, X.; Bu, R.; Hu, Y.; Chang, Y. Evaluating the effectiveness of neutral landscape models to represent a real landscape. Landsc. Urban. Plan. 2004, 69, 137–148. [Google Scholar] [CrossRef]
  33. Hagen-Zanker, A. An improved Fuzzy Kappa statistic that accounts for spatial autocorrelation. Int. J. Geogr. Inf. Sci. 2009, 23, 61–73. [Google Scholar] [CrossRef]
  34. Emilio Rafael, D.-V.; Manuel Francisco, M.-P.; Pedro, Á.-Á. Use of simulated and real data to identify heterogeneity domains in scale-divergent forest landscapes. For. Ecol. Manag. 2009, 258, 2490–2500. [Google Scholar]
  35. Saura, S. Simmap 2.0, landscape categorical spatial patterns simulation software. User’s Manual. Available online: http://www2.montes.upm.es/personales/saura/software.html#simmap (accessed on 8 August 2003).
  36. Patterson, P.L.; Williams, M.S. Effects of registration errors between remotely sensed and ground data on estimators of forest area. For. Sci. 2003, 49, 110–118. [Google Scholar]
  37. Goodchild, M.F. Integrating GIS and remote sensing for vegetation analysis and modeling: Methodological issues. J. Veg. Sci. 1994, 5, 615–626. [Google Scholar] [CrossRef]
  38. Riemann, R.; Hoppus, M.; Lister, A. Using arrays of small ground sample plots to assess the accuracy of Landsat TM-derived forest-cover maps. In Proceedings of the 4th International Symposium on Spatial Accuracy Assessment in Natural Resources and Environmental Sciences, Amsterdam, The Netherlands, 12–14 July 2000; pp. 541–548.
  39. Congalton, R.G. Thematic and positional accuracy assessment of digital remotely sensed data. In Proceedings of the 7th Annual Forest Inventory and Analysis Symposium, Portland, ME, USA, 3–6 October 2005.
  40. Zhou, W.; Cadenasso, M.L. Effects of patch characteristics and within patch heterogeneity on the accuracy of urban land cover estimates from visual interpretation. Landsc. Ecol. 2012, 27, 1291–1305. [Google Scholar] [CrossRef]
  41. Stehman, S.V. Basic probability sampling designs for thematic map accuracy assessment. Int. J. Remote Sens. 1999, 20, 2423–2441. [Google Scholar] [CrossRef]
  42. Stehman, S.V.; Wickham, J.D.; Fattorini, L.; Wade, T.D.; Baffetta, F.; Smith, J.H. Estimating accuracy of land-cover composition from two-stage cluster sampling. Remote Sens. Environ. 2009, 113, 1236–1249. [Google Scholar] [CrossRef]
  43. Plourde, L.; Congalton, R.G. Sampling method and sample placement: How do they affect the accuracy of remotely sensed maps? Photogramm. Eng. Remote Sens. 2003, 69, 289–297. [Google Scholar] [CrossRef]

Share and Cite

MDPI and ACS Style

Gu, J.; Congalton, R.G.; Pan, Y. The Impact of Positional Errors on Soft Classification Accuracy Assessment: A Simulation Analysis. Remote Sens. 2015, 7, 579-599. https://0-doi-org.brum.beds.ac.uk/10.3390/rs70100579

AMA Style

Gu J, Congalton RG, Pan Y. The Impact of Positional Errors on Soft Classification Accuracy Assessment: A Simulation Analysis. Remote Sensing. 2015; 7(1):579-599. https://0-doi-org.brum.beds.ac.uk/10.3390/rs70100579

Chicago/Turabian Style

Gu, Jianyu, Russell G. Congalton, and Yaozhong Pan. 2015. "The Impact of Positional Errors on Soft Classification Accuracy Assessment: A Simulation Analysis" Remote Sensing 7, no. 1: 579-599. https://0-doi-org.brum.beds.ac.uk/10.3390/rs70100579

Article Metrics

Back to TopTop