A Fully Automatic, Interpretable and Adaptive Machine Learning Approach to Map Burned Area from Remote Sensing

Stroppiana, Daniela; Bordogna, Gloria; Sali, Matteo; Boschetti, Mirco; Sona, Giovanna; Brivio, Pietro Alessandro

doi:10.3390/ijgi10080546


¹ Institute for Electromagnetic Sensing of the Environment (IREA), Consiglio Nazionale delle Ricerche, 20133 Milan, Italy

² Department of Civil and Environmental Engineering (DICA), Politecnico di Milano, 20133 Milan, Italy

* Author to whom correspondence should be addressed.

ISPRS Int. J. Geo-Inf. 2021, 10(8), 546; https://doi.org/10.3390/ijgi10080546

Submission received: 9 June 2021 / Revised: 6 August 2021 / Accepted: 11 August 2021 / Published: 13 August 2021

(This article belongs to the Special Issue Multi-Hazard Spatial Modelling and Mapping)


Abstract:

The paper proposes a fully automatic approach to map burned areas from remote sensing, characterized by human-interpretable mapping criteria and explainable results. This approach is partially knowledge-driven and partially data-driven. It exploits active fire points to train the fusion function of factors deemed influential in determining the evidence of burned conditions from reflectance values of multispectral Sentinel-2 (S2) data. The fusion function is used to compute a map of seeds (burned pixels) that are adaptively expanded by applying a Region Growing (RG) algorithm to generate the final burned area map. The fusion function is an Ordered Weighted Averaging (OWA) operator, learnt through the application of a machine learning (ML) algorithm from a set of highly reliable fire points. Its semantics are characterized by two measures, the degrees of pessimism/optimism and democracy/monarchy. The former predicts whether the results of the fusion are affected by more false positives (commission errors) than false negatives (omission errors), in the case of pessimism, or vice versa; the latter indicates whether only a few highly influential factors or many weakly influential ones determine the result. The predicted degree of pessimism/optimism allows the expansion of the seeds to be appropriately tuned by selecting the most suitable growing layer for the RG algorithm, thus adapting the algorithm to the context. The paper illustrates the application of the automatic method in four study areas in southern Europe to map burned areas for the 2017 fire season. Thematic accuracy at each site was assessed by comparison to reference perimeters to prove the adaptability of the approach to the context; estimated average accuracy metrics are omission error = 0.057, commission error = 0.068, Dice coefficient = 0.94 and relative bias = 0.0046.

Keywords:

interpretable machine learning; OWA operators; wildfires; mapping burned areas; explainable fusion

1. Introduction

Data science comprises methods and techniques such as machine learning, statistics, data mining, pattern recognition and “soft” computing for discovering knowledge in the form of both patterns and relationships from large volumes of data, in order to understand actual phenomena [1,2]. These technologies are particularly suited for processing satellite and aerial remote sensing imagery. Disturbance phenomena such as wildfires, flooding and landslides can be identified as abrupt changes in the surface conditions induced by a rapid and unforeseen event. Thus, thematic mapping of remotely sensed data relies on the spatio-temporal patterns of the target surface, to be extracted from both spectral signatures and changes in those signatures. Operational monitoring of environmental systems requires quick and efficient methods based on large-scale data, readily available to the agencies [3], and it calls for automatic algorithms able to extract information from big data. In this work, we propose a fully automatic, interpretable and adaptive data fusion algorithm applied to the case study of burned area (BA) mapping from remotely sensed data. Interpretability is an important property of automatic systems in order to gain the trust of those who have to use their results: the mapping criteria that lead to a certain result should be understandable, so as to learn more about the phenomenon. This concern is expressed in the recent Presidential Address (February 2021) published in the Geological Society of America (GSA) magazine “GSA Today”, which cites four principles that data processing should satisfy, one of which is “I can’t use something I don’t understand”.

In recent years, machine learning (ML), and specifically deep learning, has become very successful in environmental sciences [4,5,6] including mapping and monitoring the status of the territory and changes induced by disturbances, such as wildfires [7], thanks to the high accuracy of the predictions [8,9,10]. Nevertheless, the criteria it learns and applies, besides requiring unbiased big data for training that are not always available, as in our case, can be hardly explainable to both experts and decision makers due to its black-box nature. Furthermore, ex-post explainability of ML is considered an essential aspect of the European Union’s way to Artificial Intelligence [11], in line with the European General Data Protection Regulation (GDPR) [12,13,14] that restricted the use of black-box ML [15]. Specifically, GDPR promotes transparency of automated systems, by requiring that they provide meaningful information about the logics, and a justification of outcomes in order to enable understanding [16]. Although some attempts have been made to create methods for explaining black-box deep learning, the way forward is to design models that are inherently interpretable [17].

Therefore, the approach proposed here is an interpretable and explainable method based on a ML algorithm that exploits active fire points as a small training set for learning an Ordered Weighted Averaging (OWA) operator defined within fuzzy set theory [18]; this operator is used for the fusion of multiple partial evidence maps of burned areas, each one derived from diverse factors identified based on expert’s knowledge.

The approach is applied to map burned areas in Mediterranean ecosystems. Wildfires are a complex process caused by the simultaneous occurrence of several interrelated factors (e.g., ignition source, fuel characteristics and composition, weather conditions and topography) [10]. Fire monitoring from remotely sensed data is mainly carried out by observing two distinct surface conditions: the presence of a fire front and the area affected by a fire. The fire front can be captured at the time of satellite overpass and detected as thermal hot spot; we refer to it as active fire (AF) or hot spot points. The area affected by a fire shows a change in the vegetation cover and/or in the ground surface: we refer to it as burned areas (BAs) [19]. Active fires are useful indicators of fire presence and fire timing but they do not provide a direct estimate for the total burned area [20].

Remote Sensing (RS) technology has been proven to provide key source data for monitoring and modelling complex disturbance phenomena affecting the environment, e.g., wildfires, insects and disease [21], thanks to its capability of adapting to changing environmental conditions [22]. Several satellite missions carry on-board sensors specifically designed for monitoring fires and thermal anomalies (e.g., NOAA-AVHRR, NASA Terra&Aqua MODIS, NASA/NOAA VIIRS) which, combined with other systems for Earth Observation (e.g., Landsat and Sentinel missions), provide data with variable spatial, temporal and spectral resolutions. In particular, data acquired by both Landsat and Sentinel missions have recently been widely exploited for fire monitoring because of their medium spatial resolution (10 to 30 m), which allows small and fragmented burned areas to be mapped. Moreover, their free availability opens unprecedented opportunities to the scientific community [23]. However, although Landsat missions offer the longest archive of medium resolution RS data, their 16-day revisit cycle, which is often lengthened by cloud cover, could be a limitation for fire monitoring. The multispectral instrument (MSI) onboard the Sentinel-2 A and B (S2) satellites offers enhanced spatial and temporal resolutions, suitable for burned area mapping [24]; in fact, if both S2 satellites are combined, the global median revisit time interval has been estimated to be 3.7 days [25].

Our approach incorporates a region growing (RG) algorithm [26] adaptively tuned to automatically map burned areas from imagery by exploiting spectral change in surface reflectance induced by the fire in the visible (VIS) to near and short infrared (NIR, SWIR) wavelength domain. In this context, a fuzzy multi-criteria approach is applied to fuse degrees of partial evidence derived from pixel reflectance values in the S2 bands by means of OWA operators to derive a single value of global evidence of burn. OWAs can model distinct aggregation strategies through their weighting vector applied to the input values reordered by their magnitude. Different OWAs lead to distinct burned area maps depicting the phenomenon with variable rates of accuracy: underestimation (false negative/omission error) and/or overestimation (false positive/commission error) errors are a function of several factors, including site and fire characteristics. Hence, OWAs applying different fusion strategies can be used to extract seed and growing layers to tune the RG algorithm, which relies on binding conditions for the selection of the seeds while conditions for identifying candidate boundaries (i.e., the limits for the region growing) can be looser. A RG algorithm expands seed pixels over pixels with low burn signal but connected to more reliable seeds. Indeed, in digital image processing, RG is a segmentation algorithm that exploits spatial adjacency of pixels with similar characteristics to create clusters, and it has been widely used for thematic classification of satellite images [20,27,28,29,30].

The approach proposed here is a hybridization of our previous proposal [31], which incorporates an adaptation mechanism based on a ML algorithm, partially knowledge-driven and partially data-driven, defined in Goffi et al. [32] to map standing water areas, in order to fully automate the mapping process. The novel contribution is the automated adaptation mechanism for the generation of both the seed and growing layers used by the RG algorithm. Hence, we propose: (i) to exploit a small training set of fire points to automatically define the OWA for computing the seed layer, and (ii) to automatically choose the OWA for the generation of the growing layer by exploiting heuristic rules, based on the interpretation of the decision attitude of the OWAs, defined to minimize the final error in the map produced by the RG algorithm.

This attitude is modelled by quantifying two dimensions: pessimism/optimism and democracy/monarchy. Pessimism is the tendency of the OWA to generate more commission errors than omission errors, and vice versa in the case of optimism. Democracy indicates whether the fused result is determined by many weakly influential factors or by only a few highly influential ones. In fact, democracy corresponds to considering all input data (i.e., the factors deemed influential by the expert to map the undesired status of burned area) as necessary to determine the fused maps, whereas monarchy corresponds to considering just a few of the inputs. A democratic fusion suggests that the mapped burned areas are identified thanks to all factors exhibiting low evidence, while a monarchical fusion indicates that the burned areas are identified thanks to a few factors providing high evidence. We state that, when the fusion is democratic, the patterns of the burned areas are more homogeneous with respect to the factors than in the case of monarchy, when the influential factors may be very different from region to region.

Since the OWA is learnt from training data, we can explain ex-post its proneness to generating commission/omission errors, and in taking into account all low influential or a few high influential factors (i.e., partial evidence of burn). This information is exploited to generate the growing layer so as to tune the RG algorithm that expands the seeds to generate the final burned area map, thus adapting the processing to site characteristics. Furthermore, this automatic adaptation can take place even when reducing or changing the input factors, without the need of human intervention.

The algorithm is applied to map burned areas at four distinct sites in southern Europe during the 2017 summer fire season. The accuracy of the output BA map is assessed by comparison with reference fire perimeters. Validation, besides demonstrating the high accuracy achieved by the algorithm, also demonstrates that the predictions provided by the learnt OWA operators in terms of degrees of pessimism/democracy can be appropriately used to tune the RG algorithm so as to maximize accuracy of the final map.

2. Materials

As case study, for applying the BA mapping algorithm from Sentinel-2 (S2) satellite images, we selected four sites in southern Europe where large fires occurred in 2017, a severe fire year for the European continent, due to abnormal droughts and heat waves [33]. Extreme weather conditions led to large fires affecting, in particular, Portugal, Spain, southern France, Greece and Italy [34]. In particular, we selected sites in Spain and Greece as shown in Figure 1. The geospatial dataset is composed of (i) S2 images, (ii) active fire points and (iii) reference burned area perimeters.

The algorithm, exploiting S2 spectral bands as input, delivers BA maps depicting the total area affected by fires. The S2 multispectral instrument (MSI) measures the Earth’s reflected radiance in 13 spectral bands from VIS/NIR to SWIR with a spatial resolution ranging from 10 m to 60 m [35]. Table 1 reports the characteristics of the MSI reflectance bands of interest for this study. Spectral bands in Table 1 were selected as potential inputs for the BA algorithm; in a previous work [31] each reflectance band and the temporal difference Δ between post-fire and pre-fire reflectance were analyzed to identify the most suitable ones for discriminating burned and unburned surfaces: post-fire S2 reflectance in Red-Edge and NIR (RE2, RE3, NIR) and temporal difference Δ between pre-fire and post-fire S2 reflectance in the same bands and SWIR2 (ΔRE2, ΔRE3, ΔNIR, ΔSWIR2).

Over each site, a pair of clear-sky S2 images was selected before (pre-fire date) and after (post-fire date) major fire events (Figure 2, Table 2). The sooner the image is acquired after the fire event, the easier the detection, because the spectral signature of burn, a consequence of the effect of fire on vegetation, is stronger. For this reason, in general, we select the first available clear image after the event [36]. In this study, the temporal difference between pre- and post-fire images is on average 20 days.

S2 images were downloaded as Level 1C from Copernicus Open Access Hub [37] and processed with the Sen2r [38] toolbox developed in R, released under the GNU General Public License version 3 (GPL-3) and available on GitHub [39]. The Sen2r toolbox makes available functions to process Level-1C images for atmospheric correction to deliver Bottom of Atmosphere (BOA) reflectance images in the VIS-NIR-SWIR wavelengths (S2 bands 2 to 12) at 10 m spatial resolution (after resampling of the lower spatial resolution SWIR spectral bands available at 20 m).

In this study, a burned area represents the area affected by a fire that shows a change in vegetation cover and/or in ground surface that can be detected by RS data [40]; hence, a BA map is a geo-spatial product, generally representing a binary thematic information (burned/unburned) as grid/raster or vector/polygon format, and delivered by the algorithm that takes S2 images as input.

Active fires (AFs) were downloaded from the Fire Information for Resource Management System (FIRMS, https://earthdata.nasa.gov/earth-observation-data/near-real-time/firms, last access 1 July 2021) and used as training data. FIRMS distributes Near Real-Time (NRT) and archived active fire data from NASA’s Moderate Resolution Imaging Spectroradiometer (MODIS), aboard the Terra and Aqua satellites [41], and NASA’s Visible Infrared Imaging Radiometer Suite (VIIRS), aboard the joint NASA/NOAA Suomi National Polar-orbiting Partnership (Suomi NPP) [42]. Both MODIS and VIIRS fire datasets are accompanied by a layer of detection confidence for each individual fire, ranging from 0 to 100%. MODIS and VIIRS products were subset to extract fires detected between the S2 pre-fire and post-fire dates for each study site (Figure 2).

At the four sites, reference fire perimeters were downloaded from the Copernicus Emergency Management Service (EMS). EMS delivers on-demand and near-real time (hours/days) geospatial information in support of emergency management activities: this information is derived from processing and analysis of satellite imagery acquired immediately after natural or man-made disasters such as floods, droughts and forest fires. EMS delivers ready-to-print maps and geographic datasets (vector package) for fire perimeter and fire damage grading (burn severity) derived from very high-resolution (VHR) multispectral images (https://emergency.copernicus.eu/mapping/list-of-activations-rapid, last access 1 July 2021). In the EMS products, the reference date is the date of the VHR image acquired for producing the delineation and damage mapping. Table 2 reports the dates of the S2 pre- and post-fire image pairs and of the fire reference perimeters. EMS fire perimeters were used as reference/ground truth for the validation of the BA maps produced by the algorithm.

3. Methods

The multi-criteria data-fusion approach proposed in this work builds on fuzzy sets theory to aggregate information provided by a set of input features, or contributing factors, by applying Ordered Weighted Averaging Operators (OWA) [18]. The approach implements a fusion function through an aggregation of multiple inputs to provide a reliable evaluation of the target phenomenon by modelling a reinforcement of evidence provided by redundant and complementary information.

In the case study of BA mapping, the aggregation of S2 bands and their temporal differences (Δ) can provide a reliable evaluation of the occurrence of fire, based on both the spectral signature of burned areas and the spectral reflectance change induced by the effect of a fire on the vegetated surface. Hence, the input features are the seven spectral bands and differences identified by Sali et al. [31] as providing the greatest separability between burned and unburned surfaces: RE2, RE3, NIR, ΔRE2, ΔRE3, ΔNIR, ΔSWIR2 (Table 1). Spectral bands and differences are interpreted by membership functions (MFs) of the fuzzy sets, which assign to each pixel a membership degree (MD) in [0, 1] that is the partial evidence of burn brought by a single input: the closer the value to 1, the greater the evidence of burn. This step normalizes the domains of all input factors to a common domain, so as to make them comparable and consistent, while at the same time enhancing the signal of burned conditions. MFs can be defined with different approaches according to available expertise and training data [43,44,45]; in this study, we exploited MFs from our previous work, defined as parameterized sigmoid-shaped functions from training data [31].
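As a minimal sketch of how a sigmoid-shaped MF can turn a feature value into a membership degree, the snippet below assumes a logistic form with a midpoint and a slope. The function name and the parameter values are hypothetical and chosen for illustration only; they are not the parameters fitted in [31].

```python
import math


def sigmoid_mf(x, midpoint, slope):
    """Logistic membership function returning a degree in (0, 1).

    A negative slope makes low feature values (e.g., low post-fire NIR
    reflectance) yield high evidence of burn.
    """
    return 1.0 / (1.0 + math.exp(-slope * (x - midpoint)))


# Hypothetical parameters for post-fire NIR reflectance (not from the paper).
NIR_MIDPOINT = 0.20
NIR_SLOPE = -40.0

md_dark = sigmoid_mf(0.08, NIR_MIDPOINT, NIR_SLOPE)    # dark pixel, strong evidence
md_bright = sigmoid_mf(0.35, NIR_MIDPOINT, NIR_SLOPE)  # bright pixel, weak evidence
```

With these illustrative settings, a dark pixel receives an MD close to 1 and a bright pixel an MD close to 0, which is the normalization to a common [0, 1] domain described above.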

The MD values are aggregated by means of a fusion function defined as an OWA operator to provide a synthetic score of global evidence of burn, as brought by redundant/complementary inputs. OWAs can model different attitudes/semantics. Global evidence of burned areas obtained with different OWAs (e.g., ranging between the extreme conditions of the minimum and maximum operators) is used as seed (OWA_seed) and grow (OWA_grow) layers by the Region Growing (RG) algorithm, which exploits spatial connectivity of burned pixels. Here, we present a new formalization of the BA algorithm where the input to the RG algorithm can be automatically generated from training AF points.
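The interplay between seed and grow layers can be sketched as a breadth-first expansion. This is an illustrative sketch only: the thresholds `seed_thr` and `grow_thr`, the 4-connectivity and the toy rasters are assumptions made here, not the settings used in the paper.

```python
from collections import deque


def region_grow(seed_layer, grow_layer, seed_thr=0.9, grow_thr=0.5):
    """Expand seed pixels over 4-connected pixels that pass the grow test.

    seed_layer, grow_layer: equally sized 2D lists of evidence in [0, 1].
    Thresholds are hypothetical illustration values.
    Returns a 2D list of booleans (True = burned).
    """
    rows, cols = len(seed_layer), len(seed_layer[0])
    burned = [[False] * cols for _ in range(rows)]
    queue = deque()
    # Strict condition: only pixels with high evidence become seeds.
    for r in range(rows):
        for c in range(cols):
            if seed_layer[r][c] >= seed_thr:
                burned[r][c] = True
                queue.append((r, c))
    # Looser condition: neighbours of burned pixels join if the grow layer allows.
    while queue:
        r, c = queue.popleft()
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < rows and 0 <= nc < cols and not burned[nr][nc]:
                if grow_layer[nr][nc] >= grow_thr:
                    burned[nr][nc] = True
                    queue.append((nr, nc))
    return burned


# Toy 3 x 4 rasters (hypothetical values).
seed = [[0.95, 0.2, 0.1, 0.1],
        [0.30, 0.2, 0.1, 0.1],
        [0.10, 0.1, 0.1, 0.3]]
grow = [[0.95, 0.7, 0.2, 0.1],
        [0.60, 0.6, 0.1, 0.1],
        [0.10, 0.1, 0.1, 0.8]]
mask = region_grow(seed, grow)
```

In this toy case the pixel in the bottom-right corner has a high grow value but is not connected to any seed, so it stays unburned, whereas the pixels adjacent to the seed are absorbed.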

3.1. Ordered Weighted Averaging Operators (OWA)

An OWA of dimension N and weighting vector W = [w₁, …, w_N], with ∑_{i = 1, …, N} w_i = 1, aggregates N input values [d₁, …, d_N] and computes an aggregated value a in [0, 1] as follows [46]:

OWA : [0, 1]^{N} \to [0, 1], \quad a = OWA([d_{1}, \dots, d_{N}]) = \sum_{i = 1}^{N} w_{i} \, g_{i}

(1)

in which g_i is the i-th largest value of d₁, …, d_N. In this case study, [d₁, …, d_N] are the MDs of the seven input factors (N = 7): RE2, RE3, NIR, ΔRE2, ΔRE3, ΔNIR, ΔSWIR2.

Input values d₁, …, d_N are rearranged from the greatest to the smallest; reordering is a fundamental step of OWA operators, meaning that a specific weight w_i is not univocally associated with the specific i-th input, but rather with the i-th position of the reordered inputs. In the case of BA mapping, OWAs can adapt to burned surface spectral characteristics, which can vary with site characteristics, by selecting pixel by pixel the input feature that brings the greatest evidence of burn. Different weighting vectors W lead to different OWAs, including Max, Min and arithmetic mean operators:

$W_{AND} = [0, \dots, 0, 1]$	$OWA_{AND}([d_{1}, \dots, d_{N}]) = \min \{d_{1}, \dots, d_{N}\}$
$W_{OR} = [1, 0, \dots, 0]$	$OWA_{OR}([d_{1}, \dots, d_{N}]) = \max \{d_{1}, \dots, d_{N}\}$
$W_{AlmostAND} = [0, \dots, 0, 0.5, 0.5]$	$OWA_{AlmostAND}([d_{1}, \dots, d_{N}]) = \frac{1}{2} \min \{d_{1}, \dots, d_{N}\} + \frac{1}{2} \min (\{d_{1}, \dots, d_{N}\} \setminus \{\min \{d_{1}, \dots, d_{N}\}\})$
$W_{Average} = [\frac{1}{N}, \dots, \frac{1}{N}]$	$OWA_{Average}([d_{1}, \dots, d_{N}]) = \frac{1}{N} \sum_{j = 1}^{N} d_{j}$
$W_{AlmostOR} = [0.5, 0.5, 0, \dots, 0]$	$OWA_{AlmostOR}([d_{1}, \dots, d_{N}]) = \frac{1}{2} \max \{d_{1}, \dots, d_{N}\} + \frac{1}{2} \max (\{d_{1}, \dots, d_{N}\} \setminus \{\max \{d_{1}, \dots, d_{N}\}\})$

It can be proved that OWA operators satisfy commutativity, monotonicity and idempotency and are bounded by Max and Min operators:

Min([d₁, …, d_N]) ≤ OWA([d₁, …, d_N]) ≤ Max([d₁, …, d_N])

(2)
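Equation (1), the five named weighting vectors and the bounds in (2) can be illustrated with a short sketch. The MD values of the example pixel are hypothetical.

```python
def owa(values, weights):
    """Ordered Weighted Averaging: weights apply to values sorted descending."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1")
    ordered = sorted(values, reverse=True)  # g_1 >= g_2 >= ... >= g_N
    return sum(w * g for w, g in zip(weights, ordered))


N = 7
W_AND = [0.0] * (N - 1) + [1.0]
W_OR = [1.0] + [0.0] * (N - 1)
W_ALMOST_AND = [0.0] * (N - 2) + [0.5, 0.5]
W_AVERAGE = [1.0 / N] * N
W_ALMOST_OR = [0.5, 0.5] + [0.0] * (N - 2)

# Hypothetical MDs of one pixel for RE2, RE3, NIR, dRE2, dRE3, dNIR, dSWIR2.
mds = [0.9, 0.8, 0.95, 0.4, 0.5, 0.7, 0.3]

results = {
    "AND": owa(mds, W_AND),
    "Almost_AND": owa(mds, W_ALMOST_AND),
    "Average": owa(mds, W_AVERAGE),
    "Almost_OR": owa(mds, W_ALMOST_OR),
    "OR": owa(mds, W_OR),
}
```

Because the weights are attached to positions in the sorted list rather than to specific bands, the same vector can favour different bands at different pixels.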

3.2. Semantics of Ordered Weighted Averaging Operators (OWA)

The semantics of an OWA operator with weighting vector W has been characterized by two measures [46]: the measures of orness and of dispersion. The measure of orness(W) ∈ [0, 1] is defined as follows:

orness(W) = \frac{1}{N - 1} \sum_{j = 1}^{N} (N - j) \, w_{j}

(3)

This measure characterizes the degree to which the aggregation is similar to an OR (Max) operator. Generally, in decision making, it is related to the tolerance of the decision maker, intended as his/her attitude to accept that only some criteria are satisfied, while intolerant decision makers demand that most or even all criteria are satisfied [47]. In other words, orness measures the degree to which the OWA operator has a disjunctive (OR-like) behaviour.

It can be shown that, when the input values d₁, ..., d_N are degrees of partial evidence of an undesired phenomenon from N distinct sources, i.e., the greater they are, the more severe the evidence, we can assess the following interpretation of orness in relation to the fusion attitude, in which the fusion function is regarded as a decision maker agent [18,48,49]:

orness[1, …, 0] = 1 indicates a pessimistic attitude of the fusion applied to minimize the risk of underestimating the spatial extent of a critical phenomenon (i.e., nothing is disregarded, any single source alone is trusted and taken into consideration to map the phenomenon extent);

orness[0, … 1] = 0 indicates an optimistic attitude of the fusion, applied to minimize false positives due to overestimation of the effects of a critical phenomenon (i.e., one wants to prioritize anomalies pointed out by all sources since any source alone is not trusted by itself);

orness[1/N, …, 1/N] = 0.5 indicates a balanced and neutral attitude towards over and under estimation of the phenomenon extent.

Notice that, in this interpretation, high values to aggregate are considered with a negative connotation.
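The three reference cases above can be checked numerically with a direct transcription of Equation (3); N = 7 is used as in this case study.

```python
def orness(weights):
    """Orness of an OWA weighting vector, Equation (3)."""
    n = len(weights)
    if n < 2:
        raise ValueError("orness needs at least two weights")
    return sum((n - j) * w for j, w in enumerate(weights, start=1)) / (n - 1)


N = 7
orness_or = orness([1.0] + [0.0] * (N - 1))    # full pessimism
orness_and = orness([0.0] * (N - 1) + [1.0])   # full optimism
orness_avg = orness([1.0 / N] * N)             # neutral attitude
```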

The dispersion measure can also be defined to qualify the semantics of an OWA operator depending on the form of the weighting vector and representing how much of the information in all the input values is used by an OWA. The idea behind its definition is that the greater the dispersion, the more democratic the aggregation of the correspondent OWA, since it uses information from more sources/factors. Several dispersion measures have been proposed [50], the first of which is based on the concept of information entropy of W:

dispersion(W) = - \sum_{i = 1}^{N} w_{i} \ln (w_{i})

(4)

This definition of dispersion is an entropy and satisfies the following properties:

Minimum value is obtained when w_i = 1 for some i, then dispersion(W) = 0,
Maximum value is obtained when w_i = 1/N for all i, then dispersion(W) = ln(N).
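Both properties can be verified with a small sketch of Equation (4). It assumes the usual convention that terms with w_i = 0 contribute nothing to the sum.

```python
import math


def dispersion(weights):
    """Entropy-based dispersion of W, Equation (4).

    Terms with w_i = 0 are skipped (convention: 0 * ln 0 = 0).
    """
    return -sum(w * math.log(w) for w in weights if w > 0.0)


N = 7
disp_one_hot = dispersion([0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0])  # a single factor counts
disp_uniform = dispersion([1.0 / N] * N)                         # all factors count equally
```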

3.3. Fusion Attitude based on Optimism and Democracy

As stated in the previous section, a fusion function can be regarded as an automatic decision maker agent, which can be pessimistic or optimistic on the impact of a critical or anomalous event such as a wildfire. When it is fully pessimistic, it requires the worst scenario to be identified, thus tolerating more false positive alarms (commission greater than omission errors) in order to be cautious and not to neglect any possible critical situation. Conversely, when it is fully optimistic, it means that it tolerates false negative (omission greater than commission errors), in order to analyze only the priorities that demand intervention. In between these two extreme cases, there is a continuum of blending of optimistic and pessimistic attitudes. Perfectly in the middle, we have a neutral attitude that equally balances optimism and pessimism. We can define a variable ps in [0, 1] to quantify the desired degree of pessimism of a fusion attitude that assumes values 1 or 0 in the cases of full pessimism or full optimism, respectively, and 0.5 in case of neutral attitude.

Another dimension of the fusion attitude is the level of democracy that it applies among the multiple input factors to determine the result. Democracy depends on both the number of the factors and the degree of evidence they provide, to determine the final result. In the case of maximum democracy, all factors are considered equally influential, while in the case of full monarchy, the rule of one drives to the result. In between these two extremes, we can have a blend of democracy and monarchy.

We define a variable dm in {1/N, 2/N, …, 1}, where N is the number of factors, to represent the degree of democracy of a fusion attitude, i.e., the fraction of factors it considers. When dm = 1/N, we have full monarchy (the rule of one), while dm = 1 corresponds to full democracy (one head, one vote); intermediate values x/N, with 1 < x < N, specify a blend of the two extremes.

If we consider an OWA with weighting vector W that fuses partial evidence of burn as a decision maker agent, we can define its attitude by the pair pessimism ps in [0, 1] and democracy dm in {1/N, 2/N, …1}. Pessimism is related to its orness (3) and democracy dm to the dispersion measure (4) as follows:

ps = orness(W)

(5)

dm = \exp(dispersion(W)) / N

(6)
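A minimal sketch of how the pair (ps, dm) could be computed from a weighting vector, following Equations (3) to (6). The example vector `w_example` is hypothetical and is not a result reported in the paper.

```python
import math


def attitude(weights):
    """Return (ps, dm) for an OWA weighting vector, following Eqs. (3)-(6)."""
    n = len(weights)
    # Pessimism: orness of W.
    ps = sum((n - j) * w for j, w in enumerate(weights, start=1)) / (n - 1)
    # Democracy: exponential of the entropy of W, divided by N.
    entropy = -sum(w * math.log(w) for w in weights if w > 0.0)
    dm = math.exp(entropy) / n
    return ps, dm


# Hypothetical learnt vector for N = 7 (illustration only).
w_example = [0.40, 0.30, 0.10, 0.10, 0.05, 0.05, 0.00]
ps, dm = attitude(w_example)
```

For this vector most of the weight sits on the largest MDs, so the attitude is fairly pessimistic (ps close to 0.8), while dm falls between full monarchy (1/N) and full democracy (1).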

In order to make it easier to understand the semantics of the OWA operators once we computed the pair (ps, dm), in Figure 3, we provide linguistic expressions that allow humans to interpret the correspondent attitudes of the OWA operators to generate more omissions/commissions errors in heterogeneous/homogenous areas. In the bi-dimensional space of pessimism (ps, rows) and democracy (dm, columns) distinct quadrants correspond to given attitudes.

Pessimism and optimism are determined by high and low degrees of evidence, respectively, of an undesired status of the environment due to wildfires: a high value is considered a pessimistic view of the status, i.e., something negative, while a low value is an optimistic view, something positive. Thus, an OR (AND) fusion is regarded as a pessimistic (optimistic) attitude since one trusts the most pessimistic (optimistic) criterion, and thus is prone to generate more commission than omission errors (vice versa).

Moreover, a democratic fusion function indicates that all factors are needed to capture the characteristics of all burned areas, meaning that these surfaces are likely to have homogeneous conditions. Conversely, if the fusion is monarchical, it means that each burned area may exhibit its own highly influential factor, and thus may have heterogeneous conditions.

3.4. Learning OWA Weighting Vector from Training Points

An OWA operator, i.e., its weighting vector W, can be learnt from data assumed as ground truth by applying a ML algorithm; training data can be used to this aim being a highly reliable evidence of the phenomenon under investigation. The OWA operator can be defined by iteratively minimizing errors between OWA results and training points.

Given K georeferenced training points in the map, a₁, …, a_K (in our case, points labelled as active fires, AFs), they are assumed as ground truth. By knowing their geographic coordinates, we can associate with each point a_k the MDs [a_1k, …, a_Nk] of the N input factors at the same coordinates, so that we obtain the following antecedent-consequent rules that must be satisfied:

a₁₁, ...., a_N1 → a₁
…
a_1K, ...., a_NK → a_K

(7)

In the case study of BA mapping, the values a₁, … a_K, are defined on a continuous scale [0, 1] to quantify the extent of the evidence of burn in the specific location (1 full evidence, 0 no evidence, and intermediate values mean partial evidence).

The learning mechanism starts at epoch L = 0 by assuming as initial OWA₀ operator the arithmetic mean (balanced and neutral attitude), which is defined by the weighting vector W₀ = [1/N, …, 1/N]. Then, at each epoch L, it iteratively determines the weighting vector W_L = [w_1L, …, w_NL] of OWA_L that minimizes the error between the results of its application to all the antecedents of the rules in (7) and the values a₁, …, a_K of the training points (i.e., the consequents of the rules). Formally, this is equivalent to applying the following rule:

W_L such that | Λ_i(L) − Λ_i(L + 1)| < ε ≈ 0 or L = L_max

(8)

where

Λ_i(L + 1) = Λ_i(L) − βw_iL (argmax_i(a_1k, ...., a_Nk) − OWA_L(a_1k, ...., a_Nk))∗(OWA_L(a_1k, ...., a_Nk) − a_k)

(9)

in which β ∈ (0, 1] is a learning rate parameter, and the i-th weighting vector element at epoch L is defined as follows:

w_{iL} = \frac{e^{\Lambda_{i}(L)}}{\sum_{j = 1}^{N} e^{\Lambda_{j}(L)}} \quad \forall i = 1, \dots, N

(10)

While in Goffi et al. [32] field observations of standing water were used as training points to learn the best OWA operator for a given site and directly map flooded areas, in this work, by exploiting AF points as training data, a vector W was learnt for each site and used (i) to quantify the fusion attitude site by site and (ii) to define the OWA for the generation of the seed layer for the RG algorithm.
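The update rule in (8) to (10) can be sketched as a small gradient-descent loop. Several points below are assumptions made for illustration: the term argmax_i in (9) is read as the i-th largest MD of the k-th training point (g_i in Equation (1)); Λ is updated after each training point within an epoch; the default values of β, ε and L_max, as well as the training MDs and targets, are hypothetical.

```python
import math


def softmax(lams):
    """Equation (10): map the parameters Lambda to weights that sum to 1."""
    m = max(lams)  # subtract the maximum for numerical stability
    exps = [math.exp(lam - m) for lam in lams]
    total = sum(exps)
    return [e / total for e in exps]


def learn_owa_weights(samples, targets, beta=0.35, eps=1e-6, max_epochs=500):
    """Learn an OWA weighting vector from K training points.

    samples: list of K lists with the N membership degrees of each point.
    targets: list of K values a_k in [0, 1].
    """
    n = len(samples[0])
    lams = [0.0] * n  # Lambda_i(0) = 0 gives W_0 = [1/N, ..., 1/N]
    for _ in range(max_epochs):
        previous = list(lams)
        for mds, target in zip(samples, targets):
            weights = softmax(lams)
            ordered = sorted(mds, reverse=True)
            estimate = sum(w * g for w, g in zip(weights, ordered))
            error = estimate - target
            # Equation (9), applied to every Lambda_i.
            lams = [lam - beta * w * (g - estimate) * error
                    for lam, w, g in zip(lams, weights, ordered)]
        # Stopping rule (8): parameters no longer change appreciably.
        if max(abs(a - b) for a, b in zip(lams, previous)) < eps:
            break
    return softmax(lams)


# Hypothetical MDs at three AF points and hypothetical evidence values.
af_mds = [[0.90, 0.80, 0.95, 0.40, 0.50, 0.70, 0.30],
          [0.60, 0.85, 0.70, 0.20, 0.90, 0.50, 0.40],
          [0.30, 0.50, 0.80, 0.75, 0.60, 0.20, 0.10]]
af_targets = [0.90, 0.85, 0.75]
learned = learn_owa_weights(af_mds, af_targets)
```

The softmax parameterization keeps the weights positive and normalized at every step, so no explicit constraint handling is needed. Targets close to the strongest MDs pull the weights towards the first positions (an OR-like, pessimistic operator), while targets close to the weakest MDs pull them towards the last positions.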

3.5. Workflow of the Automatic BA Mapping Algorithm

The workflow of the automatic BA algorithm is shown in Figure 4, in which the grey box highlights the novel part that has been embedded in our previous proposal [31]. Since the algorithm is applied at the pixel level, to each pixel independently of the others, we can refer to the input/output of any step of the workflow as rasters/layers/maps, intended as georeferenced and co-registered matrices of pixels. First, remote sensing data are collected for each study area. A set of features, identified as contributing factors to determine partial evidence of burn, is computed. In this case, the input features are spectral bands and temporal differences. These features are passed to membership functions of fuzzy sets, defined on the domain of the features, which compute degrees of evidence of burn in [0, 1] (i.e., MD for membership degree). Both the identification of the most suitable features and the definition of the membership functions were carried out in a previous study [31]. Since the same features and functions had been tested over the sites of the present experiment, there was no need to readapt them.

Once MDs are computed, the ML algorithm described in Section 3.4 is applied independently at each site by taking as training data the AF points within the site, and by using them to learn the OWA operator. The learnt OWA is subsequently applied to aggregate the MD values of the site to generate the seed layer for the RG algorithm. In order to choose the most appropriate growing layer, we exploit the information on the attitude of the learnt OWA operator by computing its degrees of pessimism (ps) and democracy (dm) as described in Section 3.3 by applying Formulae (3)–(4) and (5)–(6).
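As an illustration of how the attitude of a weighting vector can be quantified, the sketch below computes the standard orness and normalized dispersion measures of an OWA operator. It assumes that weights are listed from the one applied to the largest argument to the one applied to the smallest; the exact definitions of ps and dm are those given in Section 3.3 and may differ from these textbook forms, and the function names are hypothetical.

```python
import math

def orness(weights):
    """Standard orness: 1 for OR (max), 0 for AND (min), 0.5 for the mean."""
    n = len(weights)
    return sum((n - i) * w for i, w in enumerate(weights, start=1)) / (n - 1)

def normalized_dispersion(weights):
    """Shannon entropy of the weights divided by ln(N), so it lies in [0, 1]."""
    n = len(weights)
    entropy = -sum(w * math.log(w) for w in weights if w > 0)
    return entropy / math.log(n)

# Seven factors, as in the experiments described in this paper.
w_or = [1, 0, 0, 0, 0, 0, 0]    # all weight on the largest argument
w_and = [0, 0, 0, 0, 0, 0, 1]   # all weight on the smallest argument
w_avg = [1 / 7] * 7             # plain average

orness_or = orness(w_or)
orness_and = orness(w_and)
orness_avg = orness(w_avg)
dispersion_avg = normalized_dispersion(w_avg)
dispersion_or = normalized_dispersion(w_or)
```

With these definitions, an operator that concentrates all the weight on one position has zero dispersion, while the plain average has the maximum dispersion.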

While the knowledge of dm can be useful to the expert, in order to have an idea of the homogeneity or heterogeneity of the BAs’ features in each site, the degree of pessimism ps is used in the RG algorithm to select the growing layer (OWA_grow) that should minimize predicted errors by applying heuristic rules such as the following ones:

If ps > 0.75 then OWA_grow = OWA_{Almost_AND}
If 0.5 ≤ ps ≤ 0.75 then OWA_grow = OWA_Average
If 0.25 ≤ ps < 0.5 then OWA_grow = OWA_{Almost_OR}
If 0 ≤ ps < 0.25 then OWA_grow = OWA_OR

(11)

Hence, rules in (11) formalize a simplified choice of the OWA_grow to counter-balance the tendency to increase the commission/omission errors by containing the seeds’ expansion depending on the learnt OWA_seed. The rationale of this heuristic can be understood by ordering the OWA operators by the increasing value of their pessimism ps:

OWA_AND < OWA_{Almost_AND} < OWA_Average < OWA_{Almost_OR} < OWA_OR
0 < 0.08 < 0.5 < 0.92 < 1

(12)

When ps is closer to the neutral attitude (ps ≈ 0.5) than to the extreme ps = 1 corresponding to full pessimism, the prediction on the type of error is very uncertain, meaning that omission and commission are more or less balanced. In this case, to maintain a balanced behavior, OWA_grow is chosen as the OWA_Average. When ps is closer to full pessimism (ps = 1), it is more likely that the seed layer contains more commission than omission errors; hence, the expansion is contained by selecting an OWA_grow that is more optimistic than OWA_seed, i.e., OWA_{Almost_AND}. On the other hand, if ps is closer to full optimism (ps = 0), seeds are likely to be affected by more omission than commission, and thus they are maximally expanded in an attempt to decrease the omission by selecting OWA_OR. Since in our experiment we assumed that it is preferable to have more commission than omission, we introduced a rule for values of ps between 0.25 and 0.5, which hint at a slight prevalence of omission, so as to relax the expansion in an attempt to decrease the omission by selecting OWA_{Almost_OR}. Notice that these rules have been set a priori, based on the rationale, and without any tuning on experimental data.
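The rules in (11) amount to a simple threshold lookup, which can be sketched as follows. The function name and the string labels are hypothetical; the thresholds are those of (11).

```python
def select_grow_operator(ps):
    """Return the label of OWA_grow for a given degree of pessimism ps."""
    if not 0.0 <= ps <= 1.0:
        raise ValueError("ps must lie in [0, 1]")
    if ps > 0.75:
        return "Almost_AND"   # contain the expansion of the seeds
    if ps >= 0.5:
        return "Average"      # balanced behaviour
    if ps >= 0.25:
        return "Almost_OR"    # relax the expansion
    return "OR"               # maximal expansion

choices = {ps: select_grow_operator(ps) for ps in (0.9, 0.55, 0.4, 0.1)}
```

For example, ps = 0.55 selects OWA_Average and ps = 0.4 selects OWA_{Almost_OR}.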

After the selection of the most appropriate OWA_grow, the region growing (RG) algorithm is run in the Harris IDL language (https://www.l3harrisgeospatial.com/docs/region_grow.html, last access 1 July 2021). Region growing is a procedure that groups pixels or sub-regions into larger regions based on pre-defined criteria [26]. The RG algorithm is iterative: starting from initial seeds extracted from the OWA_seed layer (OWA_learn, in this case), it searches the eight-connected neighboring pixels and includes in the new seed layer for the next iteration only those pixels that satisfy the constraint OWA_grow > 0. Initial seeds are pixels with OWA_learn > 0.9. The output is a raster map with pixel value, RG_score, in [0, 1].
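The growing step can be illustrated with a short Python sketch. It is not the IDL routine used in this work: it is a simplified breadth-first expansion over plain nested lists, it returns a burned/unburned mask rather than the RG_score raster, and the function and parameter names are hypothetical. Only the two thresholds (seed > 0.9, grow > 0) and the eight-neighbor connectivity come from the text.

```python
from collections import deque

def region_grow(seed_layer, grow_layer, seed_threshold=0.9, grow_threshold=0.0):
    """Expand seed pixels over 8-connected pixels where grow_layer > threshold."""
    rows, cols = len(seed_layer), len(seed_layer[0])
    burned = [[False] * cols for _ in range(rows)]
    queue = deque()
    # Initial seeds: pixels whose seed-layer value exceeds the seed threshold.
    for r in range(rows):
        for c in range(cols):
            if seed_layer[r][c] > seed_threshold:
                burned[r][c] = True
                queue.append((r, c))
    # Offsets of the eight neighbours of a pixel.
    offsets = [(dr, dc) for dr in (-1, 0, 1) for dc in (-1, 0, 1)
               if (dr, dc) != (0, 0)]
    while queue:
        r, c = queue.popleft()
        for dr, dc in offsets:
            nr, nc = r + dr, c + dc
            inside = 0 <= nr < rows and 0 <= nc < cols
            if inside and not burned[nr][nc] and grow_layer[nr][nc] > grow_threshold:
                burned[nr][nc] = True  # accepted pixels seed the next iteration
                queue.append((nr, nc))
    return burned

# Toy rasters: one seed in the top-left corner; the pixels in the last
# column have positive evidence but are not connected to the seed.
seed = [[0.95, 0.0, 0.0, 0.0],
        [0.0, 0.0, 0.0, 0.0],
        [0.0, 0.0, 0.0, 0.0]]
grow = [[0.6, 0.4, 0.0, 0.3],
        [0.0, 0.2, 0.0, 0.5],
        [0.0, 0.0, 0.0, 0.0]]
mask = region_grow(seed, grow)
burned_pixels = {(r, c) for r, row in enumerate(mask)
                 for c, value in enumerate(row) if value}
```

In the toy example, the two pixels in the last column are left out even though their OWA_grow value is positive, because no path of connected pixels links them to a seed.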

3.6. Validation Metrics

Validation is the assessment of thematic accuracy of burned area maps, derived from the output raster of the RG algorithm. Over each site, the output RG_score rasters are converted to binary, burned/unburned maps, prior to comparison with reference EMS fire perimeters. Since EMS products are distributed as shapefile, rasterization is necessary for pixel by pixel comparison to build the error/confusion matrix (Table 3). An error matrix is a square array of numbers organized in rows and columns, which expresses the number of sample units (i.e., pixels, clusters of pixels, or polygons) assigned to a particular class relative to the actual class, as indicated by the reference data [51]. In RS literature, classification is generally a multi-class problem (e.g., land cover classification) and the error/confusion matrix is an array with the number of columns/rows > 2.

In the modelling literature, the evaluation of model forecast generates a 2 × 2 square matrix, where columns and rows are labelled as false/negative or true/positive occurrence; in this case, predicted values can be true positives (TP), false negatives (FN), true negatives (TN) and false positives (FP) [52].

BA mapping is a binary classification problem (burned/unburned) and the error/confusion matrix could be assimilated to the one proposed in the evaluation of modelling forecast, but we maintain the RS formalization as shown in Table 3 and widely used in the RS literature [53,54,55,56].

Various summary metrics can be derived from the error/confusion matrix; in this work we selected the following ones which are those commonly used in remote sensing: commission error, omission error, Dice Coefficient (DC) [57] and relative bias (Table 4).
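The four metrics can be computed from the cells of the confusion matrix as sketched below. Table 4 is not reproduced in this text, so the formulas are stated as assumptions: omission error, commission error and Dice coefficient follow their usual definitions, while the relative bias formula is inferred from the rows of Table A1, where it matches (FN − FP)/(TN + FP). The function name is hypothetical.

```python
def accuracy_metrics(tp, tn, fp, fn):
    """Summary metrics from the cells of a burned/unburned confusion matrix."""
    return {
        "oe": fn / (tp + fn),                    # omission error
        "ce": fp / (tp + fp),                    # commission error
        "dc": 2 * tp / (2 * tp + fp + fn),       # Dice coefficient
        # Assumption: form inferred from the values listed in Table A1.
        "relbias": (fn - fp) / (tn + fp),
    }

# Calar site, RG_learn_Average row of Table A1.
calar = accuracy_metrics(tp=282276, tn=1005800, fp=10195, fn=37615)
```

With the Calar RG_learn_Average counts, the function returns values that round to those listed in Table A1 (oe = 0.118, ce = 0.035, dc = 0.92, relbias = 0.027).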

4. Results

4.1. Learning the OWA Operator for Seed Layer Computation

The algorithm described in Section 3.5 and depicted in Figure 4 was applied to the four study sites. As stated above, from the pre- and post-fire S2 image pairs of each site, we selected the seven input features, RE2, RE3, NIR, ΔRE2, ΔRE3, ΔNIR, ΔSWIR2, which are converted to membership degrees (MDs) by applying the MFs to generate seven partial evidence maps. These maps are fused by applying two distinct OWA operators to generate OWA_seed and OWA_grow for the RG algorithm. In this case, OWA_seed = OWA_learn, which is defined at each site by applying the ML algorithm described in Section 3.4 exploiting AF points: its weighting vectors, W, are reported in Table 5 for each site. Results show that at all sites except Kalamos, the learnt operators tend towards a pessimistic attitude, generating more commission than omission errors. However, this prediction is highly uncertain because the value of ps is closer to the neutral attitude (ps = 0.5) than to the fully pessimistic attitude (ps = 1) (ps = 0.55, ps = 0.70 and ps = 0.54, respectively). In Kalamos, the prediction is that there is slightly more omission, even if it is highly uncertain too (ps = 0.4). Furthermore, at all sites except Calar, the learnt operator is nearly monarchical, thus exploiting few partial evidence maps to determine the result (i.e., the global evidence). In Calar, the nearly democratic operator combines all partial evidence degrees. This is also apparent by looking at the elements of the weighting vector W of Calar, which are never null (w_i > 0, ∀i = 1 to 7), while the other vectors have at least two null elements.

The OWA_grow layer was then chosen by applying the rules defined in (11), based on the values of pessimism ps in each site: at all the sites except Kalamos, predicted OWA_grow = OWA_Average, while in Kalamos predicted OWA_grow = OWA_{Almost_OR}.

4.2. Burned Area Mapping Accuracy

The RG algorithm was applied to automatically generate the final map of burned areas by masking out non-vegetated (bare soil and urban classes) and agricultural regions based on the Corine Land Cover map [58]. Output BA maps are validated by comparison with reference fire perimeters from the EMS products.

We also compared the results of the proposed fully automatic algorithm with those yielded by the semi-automatic algorithm proposed in Sali et al. [31], in which both the OWA_seed and OWA_grow layers were manually selected, testing several different combinations. In this comparative analysis, seeds were extracted from OWA_AND (OWA_AND > 0.9), to simulate the requirement of highest reliability of seed pixels identified by the most restrictive operator (i.e., implementing an AND/Min condition), and from OWA_learn, to simulate a fully automatic condition; for the grow layer, we tested OWA_Average, OWA_AlmostOR and OWA_OR. All RG output maps were compared to EMS fire reference perimeters for accuracy assessment: the confusion matrices and metrics are reported in Table A1 and Table A2. In the tables, the combination of OWA_seed and OWA_grow that achieves the greatest accuracy at each site is highlighted in bold; the same information is also reported in the last column of Table 5, together with the increase in the Dice coefficient (Δdc) brought by the best performing algorithm with respect to the fully automatic one, implemented with the predicted OWA_grow.

Finally, we also performed an ablation study by removing the RG algorithm and considering as direct result the map generated by OWA_learn in each site, and comparing it with the maps obtained by applying other OWAs, specifically, OWA_AND, OWA_AlmostAND, OWA_Average, OWA_AlmostOR and OWA_OR, with weighting vectors defined in Section 3.2. This study was performed to assess the utility of the RG algorithm, which is expected to reduce commission errors. Results are summarized by the accuracy metrics defined in Section 3.6, which are used to compare the experiments; metrics are also reported in graphical form in Figure 5 and discussed in Section 5.

By looking at Table 5, we can observe that the fully automatic algorithm proposed in this paper, using the predicted OWA_grow (Table 5, Predicted OWA), coincides with the best-performing semi-automatic algorithm in two areas (Huelva and Zakynthos). At both sites, the predicted OWA_grow = OWA_Average is also the manually selected operator yielding the best performance (dc > 0.9).

In the Calar site, predicted OWA_grow = OWA_Average, while the best performing semi-automatic version corresponds to OWA_grow = OWA_{Almost_OR}. In Kalamos, predicted OWA_grow = OWA_{Almost_OR}, while the best performing one corresponds to OWA_grow = OWA_OR. Nevertheless, there is a negligible difference in mapping accuracy, as quantified by the Dice coefficient; indeed, accuracy loss (Δdc) is 0.01 and 0.007 for Calar and Kalamos, respectively.

We can also notice that, in both of these two cases, the predicted OWA_grow operators have a smaller pessimism than the best performing ones. This suggests a revision of the heuristic rules which were set a priori, just based on the rationale and not on experimental tuning. We can conclude that the fully automatic algorithm performs equally or very close to the best semi-automatic algorithm with manual setting of both OWA_seed and OWA_grow for generating the seed and grow layers.

Figure 5 depicts accuracy metrics bar plots (commission, omission, Dice coefficient and relative bias) for the tested combinations of OWAs with (RG) and without RG (noRG); numeric values are summarized in Table A1 and Table A2.

The average Dice coefficient of the fully automatic algorithm over all sites is 0.94 ± 0.03 (± one standard deviation), which equals the average accuracy over all sites of the best performing semi-automatic algorithms in each site. They differ in the average relative bias, which is slightly positive, 0.004 (omission > commission), for the fully automatic algorithm, while the best performing algorithms, on average, produce a negative bias equal to −0.005. Average omission and commission are better balanced for the fully automatic algorithm (average oe = 0.057 and average ce = 0.068), while the best performing semi-automatic algorithm tends to generate more commission (average ce = 0.08) than omission (average oe = 0.044). These results show that the fully automatic algorithm has a more balanced behavior in terms of omission and commission than the best performing semi-automatic algorithm at each site, with equal accuracy.
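The averages quoted above can be checked against the per-site values of the fully automatic algorithm listed in Table A1 (RG_learn_Average for Calar, Huelva and Zakynthos; RG_learn_AlmostOR for Kalamos). The snippet below is only an arithmetic check on those rounded table values; the variable names are hypothetical, and small differences from the published averages are expected because the inputs are already rounded to three decimals.

```python
from statistics import mean, stdev

# Per-site metrics of the fully automatic algorithm, copied from Table A1.
automatic = {
    "Calar":     {"oe": 0.118, "ce": 0.035, "dc": 0.92},
    "Huelva":    {"oe": 0.068, "ce": 0.114, "dc": 0.91},
    "Kalamos":   {"oe": 0.031, "ce": 0.020, "dc": 0.97},
    "Zakynthos": {"oe": 0.009, "ce": 0.104, "dc": 0.94},
}

mean_oe = mean(site["oe"] for site in automatic.values())
mean_ce = mean(site["ce"] for site in automatic.values())
mean_dc = mean(site["dc"] for site in automatic.values())
# Sample standard deviation (n - 1 in the denominator) of the Dice coefficient.
std_dc = stdev(site["dc"] for site in automatic.values())
```

The resulting means agree with the reported average oe, ce and dc to within rounding, and the sample standard deviation of dc is about 0.03.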

The ablation study, i.e., the comparison of the results obtained with and without the RG algorithm, clearly shows that the RG algorithm is necessary to decrease commission errors. The first clear outcome is that using an RG algorithm in the fully automatic algorithm leads to both (i) a reduction in the errors and (ii) a lower variability in the accuracy metrics among the selected OWAs. The former is quantified by the Dice coefficient ranging in [0.91, 0.97] and [0.73, 0.93] with and without the RG algorithm, respectively.

At all the sites, we can observe a decrease in the Dice coefficient, dc, when not using the RG algorithm: the largest decrease occurs in the Calar site, Spain, where dc is 0.92 with RG, and 0.73 with noRG; in Huelva, dc is 0.91 with RG, and 0.88 with noRG; in Kalamos, dc is 0.97 with RG, and 0.93 with noRG; finally, in Zakynthos dc is 0.94 and 0.78 with RG and noRG, respectively. The reduced variability is confirmed by the lower standard deviation of the dc metric estimates, that decreases from 0.09, when not using RG, to 0.03 when RG is applied (Table A1).

Indeed, in the fully automatic version, the contextual conditions applied by the RG algorithm reduce commission error at all sites, by 5.2% in Huelva, 8.2% in Kalamos, 25.2% in Zakynthos and 34% in Calar. This decrease in commission does not necessarily come with an increase in omission: when applying RG at the Calar and Huelva sites, omission increases by only 0.4% and 0.1%, respectively, while in Kalamos and Zakynthos omission also decreases, with a reduction in oe equal to 0.3% and 0.1%, respectively.

Results confirm that the best mapping accuracy is achieved when region growing is applied as a way of balancing omission and commission errors [59]; in fact, region growing and contextual approaches are largely used in thematic mapping.

Overall, the site with lowest ce is Kalamos (ce = 2%); also at this site the greatest values for the Dice coefficient and relative bias metrics are obtained for the fully automatic algorithm (dc = 0.97, relB = 0.005).

Finally, by observing the results obtained without the RG algorithm, extreme optimistic conditions are depicted, consistently over all sites, by the noRG_AND and noRG_AlmostAND algorithms which, not surprisingly, deliver the greatest omission errors due to the restrictive condition applied by AND-like operators. This result was largely expected since AND-like operators implement a fusion strategy based on the selection of the minimum value of the global evidence of burn. Despite leading to a significant underestimation, the global evidence of these operators is highly reliable because of the restrictive conditions applied to fuse input features. Hence, OWA_AND could be chosen as an alternative source for seed points when no training is available [31]. Indeed, it can be observed that, at all the sites, the fully automatic algorithm achieves the same results as the manually set algorithm that generates the seed layer by OWA_AND and uses the same grow layer as the automatic algorithm. In seed-based region growing algorithms, the selection of initial seed points is crucial since it influences the final accuracy [60,61]; our results confirmed that the proposed approach is robust with respect to the choice of seeds from both OWA_AND and OWA_learn. Nevertheless, when choosing OWA_learn we can exploit the knowledge of its attitude to select the OWA_grow for generating the grow layer adaptively in each site, so as to minimize errors. In fact, the combination OWA_seed = OWA_AND and OWA_grow = OWA_Average achieves the same accuracy as the fully automatic algorithm at all the sites. However, if we regard the omission and commission, we can see that the balance is slightly different for the manually set algorithm at the Kalamos site, where oe = 5.2% and ce = 1.1%, while with the fully automatic algorithm a better balance is achieved: oe = 3.1% and ce = 2%. New experiments are needed to confirm the usefulness of the adaptability mechanism over the manual combination OWA_seed = OWA_AND and OWA_grow = OWA_Average.

5. Discussion

In this paper, we propose an approach for automatically mapping burned areas from S2 imagery, exploiting reflectance values in the S2 spectral bands; spectral signal of burned surfaces in post-fire images as well as in temporal difference in reflectance values are fused by OWA operators. The proposed algorithm builds on our previous work where OWAs were exploited to map surfaces affected by disturbances such as wildfires [31] and flooding [32]. In this paper, we further confirm that OWAs are flexible operators for data fusion in multi-criteria evaluations and we also propose a fully automated version of the BA mapping algorithm. In this improved version, we apply a ML algorithm, trained over input active fire points operatively made available by RS data, to learn the weighting vector of the OWA operator (OWA_learn). This way, we can learn an OWA that is tuned over site and fire characteristics. The experimental tests carried out over four study sites in southern Europe (Spain and Greece) for the 2017 summer fire season showed that the weighting vector learnt from the training AFs changes from site to site, thus reflecting differences in the characteristics of the surfaces affected by fires.

We propose exploiting two measures that can be derived from the semantics of the OWA_learn operator (orness and dispersion) to formalize fusion attitude through pessimism (ps) and democracy (dm). In particular, pessimism (ps) is exploited to automatically identify OWA_grow, i.e., the optimal growing layer of the RG algorithm that is implemented in the approach (Table 5).

Results of the experiments show that adapting the choice of OWA_grow to the degree of pessimism ps of the learnt OWA_seed (where OWA_seed = OWA_learn) allows us to achieve BA mapping accuracy equal or very close to that of the best performing OWA_grow at all four sites. This is because the adaptation mechanism actuated by the rules defined in (11) counterbalances the attitude of OWA_learn to generate more commission or more omission errors.

If we compare the results of the automatic algorithm with all those obtained by the semi-automatic algorithm in which OWA_seed = OWA_AND combined with different OWA_grow, we can observe that at two of the sites the automatic algorithm achieves equal or greater accuracy with respect to all semi-automatic versions. Only at the Calar site the greatest accuracy is achieved by the semi-automatic algorithm with OWA_seed = OWA_AND and OWA_grow = OWA_{Almost_OR}. The performance is certainly a function of the learning mechanism and of the ability of active fire points to represent the variability of the spectral characteristics of burned areas, as observed in S2 wavebands. Spectral characteristics of burned areas are largely variable as a function of pre-fire vegetation, soil properties, fire characteristics, fire severity and the age of the burned surface [62,63]. Fire severity is in fact one of the factors controlling post-fire vegetation recovery and regrowth. Although the learned OWA at this site may be inadequate to represent the actually burned areas, mapping accuracy, as quantified by the metrics, is more than satisfactory (dc > 0.8).

Figure 6 shows the results of the automatic algorithm in each site: the RG_score with seed points (first column) compared to EMS reference fire perimeters and the spatial distribution of the agreement between BA maps and reference (second column, distinct colors to mark agreement and disagreement classes). It can be immediately noticed that the RG algorithm, by exploiting spatial connectivity, generates more compact burned areas than those obtained by relying solely on the RG_score maps segmented with a given threshold, which appear highly fragmented. It can also be appreciated that omission (False Negatives) and commission (False Positives) errors are located at the boundaries between the burned (TP) and unburned (TN) areas, meaning that the grow layer generation could be refined. It is, however, true that in these regions we can find the most critical detection conditions, such as partially burned pixels and low severity burned pixels.

The degree of democracy indicates how many factors contribute to the final map of burned areas and how much influence each of them has. Figure 7 and Figure 8 depict the MDs of all factors at the Huelva and Kalamos sites, respectively; in both cases, the degree of democracy dm is below 0.5, which is the neutral attitude, thus corresponding to the nearly monarchical attitude (Table 5). Notwithstanding this, since their degrees of dm are noticeably different, being equal to 0.28 and 0.45 in Huelva and Kalamos, respectively, we can observe different patterns of the highly influential partial evidence degrees (MDs) at the two sites. In Huelva there are only a few highly influential factors, in line with the lower value of dm: the most influential is ΔSWIR2, followed by ΔNIR and ΔRE3. In Kalamos, we have a more variable situation, with two most influential factors, ΔSWIR2 and ΔRE2, followed by ΔRE3 and ΔNIR, and finally RE2 and RE3. This means that at the two sites, burned areas are characterized by different spectral reflectance values in S2 bands, probably due to different vegetation and burn severity. The algorithm is indeed able to adapt to site characteristics by flexibly selecting the most important layers in the fusion step. For example, in the Kalamos site, burned areas are mainly located in the shrubland vegetation class (data not shown).

6. Conclusions

The fully automatic, interpretable and adaptable algorithm offers several theoretical and practical advantages over approaches in the literature.

First of all, it needs a small set of classified points for training (active fires), which allows fast learning; in this case study, in particular, we used a total of 300, 79, 327 and 189 AF points for the Calar, Huelva, Kalamos and Zakynthos sites, respectively. By contrast, deep learning approaches, typically Convolutional Neural Networks (CNNs), need tens of thousands of classified pixels and, as a consequence, longer training phases. Additionally, in many real cases of Earth Observation over large areas, representative and spatially distributed data sets for training are not available. As proposed here, the training phase relies on input active fires (MODIS and VIIRS from the FIRMS system) that are operationally available at the global scale from satellite-based products. Moreover, when changing the area, one generally needs to repeat the training phase with new ground truth data; in fact, the transfer of pre-trained models greatly depends on the choice of a proper CNN architecture for the target purpose.

Conversely, the proposed approach being partially knowledge-driven and partially data-driven has the second advantage of exportability of the knowledge mined in a different area: in fact, it exploits domain knowledge and data analysis performed in a study area to identify the influential factors and transfers it to new sites without any need of modification. Exportability with respect to membership functions has been proved in a previous paper in which we applied predefined MFs trained in a different site to new sites without any modification and achieving a high mapping accuracy at all sites [31]. The adaptation occurs at the level of factors fusion which allows both the selection of the most influential factors and the factors’ contribution to be tuned depending on the characteristics of the site.

Average accuracy metrics values of the burned area maps delivered by the fully automatic approach at the four sites are oe = 0.057, ce = 0.068, dc = 0.94 and relB = 0.0046; these values refer to the implementation of the proposed approach incorporating the RG algorithm. In fact, validation clearly showed that RG provides the highest accuracy by reducing commission errors. Although a full comparison with published values is difficult due to the differences in input data and algorithms, our results are more than satisfactory and comparable to published reference values for accuracy metrics of burned area maps [64,65,66].

The third advantage is the interpretability of the fusion in terms of its attitude to generating a seed layer affected by more commission/omission errors. Deep learning approaches are black boxes which achieve high prediction accuracy at the expense of lack of transparency: there is no possibility of understanding the “why” of the prediction. Nevertheless, being able to understand the prediction criteria is important in order to increase experts’ knowledge of the problem. For example, one important aspect when using a product generated from remote sensing data analysis, which is inevitably affected by some form of error, is the knowledge of the types of errors: when using a map of burned areas to estimate the loss in ecosystems, it is important to know whether one is underestimating or overestimating the damage/loss. Nevertheless, there are situations in which reference data are not available to assess the accuracy of the generated map. With our proposed approach, even in this situation, we can state if the product will be affected more by commission or omission errors. Furthermore, being able to identify the most influential factors that determine the result is a condition that increases the trust of domain experts and their knowledge of the context.

Finally, the approach is general: in this paper we presented it to map burned areas but it can be applied for different tasks of environmental status assessment in land and environment management and planning. A version without the RG algorithm was successfully applied for mapping standing water areas [32]. The approach could be used to deliver maps of critical situations and anomalies produced by disturbance phenomena such as wildfires, floods, desertification, erosion.

Author Contributions

Conceptualization, Daniela Stroppiana, Mirco Boschetti, Gloria Bordogna; methodology, Daniela Stroppiana, Mirco Boschetti and Giovanna Sona; software, Daniela Stroppiana; validation, Matteo Sali, Daniela Stroppiana, Mirco Boschetti; formal analysis, Matteo Sali, Daniela Stroppiana; data curation, Daniela Stroppiana; writing—original draft preparation, Daniela Stroppiana and Gloria Bordogna; writing—review and editing, Daniela Stroppiana, Gloria Bordogna, Matteo Sali, Giovanna Sona; supervision, Giovanna Sona and Pietro Alessandro Brivio. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

Authors wish to acknowledge the four anonymous reviewers who significantly contributed to the improvements of the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Confusion matrices (TP = true positive, TN = true negative, FP = false positive, FN = false negative) and the metrics (omission error = oe, commission error = ce, Dice coefficient = dc, relative bias = relbias) for the selected and tested combinations of OWAseed and OWAgrow in the RG algorithm. Labels are the same displayed in Figure 5. The lines in bold are the best performing combinations of the seed and grow layer in each site; the lines outlined in yellow are those corresponding with the full automatic algorithm.

Seed	Grow	Label	TP	TN	FP	FN	oe	ce	dc	Relbias
Calar SP
AND	Average	RG_AND_Average	282073	1005800	10195	37818	0.118	0.035	0.920	0.027
AND	AlmostOR	RG_AND_AlmostOR	298753	990104	25891	21138	0.066	0.080	0.930	−0.005
AND	OR	RG_AND_OR	303087	912916	103079	16804	0.053	0.254	0.830	−0.085
learn	Average	RG_learn_Average	282276	1005800	10195	37615	0.118	0.035	0.920	0.027
learn	AlmostOR	RG_learn_AlmostOR	298753	990104	25891	21138	0.066	0.080	0.930	−0.005
learn	OR	RG_learn_OR	303087	912916	103079	16804	0.053	0.254	0.830	−0.085
Average							0.079	0.123	0.89	−0.021
Standard deviation							0.031	0.103	0.05	0.052
Huelva SP
AND	Average	RG_AND_Average	780066	3659400	98225	59952	0.071	0.112	0.910	−0.010
AND	AlmostOR	RG_AND_AlmostOR	803771	3612717	144908	36247	0.043	0.153	0.900	−0.029
AND	OR	RG_AND_OR	807912	3603750	153875	32106	0.038	0.160	0.900	−0.032
learn	Average	RG_learn_Average	782787	3656845	100780	57231	0.068	0.114	0.910	−0.012
learn	AlmostOR	RG_learn_AlmostOR	805279	3610946	146679	34739	0.041	0.154	0.900	−0.030
learn	OR	RG_learn_OR	809420	3601876	155749	30598	0.036	0.161	0.900	−0.033
Average							0.050	0.142	0.903	−0.024
Standard deviation							0.016	0.023	0.005	0.010
Kalamos GR
AND	Average	RG_AND_Average	260204	654869	3010	14341	0.052	0.011	0.970	0.017
AND	AlmostOR	RG_AND_AlmostOR	265908	652499	5380	8637	0.031	0.020	0.970	0.005
AND	OR	RG_AND_OR	266987	651743	6136	7558	0.028	0.022	0.970	0.002
learn	Average	RG_learn_Average	260204	654869	3010	14341	0.052	0.011	0.970	0.017
learn	AlmostOR	RG_learn_AlmostOR	265908	652499	5380	8637	0.031	0.020	0.970	0.005
learn	OR	RG_learn_OR	266987	651743	6136	7558	0.028	0.022	0.970	0.002
Average							0.037	0.018	0.970	0.008
Standard deviation							0.012	0.005	0.000	0.007
Zakynthos GR
AND	Average	RG_AND_Average	125635	2302900	14585	1122	0.009	0.104	0.940	−0.006
AND	AlmostOR	RG_AND_AlmostOR	126159	2298491	18994	598	0.005	0.131	0.930	−0.008
AND	OR	RG_AND_OR	126250	2296935	20550	507	0.004	0.140	0.930	−0.009
learn	Average	RG_learn_Average	125635	2302900	14585	1122	0.009	0.104	0.940	−0.006
learn	AlmostOR	RG_learn_AlmostOR	126159	2298491	18994	598	0.005	0.131	0.930	−0.008
learn	OR	RG_learn_OR	126250	2296935	20550	507	0.004	0.140	0.920	−0.009
Average							0.006	0.125	0.932	−0.008
Standard deviation							0.002	0.017	0.008	0.001
Global Average of full automatic algorithm over all sites							0.057	0.068	0.935	0.004
Global Standard deviation							0.048	0.048	0.026	0.017
Global Average of best performing algorithm over all sites							0.044	0.080	0.9375	−0.005
Global Standard deviation							0.030	0.041	0.025	0.005

Table A2. Confusion matrices (TP = true positive, TN = true negative, FP = false positive, FN = false negative) and the metrics (omission error = oe, commission error = ce, Dice coefficient = dc, relative bias = relbias) for the OWA layers used for mapping BAs without the RG algorithm. Labels are the same displayed in Figure 5.

OWA	Label	TP	TN	FP	FN	oe	ce	dc	Relbias
Calar SP
AND	noRG_AND	127865	1015964	31	192026	0.600	0.000	0.570	0.189
AlmostAND	noRG_AlmostAND	144475	1015940	55	175416	0.548	0.000	0.620	0.173
Average	noRG_Average	283576	845991	170004	36315	0.114	0.375	0.730	−0.132
AlmostOR	noRG_AlmostOR	299316	689234	326761	20575	0.064	0.522	0.630	−0.301
OR	noRG_OR	303032	618881	397114	16859	0.053	0.567	0.590	−0.374
Average						0.276	0.293	0.628	−0.089
Standard deviation						0.274	0.277	0.062	0.262
Huelva SP
AND	noRG_AND	197050	3752055	5570	642968	0.765	0.027	0.38	0.170
AlmostAND	noRG_AlmostAND	295352	3745169	12456	544666	0.648	0.040	0.51	0.142
Average	noRG_Average	782319	3601903	155722	57699	0.069	0.166	0.88	−0.026
AlmostOR	noRG_AlmostOR	803296	3530378	227247	36722	0.044	0.221	0.86	−0.051
OR	noRG_OR	807342	3501864	255761	32676	0.039	0.241	0.85	−0.059
Average						0.313	0.139	0.696	0.035
Standard deviation						0.362	0.100	0.234	0.111
Kalamos GR
AND	noRG_AND	157782	657813	66	116763	0.425	0.000	0.730	0.177
AlmostAND	noRG_AlmostAND	182066	657701	178	92479	0.337	0.001	0.800	0.140
Average	noRG_Average	259006	648900	8979	15539	0.057	0.034	0.950	0.010
AlmostOR	noRG_AlmostOR	264900	627739	30140	9645	0.035	0.102	0.930	−0.031
OR	noRG_OR	265990	616726	41153	8555	0.031	0.134	0.910	−0.050
Average						0.177	0.054	0.864	0.049
Standard deviation						0.189	0.061	0.095	0.103
Zakynthos GR
AND	noRG_AND	90652	2317245	240	36105	0.285	0.003	0.830	0.015
AlmostAND	noRG_AlmostAND	101437	2316142	1343	25320	0.200	0.013	0.880	0.010
Average	noRG_Average	125541	2248191	69294	1216	0.010	0.356	0.780	−0.029
AlmostOR	noRG_AlmostOR	126152	2199065	118420	605	0.005	0.484	0.680	−0.051
OR	noRG_OR	126259	2164238	153247	498	0.004	0.548	0.620	−0.066
Average						0.101	0.281	0.758	−0.024
Standard deviation						0.133	0.258	0.107	0.036

Figure 1. The four study sites selected for BA mapping in southern Europe.

Figure 2. Pre- and post-fire S2 images (first and second column, respectively) for each site: Calar, Spain (a); Huelva, Spain (b); Kalamos, Greece (c) and Zakynthos, Greece (d). S2 images are displayed as RGB false color composites (SWIR2, NIR, RED). In the second column, active fire points from MODIS (red) and VIIRS (yellow) are overlaid on the RGB image.

Figure 3. Semantics of the fusion of N = 7 contributing factors of burned areas defined by OWA operators with distinct degrees of pessimism (ps) in [0, 1] and democracy (dm) in {1/N, 2/N, …, N/N}.

Figure 4. Workflow of the fully automatic algorithm proposed for burned area mapping with the multi-criteria adaptive approach. The grey box highlights the innovative step in the algorithm introduced to fully automatize the definitions of both OWA_seed and OWA_grow, generating the seed and grow layers, respectively, and used in input by the RG algorithm.

Figure 5. Accuracy metrics estimated for the four study sites for all combinations of OWAs with region growing (RG) and without region growing (noRG) (oe = omission error, ce = commission error, dc = Dice coefficient, relB = relative bias).

Figure 6. Left column: the RG_score maps (shades of blue) output by the algorithm, with seed points (red pixels) and EMS reference perimeters (black line) highlighted. Right column: the accuracy maps (correct burned TP = orange, correct unburned TN = white, omission FN = green and commission FP = blue) for the four study sites: Calar, SP (a,b), Huelva, SP (c,d), Kalamos, GR (e,f) and Zakynthos, GR (g,h). Unburnable masked regions are grey.

Figure 7. Contribution of each feature MD after the re-ordering step (g_i, i = 1, …, 7) over the Huelva site, Spain: panels are ordered from left to right to show pixels contributing to the i-th position. Along the columns, each pixel value belongs to the n-th feature and contributes to the i-th ordered layer.

Figure 8. Contribution of each feature MD after the re-ordering step (g_i, i = 1, …, 7) over the Huelva site, Spain: panels are ordered from left to right to show pixels contributing to the i-th position. Along the columns, each pixel value belongs to the n-th feature and contributes to the i-th ordered layer.

Table 1. MSI S2 spectral bands, spectral domain, central wavelength, spatial resolution and name used in this paper. Temporal difference (Δ) is the reflectance difference between post-fire and pre-fire S2 images. In bold are the spectral bands and difference used as input features for the BA mapping algorithm in this study.

Band Name	Spectral Domain	Central Wavelength (µm)	Spatial Resolution (m)	Feature Name
Band 2	Blue	0.490	10	Blue and ΔBlue
Band 3	Green	0.560	10	Green and ΔGreen
Band 4	Red	0.665	10	Red and ΔRed
Band 5	Red Edge 1	0.705	20	RE1 and ΔRE1
Band 6	Red Edge 2	0.740	20	RE2 and ΔRE2
Band 7	Red Edge 3	0.783	20	RE3 and ΔRE3
Band 8	NIR	0.842	10	NIR and ΔNIR
Band 11	SWIR 1	1.610	20	SWIR1 and ΔSWIR1
Band 12	SWIR 2	2.190	20	SWIR2 and ΔSWIR2

Table 2. Pre-fire and post-fire S2 image dates. The reference date is the date of the EMS dataset used for validation in the four study sites. Dates are given as dd/mm and refer to the 2017 fire season.

Study Site	Pre-Fire Date	Post-Fire Date	Reference Date
Calar, Spain	15/07	04/08	04/08
Huelva, Spain	11/06	01/07	27/06
Zakynthos, Greece	25/07	03/09	18/08
Kalamos, Greece	28/07	17/08	18/08

Table 3. Sampled error/confusion matrix: n_ij expresses the number of pixels of agreement (diagonal cells) or disagreement (off-diagonal cells) between the BA product and the EMS reference. For binary classification (burned and unburned areas), with the identification of burned areas as the target of the algorithm, the following equivalences hold: True Positives (TP = n₁₁), True Negatives (TN = n₂₂), False Positives (FP = n₁₂) and False Negatives (FN = n₂₁).

		EMS Reference
		Burned	Unburned	Total
RG algorithm	Burned	n₁₁	n₁₂	n₁₊
RG algorithm	Unburned	n₂₁	n₂₂	n₂₊
	Total	n₊₁	n₊₂

Table 4. Metrics computed from the error/confusion matrix and range of variability.

Accuracy Metric Name	Formula	Range
Commission Error	$C e = \frac{n_{12}}{n_{1 +}}$	[0, 1]
Omission Error	$O e = \frac{n_{21}}{n_{+ 1}}$	[0, 1]
Dice Coefficient	$D C = \frac{2 n_{11}}{2 n_{11} + n_{12} + n_{21}}$	[0, 1]
Relative Bias	$r e l B = \frac{n_{21} - n_{12}}{n_{+ 1}}$	[−1, +1]
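
As a minimal sketch of how the metrics in Table 4 follow from the confusion-matrix counts of Table 3, the Python function below applies the four formulas as written. The function name `accuracy_metrics` and the dictionary keys are illustrative choices rather than anything defined in the paper; the counts in the usage lines are those of the Kalamos noRG_Average row of Table A2.

```python
def accuracy_metrics(tp: int, fp: int, fn: int) -> dict:
    """Apply the Table 4 formulas to confusion-matrix counts.

    tp = n11, fp = n12, fn = n21 in the notation of Table 3.
    TN (n22) does not enter any of the four formulas.
    """
    mapped_burned = tp + fp      # n1+ : pixels mapped as burned
    reference_burned = tp + fn   # n+1 : pixels burned in the reference
    return {
        "ce": fp / mapped_burned,                # commission error
        "oe": fn / reference_burned,             # omission error
        "dc": 2 * tp / (2 * tp + fp + fn),       # Dice coefficient
        "relB": (fn - fp) / reference_burned,    # relative bias, as in Table 4
    }


# Illustrative usage with the Kalamos noRG_Average counts from Table A2.
m = accuracy_metrics(tp=259006, fp=8979, fn=15539)
print({name: round(value, 3) for name, value in m.items()})
```

For this row the sketch gives oe ≈ 0.057, ce ≈ 0.034 and dc ≈ 0.955, in line with the tabulated omission error, commission error and Dice coefficient.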

Table 5. Weighting vector of OWA_learn at each site, with its degrees of pessimism (ps) and democracy (dm), the corresponding attitude expressed linguistically, the type of error expected in the seed layer generated by OWA_learn, the OWA_grow predicted by the heuristic rules in (11), and the best-performing OWA_grow identified by comparison with the validation data.

	OWA_learn Weighting Vector	ps	dm	Attitude	Expected Errors in Seed Layer	Predicted OWA_grow (OWA_seed = OWA_learn)	Best OWA_grow (OWA_seed = AND)
Calar	[0.43, 0.02, 0.03, 0.03, 0.13, 0.16, 0.21]	0.55	0.67	Towards Pessimistic and Nearly Democratic	ce ≥ oe	Average	Almost OR (Δdc = 0.01)
Huelva	[0.69, 0.00, 0.00, 0.00, 0.00, 0.00, 0.30]	0.70	0.28	Towards Pessimistic and Nearly Monarchical	ce > oe	Average	Average
Kalamos	[0.36, 0.02, 0.00, 0.00, 0.02, 0.11, 0.49]	0.40	0.45	Towards Optimistic and Nearly Monarchical	oe ≥ ce	Almost OR	OR (Δdc = 0.007)
Zakynthos	[0.53, 0.00, 0.00, 0.00, 0.00, 0.00, 0.46]	0.54	0.30	Towards Pessimistic and Nearly Monarchical	ce ≥ oe	Average	Average
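
The ps column of Table 5 can be related to the weighting vectors with a short sketch. It assumes that the degree of pessimism is a position-weighted sum of the N = 7 OWA weights per site, Σ w_i (N − i)/(N − 1), which is the form of Yager's classical orness measure; this formula is an assumption made here for illustration and is not quoted from the paper, whose own definition in Section 3.3 takes precedence. The function name `pessimism_degree` is likewise hypothetical.

```python
def pessimism_degree(weights):
    """Position-weighted sum of OWA weights: sum_i w_i * (N - i) / (N - 1), i = 1..N.

    ASSUMPTION: this orness-style formula is used for illustration only;
    it is not taken from the paper.
    """
    n = len(weights)
    return sum(w * (n - i) / (n - 1) for i, w in enumerate(weights, start=1))


# The seven OWA_learn weights per site, as listed in Table 5 (rounded values).
owa_learn = {
    "Calar": [0.43, 0.02, 0.03, 0.03, 0.13, 0.16, 0.21],
    "Huelva": [0.69, 0.00, 0.00, 0.00, 0.00, 0.00, 0.30],
    "Kalamos": [0.36, 0.02, 0.00, 0.00, 0.02, 0.11, 0.49],
    "Zakynthos": [0.53, 0.00, 0.00, 0.00, 0.00, 0.00, 0.46],
}

for site, w in owa_learn.items():
    print(site, round(pessimism_degree(w), 2))
```

Under this assumption the computed values agree with the tabulated ps to within about 0.01 at every site, which is consistent with the weights being reported to two decimals. The dm column is not reproduced here, since its definition is not given in this part of the paper.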

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
