Next Article in Journal
Clustering of Wall Geometry from Unstructured Point Clouds Using Conditional Random Fields
Next Article in Special Issue
Forest Connectivity, Host Assemblage Characteristics of Local and Neighboring Counties, and Temperature Jointly Shape the Spatial Expansion of Lyme Disease in United States
Previous Article in Journal
Strawberry Yield Prediction Based on a Deep Neural Network Using High-Resolution Aerial Orthoimages
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

RFim: A Real-Time Inundation Extent Model for Large Floodplains Based on Remote Sensing Big Data and Water Level Observations

State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430079, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2019, 11(13), 1585; https://0-doi-org.brum.beds.ac.uk/10.3390/rs11131585
Submission received: 5 June 2019 / Revised: 29 June 2019 / Accepted: 3 July 2019 / Published: 4 July 2019
(This article belongs to the Special Issue Remote Sensing for Public Health)

Abstract

:
The real-time flood inundation extent plays an important role in flood disaster preparation and reduction. To date, many approaches have been developed for determining the flood extent, such as hydrodynamic models, digital elevation model-based (DEM-based) methods, and remote sensing methods. However, hydrodynamic methods are time consuming when applied to large floodplains, high-resolution DEMs are not always available, and remote sensing imagery cannot be used alone to predict inundation. In this article, a new model for the highly accurate and rapid simulation of floodplains, called “RFim” (real-time inundation model), is proposed to simulate the real-time flooded area. The model combines remote sensing images with in situ data to find the relationship between the inundation extent and water level. The new approach takes advantage of remote sensing images, which have wide spatial coverage and high resolution, and in situ observations, which have continuous temporal coverage and are easily accessible. This approach has been applied in the study area of East Dongting Lake, representing a large floodplain, for inundation simulation at a 30 m resolution. Compared with the submerged extent from observations, the accuracy of the simulation could be more than 90% (the lowest is 93%, and the highest is 96%). Hence, the approach proposed in this study is reliable for predicting the flood extent. Moreover, an inundation simulation for all of 2013 was performed with daily water level observation data. With an increasing number of Earth observation satellites operating in space and high-resolution mappers deployed on satellites, it will be much easier to acquire large quantities of images with very high resolutions. Therefore, the use of RFim to perform inundation simulations with high accuracy and high spatial resolutions in the future is promising because the simulation model is built on remote sensing imagery and gauging station data.

Graphical Abstract

1. Introduction

Floods are one of the most common and harmful natural disasters in the world, having caused direct economic losses exceeding $1 trillion and killing more than 220,000 people over the last forty years [1]. The flood risk may not be reduced and may even increase in many regions of the world in the future because of climate change [2]. Knowing the real-time flood inundated extent during a flood event is an important way to respond quickly and reduce disaster impacts [3]. Great efforts have been made to study flood inundation, and the methods can be roughly divided into three types: Hydrodynamic methods, DEM-based (digital elevation model-based) inundation methods, and remote sensing methods.
Based on hydrodynamic models, hydrodynamic methods mathematically express the physical laws of flood movement and the inundation extent. From the perspective of dimension, the hydrodynamic models include one-dimensional (1D), two-dimensional (2D), and three-dimensional (3D) models. One-dimensional hydrodynamic models are used to calculate the flow through a cross section, but they cannot directly simulate the flood extent. Two-dimensional hydrodynamic models, which describe floodplain flow in two dimensions, are popular models for inundation analysis. However, 2D models require a lot time [4] and are impractical to apply in large-floodplain scenarios [5], and some of them may need spatial parametric inputs that are not available. Three-dimensional hydrodynamic models, which add the vertical dimension to analyze the flood extent, still have limitations on computational costs and data requirements when they are utilized to simulate flood extent in a large-scale floodplain.
DEM inundation methods are based on simplified hydraulic concepts and DEMs. DEM-based inundation methods do not require as many computing resources as hydrodynamic methods [6,7,8]. These methods are fast and easy to use when predicting the flood extent. However, high-resolution DEMs are not always available, especially in large floodplains.
Remote sensing methods analyze the flood inundation extent from direct observations. The Earth observation satellites, e.g., Landsat, SPOT (Systeme Probatoire d’Observation de la Terre), and MODIS (moderate-resolution imaging spectroradiometer), have accumulated a large number of images and are still acquiring images every day. Some researchers use these images to extract flood inundation information for flood mapping and analyses of inundation changes [1,9,10], but they cannot show the inundation extent in a timely manner during a flood event because the methods only record the flood extent at specific times.
In recent years, an increasing number of satellites have been deployed with high-resolution mappers and short revisit cycles. These huge imagery data from different satellites are made up of remote sensing big data [11], which accumulate a large amount of remote sensing data on flood extent and flood history.
When floods spread, the flood extent could be affected by factors like topography, water level, and so on. Under a certain water level, there is a certain flood extent due to the topography, if that water level could represent every place of the flood water surface and the topography in that area is relatively stable. Hence, there is a relationship between water level and flood extent, if the topography does not change. Some studies find and build the relationship between flood area and water storage or water level, but do not consider the dynamic flood extent [9,10]. Hence, it is meaningful to explore and find the relationship between flood extent and one of or some of the influencing factors by incorporating remote sensing big data. Water level, which is relatively simple to access and partly determines whether a place is submerged or not, could be a suitable factor to establish the relationship with flood extent. After building the relationship between flood extent and water level, the real-time flood extent could be simulated under the real-time water level observations and that relationship.
The purpose of this research is to design a new and simple approach called “RFim” (real-time inundation model) which can make real-time simulations on flood inundation extent. Based on the remote sensing big data and water level values, RFim models the relationships between water levels and inundation extents, and will simulate real-time inundation extents under the daily water level data and that relationship. This means that the RFim approach does not need detailed spatial parameters, which are usually not available [12,13,14,15,16], or high-resolution DEMs as inputs, unlike hydrodynamic models and DEM-based methods. Furthermore, RFim overcomes the shortcoming that using historical images alone cannot predict the inundation extent, by incorporating water level values for which, as we know, the timely monitoring is relatively simple during floods. Our research identified that the results of inundation simulation using RFim were acceptable in a large floodplain, East Dongting Lake. In addition, the demand has increased for real-time flood inundation extent information in decision making for flood prevention and mitigation. The future results of RFim are expected to be desirable. In this article, the study area and data are described in Section 2. The methodology and results are illustrated in Section 3 and Section 4, respectively. The applications are shown in Section 5. The discussions are presented in Section 6. The conclusions and future work are summarized in Section 7.

2. Study Area and Data

2.1. Study Area

East Dongting Lake, which is connected to the Yangtze River, is the largest lake in the Donging Lake system [17]. It covers the region from approximately 28°59″ to 29°38″ and 112°43″ to 113°15″ [18], and drains an area of 1900 km2, including a water area of 364 km2 [19]. The location of East Dongting Lake is shown in Figure 1. Since it is a flood basin of the Yangtze River and precipitation is imbalanced during the year, East Dongting Lake fluctuates dramatically between the wet and dry seasons, with the maximum area in August and the minimum area in January or February [20]. The elevations of East Dongting Lake are lower than 35 m in Huanghai Datum 1965, and slopes are less than 3° [21]. In recent years, under the impact of economic development and human activities, flood disasters have occurred frequently around East Dongting Lake. Hence, it is significant to predict the inundation extent in this area for flood disaster preparation and mitigation.

2.2. Data Selection

The size of East Dongting Lake is always changing throughout the year. To assess its extent as much as possible, Landsat images of the lake with a dense time series were chosen. Generally, large-scale flood inundation models are run at a 25–100 m resolution [22]. Landsat images with a 30 m resolution were chosen. In this paper, the archives of Landsat 5 Thematic Mapper (TM) and Landsat 8 Operational Land Imager (OLI) orthorectified, top-of-atmosphere reflectance images acquired between 2001 and 2016 were used. The 97 images acquired from 2002 to 2011 that were used for constructing the simulation model in the study are shown in Table 1. The images acquired from 2013 to 2016 that were used for validation or application are also shown in Table 1. Landsat 5 was launched on 1 March 1984, to collect imagery of the Earth until 5 June 2013. Landsat 8, launched on 11 February 2013, still works. Landsat 5 and 8 revisit the same scene every 16 days, with a 30 m resolution. Landsat 7 Enhanced Thematic Mapper-plus (ETM+) images were not used in the study because of the scan line corrector (SLC) failure [23], which led to the loss of more than 20% of each single scene acquired after 23 May 2003 [24].
The measurements of the water level were provided by the Chenglingji gauging station (29.41°, 113.12°), which is located at the confluence of East Dongting Lake and the Yangtze River, and is near East Dongting Lake. This gauging station provides the daily water level of East Dongting Lake. The collected daily water levels at the Chenglingji gauging station from 1 January 2001 to 31 December 2016 were used in this study.

2.3. Data Preparation

To pre-process the images quickly, Google Earth Engine (GEE), a free cloud-computing platform, was used in this study. The GEE provides Internet-based application programming interfaces (APIs) that enable users to process and analyze large quantities of remote sensing images at a time. Additionally, many geographical datasets could be found in the GEE, including Earth surface observations from Landsat, SPOT, MODIS, and Sentinel-1 [25]. The main part of data preparation is cloud removal. More than five hundred Landsat 5 and 8 images were acquired in the research area from 2001 to 2016, but most of them were partly or entirely covered by clouds. It is necessary to remove the clouds to make use of the acquired images. There is an API called ‘simpleCloudScore’, provided by the GEE, that calculates the cloud likelihood for every pixel in the range from 1 to 100 by using the normalized difference snow index (NDSI), and the brightness and temperature from the Landsat imagery [26]. The details of simpleCloudScore could be found in the GEE documentation. The simpleCloudScore is easy to operate and saves time, and some studies used the API to remove the clouds [26,27]. Therefore, we also chose to use it to remove the clouds. For this study, more than 40% of the images covered by clouds were deleted. A threshold of 20 for the cloud score was used to remove clouds based on the visual interpretation of the Landsat images [27]. After cloud removal, only 97 images were left that were suitable for further processing.

3. Method

The basic idea of RFim is to find the relationships between water levels and inundation extents. Following the main idea of RFim, the method of the real-time flood inundation simulation and prediction model based on remote sensing data is shown in Figure 2. The method consists of six steps: (1) Discretizing the study area and resampling the images; (2) extracting historical flood extents based on remote sensing images; (3) relating the water level values to pixels; (4) forming inundation records for every grid cell in the study area; (5) establishing the relationship between inundation extent and water level; and (6) simulating and predicting flood extent with the relationship between inundation extent and water level.

3.1. Discretizing the Study Area and Resampling the Images

The proposed method requires pixels covering the same position to be completely coincident in different images. Therefore, it is necessary to discretize the research area, as shown in Figure 3. Then, the remote sensing images used in the experiment are resampled, using nearest-neighbor interpolation according to the results of grid cell division, as shown in Figure 4. After that, the pixels covering the same location are completely coincident in different resampled images. For example, the study area, East Dongting Lake, is divided into 1560 rows and 1907 columns to discretize the area into 30 m grid cells, the amount of which is 2,974,920 (1560 × 1907).

3.2. Extracting the Historical Flood Extents Based on Remote Sensing Images

Water extent extraction is a common topic in object classification, and experts have developed many models to successfully extract surface water, such as tasselled cap transformation, the normalized differences of water index (NDWI) [28], the modified normalized differences of water index (mNDWI) [29], and support vector machines (SVMs) [30]. In this research, the NDWI is selected to detect the water extents from the images because of its simplicity and high accuracy in large open water [31]. The NDWI separates the water from the background by the different reflections of the green band (p(Green)) and near infrared band (p(NIR)) on different objects. It should be stressed that the water extraction method is not fixed, and any method could be used if it is suitable for the experimental conditions. The mathematical model of the NDWI is as follows:
NDWI = (p(Green) − p(NIR))/(p(Green) + p(NIR))
Determining the threshold is one of the most important steps in using the NDWI to distinguish water from non-water areas [32]. Unfortunately, the threshold values between water and other objects were unstable and varied with scenes and locations [33], and it is quite impractical to find the threshold for every image by hand. Otsu thresholding [34] is widely used to determine the optimal threshold for water detection with the NDWI [35]. From the image histogram, the Otsu method decided the ideal threshold to make the between-class variance maximum and the within-class variance minimum. Thus, Otsu thresholding was selected to determine the threshold automatically in this paper. The accuracy of water extraction this study using NDWI and Ostu thresholding is shown in Appendix A.

3.3. Relating the Water Level Values to Pixels

To find the relationship between water levels and inundation, it is necessary to relate the right pixels to corresponding water levels according to the date when the image is acquired. If there is one gauging station that can provide water level data, the procedure of relating water level values to images is illustrated in Figure 5. As shown in the right part of Figure 5, each pixel in blue with the water level value in it means that the pixel was submerged based on observations at that water level, and each pixel in white with the water level value in it means that the pixel was not submerged based on observations at that water level. Furthermore, if there are two or more gauging stations, each pixel will be related to the water level value which is calculated from water level values in the gauging stations, based on the inverse distance weighted interpolation method and the distances between gauging stations and the center of the pixel.

3.4. Forming Inundation Records for Every Grid Cell in the Study Area

Based on the previous step, where all the pixels in the images have been labelled with water level values, pixels from different images but in the same grid cell that the study area is divided into are placed together to form a set of inundation information and corresponding water level values for that grid cell, as shown in Figure 6. The set of inundation information and water level values for that grid cell are called its “inundation records” in this article. After obtaining inundation records for a grid cell, we can easily determine whether the grid cell is inundated or not under the water level from the inundation records. In Figure 7, a schematic diagram of the inundation records, sorted by the water level value, in one grid cell is displayed, which looks like a column of cells.
Ideally, the minimum value of the water level under which the grid cell is inundated is always larger than the maximum value of the water level where the grid cell is not flooded. However, in some cases, the minimum value under which the grid cell is submerged is lower than the maximum value under which the grid cell is unsubmerged, as shown in the middle of Figure 8, due to errors when detecting water. Similar to other classification methods, the NDWI could not separate water from no-water with 100% accuracy. This means that a grid cell that is not submerged may be classified as an inundation area when extracting the water surface from an image because of the error associated with detecting water. When this error occurs, the minimum value or maximum value with the corresponding inundation information is called an “abnormal record” in the article. To prevent such anomalies from affecting the subsequent steps, these abnormal records should be removed from the inundation records.
The procedure that determines abnormal records is as follows:
a. Find the minimum value under which a grid is submerged or wet, and the maximum one under which a grid is unsubmerged or dry in the ‘inundation records’ of that grid. If the minimum value and the maximum value do not exist at the same level, there is no need to remove ‘abnormal records’, or in other words, there is no ‘abnormal record’;
b. Compare the minimum and the maximum value which are found in step a. If the minimum value under which the grid is submerged is lower than the maximum one under which the grid is unsubmerged, remove these ‘abnormal records’ from the ‘inundation records’ of that grid. If the minimum is larger than the maximum, there is no ‘abnormal record’, and so we do not need to remove ‘abnormal records’;
c. Repeat step a and step b on the rest of ‘inundation records’ of that grid in step b.

3.5. Establishing the Relationship between the Inundation Extent and Water Level

The essence of establishing the relationship between flood extent and water level is to find out the threshold for every grid cell that determines whether it would be submerged when compared to the water level.
It is obvious from Figure 7 that the inundation threshold of a grid cell is the value between the lowest water level under which the grid cell is submerged and the highest water level under which the grid cell is unsubmerged. However, any value in that range might be the true threshold. We choose the mean value of the lowest water level under which the grid cell is submerged plus the highest level under which the grid cell is unsubmerged. Although the mean value might not equal the exact threshold, it would be near the true threshold when the observations are dense enough to make that range small. The impacts of the mean value will be explained in the analysis section. The equation for calculating the threshold is as follows:
Threshold = ( u +   b ) / 2
where u is the lowest water level under which the grid cell is submerged and b is the highest water level under which the grid cell is unsubmerged.

3.6. Simulating and Predicting Flood Extent with the Relationship between Inundation Extent and Water Level

After calculating threshold values for each grid cell in the inundating threshold part, the inundation extent could be predicted easily by comparison with the water level on the desired date, as shown in Figure 9.

4. Results

The remote sensing images were divided into two parts: One part was used for modelling acquired from 1 January 2002 to 31 December 2011 by calculating the water level threshold in every grid cell, and the other part was used for validation or application acquired from 1 January 2013 to 31 December 2016.
Due to the location of East Dongting Lake and data losses when removing clouds in images, at least two images were required to cover the study area. Hence, “composite images” were used for validation in our study, and composite image in this article refers to mosaicking several images. We mosaicked the images according to four different periods: From 1 July 2014 to 30 September 2014; from 1 April 2015 to 30 June 2015; from 1 October 2015 to 31 December 2015; and from 1 January 2016 to 31 March 2016.
The Chenglingji gauging station was used in the inundation simulation on composite images from 1 July 2014 to 30 September 2014; from 1 April 2015 to 30 June 2015; from 1 October 2015 to 31 December 2015; and from 1 January 2016 to 31 March 2016. The results are shown in Figure 10. The accuracies and kappa coefficients were determined using a cell-to-cell comparison strategy between the predictions and observations. The accuracies and kappa coefficients are shown in Table 2. The flooded area obtained from Landsat imagery was regarded as the real and correct flood extent. The prediction accuracy is equal to the percentage of cells correctly predicted by the approach, and the formulation is shown in Equation (3):
Prediction accuracy = (w + n)/s
where w is the number of correct water pixels, n is the number of correct non-water pixels, and s is the number of pixels in the study area.

5. Application

The model is easy to set up for being used by users with relatively little hydraulic modelling experience. This model could simulate real-time flood extent, but also reproduce the flood extents in the past when the images on the floodplain were not available. Therefore, inundation extent on every day of 2013 was simulated based on the model and water level values collected during that period. The daily water level variations from 1 January 2013 to 31 December 2013 are shown in Figure 11, and the daily variations in the inundation area from 1 January 2013 to 31 December 2013 in East Dongting Lake are shown in Figure 12.
From Figure 12, it is clear that the variations in the inundation area during 2013 exhibit a first-rise-and-then-fall pattern that is similar to that for the changes in water level during 2013. The largest inundation area in 2013 was 1069.8 km2 when the water level at Chenglingji station was 29.83 m, and the lowest in 2013 was 9.1 km2 when the water level was 20.41 m. The variations in the inundation extent every four days from 5 May 2013 to 21 May 2013 are displayed in Figure 12.

6. Discussions

Among the predictions during the different time periods, there was the best agreement between the inundation prediction and the observation for the composite image from 1 July 2014 to 30 September 2014. Up to 96% of the cells or grid cells were predicted correctly based on the composite from 1 July 2014 to 30 September 2014, and the kappa coefficient was approximately 0.92, which represents almost perfect agreement. The worst performance of the approach was the prediction on the composite image from 1 January 2016 to 31 March 2016, with 93% of the cells predicted correctly, and the kappa coefficient was approximately 0.67, which represents substantial agreement. Although 93% was the lowest accuracy compared to those of the other three predictions, it was acceptable in the flood simulation. The accuracies of the four predictions were all approximately 94%, and the kappa coefficients ranged from 0.67 to 0.91.
There is a main reason that could explain why the accuracy of prediction from 1 January 2016 to 31 March 2016 was relatively low. The number of remote sensing images with good quality during that period was small compared to those in the other three time periods. The number of images during each month used to build the prediction model is shown in Figure 13. From 2001 to 2011, only 11 images, which were acquired from January to March, were considered good observations and used for modelling. The fewer the number of good observations that are acquired, the less the historical water extent can be accessed. The shortage of good observations made the model perform poorly from 1 January 2016 to 31 March 2016. The good performance from 1 July 2014 to 30 September 2014 could partially be explained by the relatively adequate images utilized for modelling.
Apart from the number of remote sensing images, three factors affected the model performance.
The first factor was the failure to extract a narrow water body when using the NDWI. Although the difference between water and other objects could be enhanced by the NDWI, its formulation is inherently sensitive to imagery noise [36]. In addition, mixed water pixels usually appear in narrow rivers and shallow water [37]. This made it difficult to delineate the narrow water bodies. The lake expanded and merged many narrow rivers and small ponds into a large single unity during the flood season, which reduced the narrow water bodies that were hard to detect using the NDWI. During the dry season, the lake shrank, and the narrow water bodies appeared, which led to much confusion regarding water extraction. Then, the results of water delineation, with failure to detect narrow water bodies, were applied to simulate the inundation extent and violated the predictions to some degree. Since the water extraction in the images during the wet season did not interfere with the small water surface, the inundation simulation from 1 July 2014 to 30 September 2014 had the highest accuracy in the experiment.
The second factor that had a negative effect on the model performance was including only one gaging station, the Chenglingji water station, for the measurements of water stages. Although the Chenglingji water station could represent the daily water level of East Dongting Lake using a single value, the height of the whole water surface actually varied slightly from place to place, especially when many water bodies shrank and separated from the lake during the dry season.
The third factor was using the mean value to represent the real threshold in each grid cell. This impacts the performance of RFim, as the mean value and the real threshold are not equal in every grid cell. If the gap, which is the difference between the lowest water level under which the grid cell is submerged and the highest level under which the grid cell is unsubmerged, is small, RFim is likely to obtain the mean value equivalent the real threshold in that grid cell. The grid cells were collected, of which corresponding thresholds were calculated from water level values from July to September, and the distribution of the gaps from these grid cells is shown in Figure 14a. Additionally, the distribution of the gaps from which the corresponding thresholds were calculated from the water levels from January to March is shown in Figure 14b. From these figures, the gaps for the water levels from July to September were smaller than the gaps for the water levels from January to March. Thus, this observation can partly explain why the prediction from 1 July 2014 to 30 September 2014 was better than the prediction from 1 January 2016 to 31 March 2016.
The new approach described in the paper could determine the flood extent in a large floodplain thanks to the wide spatial coverage and short revisit cycle of remote sensing images. It successfully simulated the inundation extent in the approximately 2000 km2 region in this study. Furthermore, hydrodynamic models need detailed parameters as inputs, which may be not available. Since the data for this approach are temporally continuous and easy to access, the new method could simulate the flooded area in real time if the real-time water level was given.
If the first five steps have been finished and the inundation thresholds of the grid cells are already known, it will not take much time to obtain a real-time inundation simulation according to the real-time measurements of the water stage in the final part. The final part of the approach is, essentially, a simple matrix operation that is just comparing the elements in the matrix with the given value. Real-time inundation simulation can be achieved with this method.
This method does have the capabilities to use remote sensing big data with huge volume and big complexity, although only Landsat archived data were used in this article. If the user intends to apply this method with remote sensing big data, the inundation simulation can be made through the same six steps of the method, which were illustrated in this article. The only adjustment of the method when using remote sensing big data is the way of extracting water bodies according to the resolutions and wave bands of images.
From the application section, the method also shows that it could simulate real-time flood extent, and also reproduce the flood extents from the past when the images on the floodplain were not available.
With the increasing number of Earth observation satellites equipped with higher-resolution mappers, the ability to predict the real-time inundation extent at higher spatial resolutions by using remote sensing imagery and in situ measurements of water height is promising. Over the last twenty years, an increasing number of high-resolution sensors have been operating in space. For example, Ikonos, Worldview, Quickbird, and RapidEye have the ability to provide imagery at the metre or submetre level.
There are some drawbacks of the new method:
First, it could not model flood movement. However, if the flood extent is predicted at a daily time step, flood movement is relatively unimportant on such a temporal scale.
Second, the new method may not predict the inundation area correctly under water levels higher than the historical maximum water level, or lower than the minimum. The RFim is built on images and the corresponding water levels among the acquired dates of the images. Therefore, the range of water levels under which RFim could simulate the corresponding inundation area is limited. If a flood occurred with a water level higher than the historical maximum level, the model could not perform well.
Third, there should be at least one gauging station or other device which could measure and represent the water levels of rivers or lakes in the target area.
Fourth, the new method works well only if the topography of the study area is relatively stable. If the terrain changes greatly, the inundation extent will be different under the same water level, which will affect the performance of the model. If many other satellites are selected, such as SPOT, Ikonos, and GF-1, more information about water level and inundation extent can be obtained in a short period, during which the terrain is likely to be relatively stable, avoiding the impact of topographic changes in the study area.
Fifth, if the water level data are scarcer, the performance will be affected. Scarcer water level data mean that fewer water levels could be related to flood extents according to whether the water level and the flood extent were acquired at the same date, which is necessary to establish the relationship between flood extent and water level.

7. Conclusions and Further Work

In this paper, a new approach was proposed for flood extent simulation and prediction. The new approach tries to find a relationship between remote sensing big data and in situ data to build a real-time flood prediction model, called RFim, taking advantage of the wide spatial coverage and high resolution of remote sensing images, and the continuous temporal coverage and easy accessibility of in situ observations. RFim was validated in East Dongting Lake. The prediction accuracy was approximately 94%, and the kappa coefficients ranged from 0.67 to 0.91. With an increasing number of Earth observation satellites operating in space and equipped with high-resolution mappers, the approach in this study has great potential for real-time flood simulation, since RFim is based on remote sensing big data.
There are some points that need to be considered in future work. Firstly, how to balance the cost of using remote sensing big data and the performance of the model needs to be considered. Using large quantities of images can improve the model performance, but the performance may be improved only a little with a heavy price of computational resources. Secondly, how to reduce the negative effect from scarcer water level data needs to be investigated. This may solve the problems of finding and establishing more relationships between other factors and the flood extent. That is, when water level data cannot be accessed, other factors can be options as inputs to simulate flood extent. Thirdly, how to extend the model capabilities to predict not just flood extent, but also flood duration and water volume, is worth exploring.

Author Contributions

Z.C. and J.L. conceived and designed the research, processed the data, and wrote the manuscript. N.C. conducted the fieldwork and reviewed the manuscript. R.X. and G.S. contributed materials.

Funding

This work was supported by the National Natural Science Foundation of China (nos. 41771422, 41890822).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Water Extraction Accuracy Using NDWI

Here, we will calculate the accuracy of water extraction for the composite image from 1 July 2014 to 30 September 2014. Since it was impossible to verify more than 100 images, the composite image from 1 July 2014 to 30 September 2014 was selected for verification. There were two purposes for choosing the composite image from 1 July 2014 to 30 September 2014 for verification: One was to show that it is reasonable to use NDWI to detect water; and the other was that water extraction using NDWI can be treated as the real water distribution, which will be compared with the results of the flood simulation.
The method of verification is as follows:
a. Select 250 samples randomly from the image;
b. By visual object interpretation, identify the classes of the 250 samples;
c. Compare the samples that have been identified with the results of water extraction based on the image, and then calculate the accuracy of extraction.
The accuracy of the composite image from 1 July 2014 to 30 September 2014 is 98.00%, with 130 samples classified correctly as water, 115 classified correctly as non-water, and only five classified incorrectly.

References

  1. Pekel, J.F.; Cottam, A.; Gorelick, N.; Belward, A.S. High-resolution mapping of global surface water and its long-term changes. Nature 2016, 540, 418–422. [Google Scholar] [CrossRef]
  2. Jonkman, S.N. Global Perspectives on Loss of Human Life Caused by Floods. Nat. Hazards 2005, 34, 151–175. [Google Scholar] [CrossRef]
  3. Burn, D.H.; Whitfield, P.H. Changes in floods and flood regimes in Canada. Can. Water Resour. J./Rev. Can. Ressour. Hydr. 2015, 41, 139–150. [Google Scholar] [CrossRef]
  4. Leandro, J.; Chen, A.S.; Schumann, A. A 2D parallel diffusive wave model for floodplain inundation with variable time step (P-DWave). J. Hydrol. 2014, 517, 250–259. [Google Scholar] [CrossRef]
  5. Teng, J.; Jakeman, A.J.; Vaze, J.; Croke, B.F.W.; Dutta, D.; Kim, S. Flood inundation modelling: A review of methods, recent advances and uncertainty analysis. Environ. Model. Softw. 2017, 90, 201–216. [Google Scholar] [CrossRef]
  6. Nobre, A.D.; Cuartas, L.A.; Hodnett, M.; Rennó, C.D.; Rodrigues, G.; Silveira, A.; Waterloo, M.; Saleska, S. Height Above the Nearest Drainage—A hydrologically relevant new terrain model. J. Hydrol. 2011, 404, 13–29. [Google Scholar] [CrossRef]
  7. Bermúdez, M.; Ntegeka, V.; Wolfs, V.; Willems, P. Development and Comparison of Two Fast Surrogate Models for Urban Pluvial Flood Simulations. Water Resour. Manag. 2018, 32, 2801–2815. [Google Scholar] [CrossRef]
  8. Gouldby, B.; Sayers, P.; Mulet-Marti, J.; Hassan, M.A.A.M.; Benwell, D. A methodology for regional-scale flood risk assessment. Proc. Inst. Civil Eng. Water Manag. 2008, 161, 169–182. [Google Scholar] [CrossRef]
  9. Henry, J.B.; Chastanet, P.; Fellah, K.; Desnos, Y.L. Envisat multi-polarized ASAR data for flood mapping. Int. J. Remote Sens. 2007, 27, 1921–1929. [Google Scholar] [CrossRef]
  10. Feng, L.; Hu, C.; Chen, X.; Cai, X.; Tian, L.; Gan, W. Assessment of inundation changes of Poyang Lake using MODIS observations between 2000 and 2010. Remote Sens. Environ. 2012, 121, 80–92. [Google Scholar] [CrossRef]
  11. Ma, Y.; Wu, H.; Wang, L.; Huang, B.; Ranjan, R.; Zomaya, A.; Jie, W. Remote sensing big data computing: Challenges and opportunities. Futur. Gener. Comput. Syst. 2015, 51, 47–60. [Google Scholar] [CrossRef]
  12. Ernst, J.; Dewals, B.J.; Detrembleur, S.; Archambeau, P.; Erpicum, S.; Pirotton, M. Micro-scale flood risk analysis based on detailed 2D hydraulic modelling and high resolution geographic data. Nat. Hazards 2010, 55, 181–209. [Google Scholar] [CrossRef]
  13. Van Der Knijff, J.M.; Younis, J.; De Roo, A.P.J. LISFLOOD: A GIS-based distributed model for river basin scale water balance and flood simulation. Int. J. Geogr. Inf. Sci. 2010, 24, 189–212. [Google Scholar] [CrossRef]
  14. Wang, J.; Hong, Y.; Li, L.; Gourley, J.J.; Khan, S.I.; Yilmaz, K.K.; Adler, R.F.; Policelli, F.S.; Habib, S.; Irwn, D.; et al. The coupled routing and excess storage (CREST) distributed hydrological model. Hydrol. Sci. J. 2011, 56, 84–98. [Google Scholar] [CrossRef]
  15. Hong, Y.; Adler, R.F.; Hossain, F.; Curtis, S.; Huffman, G.J. A first approach to global runoff simulation using satellite rainfall estimation. Water Resour. Res. 2007, 43, W5802. [Google Scholar] [CrossRef]
  16. Liang, X.; Lettenmaier, D.P.; Wood, E.F. A simple hydrologically based model of land surface water and energy fluxes for general circulation models. J. Geophys. Res. Atmos. 1994, 99, 14415–14428. [Google Scholar] [CrossRef]
  17. Li, F.; Huang, J.; Zeng, G.; Yuan, X.; Li, X.; Liang, J.; Wang, X.; Tang, X.; Bai, B. Spatial risk assessment and sources identification of heavy metals in surface sediments from the Dongting Lake, Middle China. J. Geochem. Explor. 2013, 132, 75–83. [Google Scholar] [CrossRef]
  18. Yuan, Y.; Zeng, G.; Liang, J.; Li, X.; Li, Z.; Zhang, C.; Huang, L.; Lai, X.; Lu, L.; Wu, H.; et al. Effects of landscape structure, habitat and human disturbance on birds: A case study in East Dongting Lake wetland. Ecol. Eng. 2014, 67, 67–75. [Google Scholar] [CrossRef]
  19. Yang, Y.; Cao, X.; Zhang, M.; Wang, J. Occurrence and distribution of endocrine-disrupting compounds in the Honghu Lake and East Dongting Lake along the Central Yangtze River, China. Environ. Sci. Pollut. Res. Int. 2015, 22, 17644–17652. [Google Scholar] [CrossRef]
  20. Xie, Y.-H.; Yue, T.; Xin-Sheng, C.; Feng, L.; Zheng-Miao, D. The impact of Three Gorges Dam on the downstream eco-hydrological environment and vegetation distribution of East Dongting Lake. EcoHydrology 2015, 8, 738–746. [Google Scholar] [CrossRef]
  21. Zheng, J.-M.; Wang, L.-Y.; Li, S.-Y.; Zhou, J.-X.; Sun, Q.-X. Relationship between community type of wetland plants and site elevation on sandbars of the East Dongting Lake, China. For. Stud. China 2009, 11, 44–48. [Google Scholar] [CrossRef]
  22. Teng, J.; Vaze, J.; Dutta, D.; Marvanek, S. Rapid Inundation Modelling in Large Floodplains Using LiDAR DEM. Water Resour. Manag. 2015, 29, 2619–2636. [Google Scholar] [CrossRef]
  23. Markham, B.L.; Storey, J.C.; Williams, D.L.; Irons, J.R. Landsat sensor performance: History and current status. IEEE Trans. GeoSci. Remote Sens. 2004, 42, 2691–2694. [Google Scholar] [CrossRef]
  24. Chen, J.; Zhu, X.; Vogelmann, J.E.; Gao, F.; Jin, S. A simple and effective method for filling gaps in Landsat ETM+ SLC-off images. Remote Sens. Environ. 2011, 115, 1053–1064. [Google Scholar] [CrossRef]
  25. Gorelick, N.; Hancher, M.; Dixon, M.; Ilyushchenko, S.; Thau, D.; Moore, R. Google Earth Engine: Planetary-scale geospatial analysis for everyone. Remote Sens. Environ. 2017, 202, 18–27. [Google Scholar] [CrossRef]
  26. Wang, C.; Jia, M.; Chen, N.; Wang, W. Long-Term Surface Water Dynamics Analysis Based on Landsat Imagery and the Google Earth Engine Platform: A Case Study in the Middle Yangtze River Basin. Remote Sens. 2018, 10, 1635. [Google Scholar] [CrossRef]
  27. Stanley, U.O.; Udo, S.; Jürgen, B.; Uwe, A.S. A novel approach in monitoring land-cover change in the tropics: Oil palm cultivation in the Niger Delta, Nigeria. J. Geogr. Soc. Berl. 2016, 147, 40–52. [Google Scholar] [CrossRef]
  28. McFeeters, S.K. The use of the Normalized Difference Water Index (NDWI) in the delineation of open water features. Int. J. Remote Sens. 2007, 17, 1425–1432. [Google Scholar] [CrossRef]
  29. Xu, H. Modification of normalised difference water index (NDWI) to enhance open water features in remotely sensed imagery. Int. J. Remote Sens. 2007, 27, 3025–3033. [Google Scholar] [CrossRef]
  30. Lu, W.; Yu, Q.; Yu, W. Water extraction in SAR images using GLCM and Support vector Machine. In Proceedings of the IEEE International Conference on Signal Processing, Beijing, China, 24–28 October 2010. [Google Scholar] [CrossRef]
  31. Zhou, Y.; Dong, J.; Xiao, X.; Xiao, T.; Yang, Z.; Zhao, G.; Zou, Z.; Qin, Y. Open Surface Water Mapping Algorithms: A Comparison of Water-Related Spectral Indices and Sensors. Water 2017, 9, 256. [Google Scholar] [CrossRef]
  32. Huang, C.; Chen, Y.; Zhang, S.; Wu, J. Detecting, Extracting, and Monitoring Surface Water from Space Using Optical Sensors: A Review. Rev. Geophys. 2018, 56, 333–360. [Google Scholar] [CrossRef]
  33. Feyisa, G.L.; Meilby, H.; Fensholt, R.; Proud, S.R. Automated Water Extraction Index: A new technique for surface water mapping using Landsat imagery. Remote Sens. Environ. 2014, 140, 23–35. [Google Scholar] [CrossRef]
  34. Ostu, N. A threshold selection method from gray-histogram. IEEE Trans. Syst. Man Cybern. 2007, 9, 62–66. [Google Scholar] [CrossRef]
  35. Du, Y.; Zhang, Y.; Ling, F.; Wang, Q.; Li, W.; Li, X. Water Bodies’ Mapping from Sentinel-2 Imagery with Modified Normalized Difference Water Index at 10-m Spatial Resolution Produced by Sharpening the SWIR Band. Remote Sens. 2016, 8, 354. [Google Scholar] [CrossRef]
  36. Lei, J.; Li, Z.; Bruce, W. Analysis of Dynamic Thresholds for the Normalized Difference Water Index. Photogramm. Eng. Remote Sens. 2009, 75, 1307–1317. [Google Scholar] [CrossRef]
  37. Jiang, H.; Feng, M.; Zhu, Y.; Lu, N.; Huang, J.; Xiao, T. An Automated Method for Extracting Rivers and Lakes from Landsat Imagery. Remote Sens. 2014, 6, 5067–5089. [Google Scholar] [CrossRef] [Green Version]
Figure 1. The location of East Dongting Lake.
Figure 1. The location of East Dongting Lake.
Remotesensing 11 01585 g001
Figure 2. The flowchart of the approach in this paper.
Figure 2. The flowchart of the approach in this paper.
Remotesensing 11 01585 g002
Figure 3. Procedure to discretize the study area: (1) Set the envelope (bounding box) of the research area; and (2) divide the area inside of the envelope into many square grid cells with a given length and width (e.g., the resolution of the images used in this study).
Figure 3. Procedure to discretize the study area: (1) Set the envelope (bounding box) of the research area; and (2) divide the area inside of the envelope into many square grid cells with a given length and width (e.g., the resolution of the images used in this study).
Remotesensing 11 01585 g003
Figure 4. Procedure to resample the image: (1) Obtain images and the discretization of the study area; and (2) use nearest-neighbor interpolation to complete remote sensing image resampling.
Figure 4. Procedure to resample the image: (1) Obtain images and the discretization of the study area; and (2) use nearest-neighbor interpolation to complete remote sensing image resampling.
Remotesensing 11 01585 g004
Figure 5. The procedure of relating water levels to pixels: (1) Find the water level value at the specific date when the image was acquired; (2) relate the water level value to pixels in that image, and reserve the pixel values that represent dry or wet (thus, every pixel in that image will have two kinds of data at the same time, one of which represents dry or wet, while the other represents water level).
Figure 5. The procedure of relating water levels to pixels: (1) Find the water level value at the specific date when the image was acquired; (2) relate the water level value to pixels in that image, and reserve the pixel values that represent dry or wet (thus, every pixel in that image will have two kinds of data at the same time, one of which represents dry or wet, while the other represents water level).
Remotesensing 11 01585 g005
Figure 6. Relating water levels to images and forming inundation records for every grid cell in the study area.
Figure 6. Relating water levels to images and forming inundation records for every grid cell in the study area.
Remotesensing 11 01585 g006
Figure 7. Inundation records: The cell in blue with a water level value means that the grid cell is submerged from an observation under that water level value, and the cell in white with a water level means that the grid cell is observed to be unsubmerged under that water level.
Figure 7. Inundation records: The cell in blue with a water level value means that the grid cell is submerged from an observation under that water level value, and the cell in white with a water level means that the grid cell is observed to be unsubmerged under that water level.
Remotesensing 11 01585 g007
Figure 8. ‘Abnormal records’ in inundation records of one grid cell and the results of removing abnormal records.
Figure 8. ‘Abnormal records’ in inundation records of one grid cell and the results of removing abnormal records.
Remotesensing 11 01585 g008
Figure 9. Simulating and predicting the flood extent under a certain water level: The grid cell is submerged if the water level is higher than the threshold in that grid cell. Otherwise, the grid cell is not submerged.
Figure 9. Simulating and predicting the flood extent under a certain water level: The grid cell is submerged if the water level is higher than the threshold in that grid cell. Otherwise, the grid cell is not submerged.
Remotesensing 11 01585 g009
Figure 10. The predictions and the observations of inundation: (a) Observation from 1 July 2014 to 30 September 2014; (b) prediction from 1 July 2014 to 30 September 2014; (c) observation from 1 April 2015 to 30 June 2015; (d) prediction from 1 April 2015 to 30 June 2015; (e) observation from 1 October 2015 to 31 December 2015; (f) prediction from 1 October 2015 to 31 December 2015; (g) observation from 1 January 2016 to 31 March 2016; and (h) prediction from 1 January 2016 to 31 March 2016.
Figure 10. The predictions and the observations of inundation: (a) Observation from 1 July 2014 to 30 September 2014; (b) prediction from 1 July 2014 to 30 September 2014; (c) observation from 1 April 2015 to 30 June 2015; (d) prediction from 1 April 2015 to 30 June 2015; (e) observation from 1 October 2015 to 31 December 2015; (f) prediction from 1 October 2015 to 31 December 2015; (g) observation from 1 January 2016 to 31 March 2016; and (h) prediction from 1 January 2016 to 31 March 2016.
Remotesensing 11 01585 g010
Figure 11. The variations in water level at Chenglingji station from 1 January 2013 to 31 December 2013.
Figure 11. The variations in water level at Chenglingji station from 1 January 2013 to 31 December 2013.
Remotesensing 11 01585 g011
Figure 12. The variations in the inundation area in East Dongting Lake from 1 January 2013 to 31 December 2013, and the variations in the inundation extent boundary every four days from 5 May 2013 to 21 May 2013.
Figure 12. The variations in the inundation area in East Dongting Lake from 1 January 2013 to 31 December 2013, and the variations in the inundation extent boundary every four days from 5 May 2013 to 21 May 2013.
Remotesensing 11 01585 g012
Figure 13. The number of images during each month used to build the prediction model.
Figure 13. The number of images during each month used to build the prediction model.
Remotesensing 11 01585 g013
Figure 14. The distributions of gaps: (a) Gaps calculated from water level values from June to September; and (b) gaps calculated from water level values from January to March.
Figure 14. The distributions of gaps: (a) Gaps calculated from water level values from June to September; and (b) gaps calculated from water level values from January to March.
Remotesensing 11 01585 g014
Table 1. Images used in the study.
Table 1. Images used in the study.
YearDateAmount
20022002-07-08   2002-08-02   2002-09-043
20032003-04-16   2003-05-01   2003-07-28   2003-08-04   2003-08-28
2003-09-22   2003-10-08   2003-11-16   2003-12-28
9
20042004-07-22   2004-07-28   2004-09-16    2004-09-24   2004-11-02
2004-11-18   2004-12-04   2004-12-12
8
20052005-04-04   2005-04-20   2005-05-06   2005-05-28   2005-06-14
2005-06-24   2005-07-24   2005-08-01   2005-09-12   2005-09-18
2005-10-28   2005-11-20
12
20062006-04-08   2006-06-01   2006-06-10   2006-06-16   2006-07-04
2006-07-12   2006-07-20   2006-08-12   2006-08-20   2006-08-28
2006-09-20   2006-11-01   2006-11-08   2006-12-20   2006-12-26
15
20072007-01-28   2007-02-04   2007-02-12   2007-03-08   2007-04-10
2007-05-04   2007-05-20   2007-06-28   2007-07-06   2007-07-16
2007-08-01   2007-08-08   2007-09-16
13
20082008-02-16   2008-03-02   2008-03-12   2008-04-28   2008-05-20
2008-06-06   2008-07-16   2008-08-08   2008-09-10   2008-09-20
2008-10-04   2008-10-12   2008-11-28   2008-12-08   2008-12-16
15
20092009-01-16   2009-02-10   2009-03-14   2009-03-20   2009-07-12
2009-07-20   2009-08-20   2009-08-28   2009-09-06   2009-09-22
2009-09-28
11
20102010-07-30   2010-08-08   2010-09-01   2010-09-164
20112011-04-28   2011-05-08   2011-05-30   2011-06-08   2011-07-26
2011-08-18   2011-09-04
7
2012
20132013-04-10   2013-04-26   2013-05-12   2013-05-28   2013-06-04
2013-06-12   2013-07-22   2013-08-01   2013-08-08   2013-08-31
2013-09-16   2013-10-04   2013-10-10   2013-10-26   2013-11-04
2013-12-06   2013-12-12   2013-12-22   2013-12-28
19
20142014-01-14   2014-01-24   2014-01-30   2014-07-08   2014-08-04
2014-08-20   2014-09-28   2014-10-06   2014-10-22   2014-11-14
2014-12-16
11
20152015-04-16   2015-05-08   2015-10-16   2015-10-24   2015-10-30
2015-11-26
6
20162016-02-04   2016-02-29   2016-04-18   2016-05-12   2016-06-04
2016-07-24   2016-07-30   2016-08-16   2016-08-24   2016-09-24
2016-10-02   2016-10-18   2016-11-28   2016-12-04   2016-12-30
15
Table 2. The accuracies of predictions.
Table 2. The accuracies of predictions.
Composite ImageAccuracyKappa
2014-07-01 to 2014-09-300.961140.91856
2015-04-01 to 2015-06-300.959530.85135
2015-10-01 to 2015-12-310.943620.83917
2016-01-01 to 2016-03-310.932610.67293

Share and Cite

MDPI and ACS Style

Chen, Z.; Luo, J.; Chen, N.; Xu, R.; Shen, G. RFim: A Real-Time Inundation Extent Model for Large Floodplains Based on Remote Sensing Big Data and Water Level Observations. Remote Sens. 2019, 11, 1585. https://0-doi-org.brum.beds.ac.uk/10.3390/rs11131585

AMA Style

Chen Z, Luo J, Chen N, Xu R, Shen G. RFim: A Real-Time Inundation Extent Model for Large Floodplains Based on Remote Sensing Big Data and Water Level Observations. Remote Sensing. 2019; 11(13):1585. https://0-doi-org.brum.beds.ac.uk/10.3390/rs11131585

Chicago/Turabian Style

Chen, Zeqiang, Jin Luo, Nengcheng Chen, Ren Xu, and Gaoyun Shen. 2019. "RFim: A Real-Time Inundation Extent Model for Large Floodplains Based on Remote Sensing Big Data and Water Level Observations" Remote Sensing 11, no. 13: 1585. https://0-doi-org.brum.beds.ac.uk/10.3390/rs11131585

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop