Next Article in Journal
A High-Performance Database Management System for Managing and Analyzing Large-Scale SNP Data in Plant Genotyping and Breeding Applications
Next Article in Special Issue
Evaluation of Deep Learning for Automatic Multi-View Face Detection in Cattle
Previous Article in Journal
Antioxidant, Antifungal and Phytochemical Investigations of Capparis spinosa L.
Previous Article in Special Issue
Lettuce Growth Pattern Analysis Using U-Net Pre-Trained with Arabidopsis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Estimation and Forecasting of Rice Yield Using Phenology-Based Algorithm and Linear Regression Model on Sentinel-II Satellite Data

1
Department of Space Science, Institute of Space Technology, P.O. Box 2750, Islamabad 44000, Pakistan
2
Institute of Soil and Environmental Sciences, University of Agriculture, Faisalabad 38040, Pakistan
3
Agricultural Remote Sensing Laboratory (ARSL), National Centre of GIS and Space Application (NCGSA), University of Agriculture, Faisalabad 38040, Pakistan
4
Institute of Agriculture and Resource Economics, University of Agriculture, Faisalabad 38040, Pakistan
5
Department of Applied Mathematics and Statistics, Institute of Space Technology, P.O. Box 2750, Islamabad 44000, Pakistan
6
Department of Agronomy, Faculty of Agriculture, University of Agriculture, Faisalabad 38040, Pakistan
7
Sustainable Rice Production, Galaxy Rice Mills Pvt Ltd., Gujranwala 52230, Pakistan
*
Authors to whom correspondence should be addressed.
Submission received: 11 September 2021 / Revised: 7 October 2021 / Accepted: 12 October 2021 / Published: 19 October 2021
(This article belongs to the Special Issue Digital Innovations in Agriculture)

Abstract

:
Rice is a primary food for more than three billion people worldwide and cultivated on about 12% of the world’s arable land. However, more than 88% production is observed in Asian countries, including Pakistan. Due to higher population growth and recent climate change scenarios, it is crucial to get timely and accurate rice yield estimates and production forecast of the growing season for governments, planners, and decision makers in formulating policies regarding import/export in the event of shortfall and/or surplus. This study aims to quantify the rice yield at various phenological stages from hyper-temporal satellite-derived-vegetation indices computed from time series Sentinel-II images. Different vegetation indices (viz. NDVI, EVI, SAVI, and REP) were used to predict paddy yield. The predicted yield was validated through RMSE and ME statistical techniques. The integration of PLSR and sequential time-stamped vegetation indices accurately predicted rice yield (i.e., maximum R2 = 0.84 and minimum RMSE = 0.12 ton ha−1 equal to 3% of the mean rice yield). Moreover, our results also established that optimal time spans for predicting rice yield are late vegetative and reproductive (flowering) stages. The output would be useful for the farmer and decision makers in addressing food security.

1. Introduction

The rapid increase in the world population exerts pressure on the agriculture sector and threatening the food security of the world [1]. Among cereals, rice is one of the prime sources of food with high nutritive value (i.e., containing carbohydrate, vitamins (B, E, thiamine), and minerals (Ca, Mg, Fe). Rice is widely grown, consumed globally (i.e., daily food of 3.5 billion people worldwide), and accounts for 19% of the dietary energy [2]. Globally, 90% of the rice comes from Asia, which is approximately 640 million tons per annum [3,4]. Pakistan ranks 11th at the global rice production list and contributes 8% to the world’s total rice trade [3]. Pakistan produced seven (7) million tons of rice in the year 2017–2018 and earned a foreign exchange of two-billion dollars ($USD) from rice export [5]. The high-quality nutritious rice (e.g., basmati) produced in the country is available at affordable price in the international market and thus contributing in food security for the increasing global population [6].
Timely and accurate predictions of crop yield before harvest at a large scale is critical for food security and administrative planning, especially in the current continually changing global environment and international situation [7,8]. Different approaches have been adopted for precise yield estimation and each method has its own strengths and limitations. For instance, the traditional field surveys and crop statistics are useful for precisely estimating crop yield; however, when crop yield prediction of the large region is desired, the surveys prove inadequate due to budget, time, and large skilled manpower constraints [9]. The use of Earth observation data (remote sensing) offers an effective system for monitoring agriculture and quantifying crop yield at large spatial extent. The remotely sensed solution is fast, cost-efficient, and non-destructive [10,11]. In addition, the repetitive data acquisition capability of remote sensing sensors makes them an ideal choice for retrieving temporal information of crop phenology, plants health (stress), response to weather and soil nutrients (i.e., manure and fertilizer), variation in plant biomass, and ultimately its effect on yield production [12,13].
Satellite remote sensing also enables crop yield estimation at field, landscape, and regional scales for making policies and ensuring food security [14,15]). Yield estimation of various crops, such as wheat [11], corn [16], and sugar beet [17] is done successfully using assimilation algorithms on RS data. In recent research, two approaches are commonly used for this purpose: one is canopy reflectance data, and the other is based on the spectral indices. The free availability of optical remote sensing data of Sentinel-2 satellite with multiple spectral bands in the red, red edge, and near infrared (NIR) is making RS an ideal choice for monitoring agricultural crops, vegetation phenology [18] (Caballero et al., 2020), temporal variability in cropping [19], as well as environmental monitoring and land cover mapping [20].
Different vegetation indices (VIs) derived from satellite images are effective indicators of vegetation status and are positively correlated with crop yield. The Normalized Difference Vegetation Index (NDVI) has been widely used for predicting crop yield and identifying growth stages [20,21,22]. Similarly, other variants of NDVI such as Soil Adjusted Vegetation Index (SAVI) and Enhanced Vegetation Index (EVI) have been found effective for crop growth monitoring in the initial filling stages of the crop [21,22]. While using canopy reflectance data to directly estimate crop yield, multivariable analysis methods are commonly introduced to support the dataset analysis. Partial least squares regression (PLSR), stepwise multiple linear regression (SMLR), artificial neural network (ANN), etc. are helpful to construct and validate the multivariate remote sensing models of estimating the yield and improve the accuracy of crop yield estimation through satellite remote sensing, specifically when analyzing the quantitative relationship between RS variables obtained from satellite images and crop yield [11].
The impacts of different phenological and growth stages (i.e., vegetative, reproductive, and ripening) on yield production are rarely explored. Very few researchers quantified the impact of growth stages on crop yields and assigned different weightage to different growth stages [21,23]. Some studies conclude that physiological status (e.g., crop growth) and biochemical contents (e.g., nitrogen) of pre-heading stage is more crucial [24,25], while other found that high rice biomass at post-heading stages is essential for optimum production [23]. Similarly, few studies highlight the relationship between late reproductive growth period and rice yield [26,27]. The overall objective of this study is to forecast rice yield and investigate the relationship between remote sensing derived VIs at different phenological stages of rice crop and its yield. The study also aims to identify the most critical growth period (phenological stage) for quantification of rice yield with hyper-temporal sentinel-II and derived-indices using Partial Least Square Regression (PLSR) model to improve the estimation accuracy of rice yield by remote sensing.

2. Materials and Methods

2.1. Study Area

This study was carried out at Sheikhupora district (Latitude 31°30′00″ to 31°65′30″ N and Longitude 73°40′00″ to 74°23′00″ E) of Punjab, Pakistan (Figure 1). Sheikhupora district is located between Ravi and Chenab rivers and irrigated by river water from two canals (Upper Chenab and Khanpur Canals). Climatically the region is dominated by the wet monsoon, thus making it favorable for rice crop. The annual precipitation ranges from 120 to 720 mm, which mainly occurs between July and August [28]. The study area is dominated by alluvial clay and loamy soil rich in humus and mineral composition. The mineralogical, chemical and geotechnical compositions of soil (pH = 8, EC = 1.1−4.5 dS m−1 with soil field capacity from 45–71%) make the region ideal for rice cultivation [29]. Due to the favorable conditions, Sheikhupora district is the second largest rice-producing district in Pakistan with an average production of 2–2.5 million tons annually [5,30].

2.2. Data Collection

2.2.1. Field Data

Stratified-random-sampling procedure was used to collect data from 137 plots well distributed in the study area. The rice fields with minimum size of 60 × 60 m (i.e., corresponding to the coarse pixel size of satellite images used in this study) were considered for the purpose of sampling, monitoring, and analysis. In most of the paddy fields, the rice crop was transplanted in start of July (after seed sown in the nursery at the start of June). These sampled plots were carefully monitored from transplanting till harvesting period. The rice produced in each plot (Figure 2) was carefully measured and recorded for further analysis.

2.2.2. Satellite Data Preprocessing and Vegetation Index

The optical remote sensing data of Sentinel-II satellite for year 2016 was used in this study. The availability of multiple spectral bands in the red, red edge, and near infrared (NIR) part of the EM spectrum with 10–20 m resolution makes Sentinel-II an ideal choice for studying vegetation phenology and monitoring agricultural crops for stress level, nutrient contents, pest attack, and yield estimation [31,32,33]. The cloud free Sentinel-2 time-series images (i.e., spanning from growing to harvesting phase) were pre-processed for atmospheric correction using Sen2Cor processing. The atmospherically corrected images were then used for computing vegetation indices (Table 1) from times series satellite images spanning from sowing till harvesting period of the rice crop. Vegetation indices are mathematical transformations using two or more spectral bands devised to enhance certain characteristics of vegetation [34]. Several images to cover the entire growth period of rice crop were used and vegetation indices were computed using Google Earth Engine (GEE).

2.3. Geo-Statistical Analysis

The PLSR is an established multivariate analysis technique commonly used in chemo-matric and hyperspectral data analysis [39,40]. The Partial Least Square Regression (PLSR) analyses were performed to all time-series vegetation indices (explanatory variables) with rice yields (response variable). In PLSR model development, the selection of optimum number of latent variables (LVs) is more critical, as increase of the number of LVs would improve the accuracy of the model, while selection of too many variables can lead to the over fitting and the error would increase [41]. To minimize this over fitting problem, the optimal number of LVs was selected based on achieving a combination of a high R2 and a low root mean squared error of the prediction (RMSEP) (Figure 3). The PLSR model was evaluated by plotting the 1:1 relationship graph between the predicted and measured values of the yield (Figure 3). The evaluation indices were the R2 and the RMSE. A larger R2 shows that the model is better, while smaller RMSE values indicate the stronger estimation ability of the model. To evaluate the performances of the prediction models, leave-one-out cross-validation [42] was used, in which the model was iteratively trained on multiple time series data and then used to predict yield. PLSR can be mathematically expressed as:
Y = a +   b 1 X 1 +   b 2 X 2 + +   b n X n
where Y is response variable (rice yield), X1, X2–Xn are the selected latent variables (LVs), which are the time series images, in this case a is the intercept and b1–bn represent the regression coefficients (also known as β-coefficients) for different predictors.

2.4. Spatial Distribution and Mapping of Rice Yield

To model the spatial distribution of rice yields, a two-step procedure was adopted. The thematic map of rice crop was developed using phenological based mapping algorithms (Figure 4). In this routine, the phenology profiles (or signatures) serves as numerical key for discerning different crop types grown in the region of interest [43,44]. The time series vegetation indices (e.g., NDVI profiles computed from optical data for the entire growth span of rice crop: 130 days) were used to demarcate the rice crop and to compute the rice cultivated area. The phenological-mapping-routine takes into account the entire range (minimum–maximum) of vegetation index values (e.g., NDVI in this case) at each time stamp (e.g., from transplanting till harvesting) and can be mathematically expressed as:
R = ( NDVI 1 min   and   NDVI 1 max )   and   ( NDVI 2 min   and   NDVI 2 max )   NDVI n min   and   NDVI n max )
where R represents response variable (rice), and NDVI1, NDVI2, and NDVIn represent NDVIs derived from RS data at different crop phenological stages starting from transplanting till harvesting.
To develop the spatial distribution maps of rice yield, the models of statistical analysis (the regression equations of PLSR analysis) were inverted to the time series vegetation indices of rice masked areas (developed from phenological based mapping) and were validated with an independent dataset.

3. Results

3.1. Rice Yield Estimation (Field Level Data)

The statistical descriptions of the measured rice yield are summarized in Table 2. In calibration datasets (with sample size n = 96), the rice yield varies between 3.06 ton/ha and 4.15 ton/ha with mean equal to 3.70 ton/ha. The standard deviation (SD) was ±0.31 ton/ha and coefficient of covariance (CV) equal to 0.083 ton/ha. The graphical display reflects that the calibration dataset is near normally distributed.
The summary statistics of validation sets (n = 41) shows that rice yield ranges between 3.16 ton/ha and 4.15 ton/ha with average equal to 3.71 ton/ha. The standard deviation (SD) was ±0.29 and coefficient of covariance (CV) equal to 0.078 ton/ha. The validation data are also normally distributed (Table 2: see the graphical display at last column).

3.2. Variation in Temporal Profiles of Vegetation Indices with Rice Phenology

The temporal profiles of vegetation index spanning across the full growth period of rice crop are shown in Figure 5. The vegetation index (e.g., NDVI) values were least (minimum) at transplanting phase and showed gradual increase with increase in vegetative parts (Figure 5). The vegetation indices reached at peak in the late vegetative phase and continually maintained high values (e.g., spectral plateau) till flowering phase. At post flowering phase (e.g., ripening phase), the vegetation index values started declining and reached its minimum at fully ripened harvesting phase (see Figure 5).

3.3. Prediction of Rice Yield and the Performance of Vegetation Indices

The prediction of rice yields using PLSR and time series vegetation indices are summarized Table 3. The integration of PLSR and sequential-time-stamped-vegetation indices were found effective for quantifying rice yields. The time series NDVI yielded the highest R2 = 0.83 (lowest RMSEcv = 0.12 ton/ha) followed by EVI (R2 = 0.80, RMSEcv = 0.14 ton/ha), SAVI (R2 = 0.79, RMSEcv = 0.14 ton/ha), and REP (R2 = 0.64, RMSEcv = 0.17 ton/ha). The performance of different indices (NDVI, SAVI, EVI, REP) for rice yield estimation were consistent for both calibration and validation datasets (Table 3, Figure 6c,f,i,l).
Using the time series profiles of NDVI, EVI and SAVI, the PLSR models selected six latent variables (Figure 6, Table 3). The selected six latent variables explained most of the variance (e.g., as the case of NDVI where R2 = 0.83) and the addition of further variables hardly improve the model performance (e.g., the total 21 variables yield maximum R2 of 0.84) (Figure 6a,d,g,j). The important latent variable (in this study, time stamped vegetation indices) was located at late vegetative, reproductive (flowering), and ripening phases of rice growth (Figure 6b,e,h,k). Using time series Red Edge Position (REP) data, the number of selected latent variables were five (05) with high regression coefficient (or B-coefficient) values at flowering, ripening, and late vegetative phases (Figure 6c).

3.4. Spatial Varability in Rice Yield Potential

The distribution map (developed from best predicting PLSR model, time series vegetation indices and map of rice grown area) reflects that rice yield distribution varies in space and ranges between 1.5 to 4.2 ton/ha (Figure 7a). The upper limit of rice yields (i.e., 4.20 ton/ha) in the distribution map were closely matching with the ground measured yield (4.15 ton/ha). The minimum limit was underestimated in the spatial distribution map of rice yield (1.5 ton/ha) compared to the ground measured minimum yield (3.06 ton/ha). The validation of spatial distribution maps against independent ground measured yield data (30% of the total samples) confirms that yield was predicted with high accuracy (see Figure 7b). The high R2 (0.83) and low RMSE (0.14 tons/ha) manifested a close match between measured and predicted rice yields (Figure 7b).

4. Discussion

The field data (Table 2) reflect that the measured rice yield production (minimum (3.06 ton/ha), mean (3.7 ton/ha), maximum 4.15 (ton/ha) in the study area is within the limits of rice crop statistics within the country (i.e., ranging from 2.4 ton/ha to 10 ton/ ha). These numbers are far less than the statistics of the neighboring countries (i.e., China, Vietnam, Bangladesh) and could be attributed to the variety of rice [45], uneven water usage, weeds and pest attacks, and post-harvest loses (e.g., shattering and improper drying and storing etc.). The super basmati grown in the study area produces high quality rice and is famous for its aromatic fragrance; however, it is less productive compared to other hybrid varieties.
The time series profiles show that vegetation indices (Figure 8a) are minimum at the transplantation stage and gradually increase in vegetative phase (tillering, panicle, and flowering stages). A decline was observed in the vegetation indices after post flowering phases (e.g., dough, ripening). This initial increase in vegetation index values could be associated with increase in leaf area coverage (LAI, biomass) and the post flowering phase decline could be attributed to the senescing of rice crop. The temporal variation in vegetation indices (in this study) are in line with the findings of previous studies [46,47], where the peak greenness is achieved at flowering/milk phases and decline was observed in dough stages and reaching minimum at ripening phase (Figure 8).
The integration of PLSR and time series vegetation indices accurately predicted rice yields (the maximum R2 = 0.83 and least RMSE = 0.12 ton/ha (3.12% of the mean yield). The slightly better performance of NDVI (relative to SAVI, EVI, and REP) could be attributed to the canopy characteristics (e.g., structural, vegetation percent cover) of rice crop. The abundance of stem and leaf blades of rice crop obscures the background soil visibility and leads to enhanced vegetation (rice) reflectance signals, thus allowing slope-based-index (such as NDVI) to perform more precise estimates of rice yield [48,49].
Using PLSR regression, the prediction accuracy enhances with augmenting the number of variables (NDVI, EVI, SAVI, and REP) and leads to high R2 and low RMSE until the model stabilized at a certain point. In this study, the PLSR model saturate at the addition of six latent variables (Figure 6a,d,g,j) and captured maximum variance present in dataset. The addition of further variables hardly improves the prediction of rice yield and displays a flat line [50]. The selected latent variables belong to late vegetative, reproductive (panicle, flowering, milky), and ripening phases (Figure 6b,e,h,k) and reflects the critical importance of these growth stages in rice yield production. The picking of late vegetative phase may be associated with an increase in biomass and leaf covered area (leaves and shoots fully develop at this stage and gain maximum crop height). The high correlation at reproductive phases could be associated with the formation panicle, flowering, milk, dough, and maturity of grain, which directly influence the crop yields. The results of this study are also consistent with the findings of existing literature, where booting stage highly influences the rice yield production [51]. The outcomes of this study help in estimating the rice yield and highlight the critical phases in the life cycle (of rice crop) where monitoring and human intervention (such as usage of water and agrochemicals) can enhance the yield production.

5. Conclusions

This study aims to accurately quantify rice yield and identify the critical growth stages that influence the rice yield production. The integration of PLSR and time series vegetation indices (i.e., spanning across the entire rice crop growth period) results in accurate predictions of rice yield. Among vegetation indices, NDVI performs the best (yield high R2 and low RMSE) followed by EVI, SAVI, and REP. Using the time stamped vegetation indices, the PLSR coefficients identified the growth stages that influence the rice yield. The growth stages (selected latent variables) belong to late vegetative, reproductive (panicle, flowering, milky), and ripening phases. The selected critical growth stages were common in all four types of vegetation indices (i.e., NDVI, EVI, SAVI, and REP) used. This study concludes that PLSR can effectively be used for rice yield estimation and identifying critical stages of the rice growth cycle. The precise yield estimation (rice in this case) allows decision makers to strategize policy regarding yield import and export. The outcome of this study can also help the farmers to monitor rice at critical time spans and allow them to intervene (e.g., usage of water, fertilizer, pesticides etc.) in a timely manner. The timely interventions thus help in producing more yields which in turn is essential for minimizing hunger (SDG 2), alleviating poverty (SDG 1), ensuring land (SDG 15) and food security.
However, the present study did not compare the accuracy of PLS algorithm with artificial neural networks, support vector machines, other geo-statistics, etc., for yield forecasting. These would be interesting directions for future study.

Author Contributions

The study was conceptualization by A.N. and S.U.; methodology, A.A. (Azhar Abbas), M.S.I.; data collection and validation, Z.A.S. and K.H.; formal analysis, M.S. (Muhammad Shakir) and M.S. (Munawar Shah); Planning and Execution, M.U.B.; writing—review and editing, A.N. and A.A. (Asad Ali). All authors have read and agreed to the published version of the manuscript.

Funding

We are thankful for the financial support of University of Agriculture Faisalabad (APC funding) and Galaxy Rice Mills Pvt Ltd. Gujranwala for logistics during field data collection.

Institutional Review Board Statement

This study is ‘Not applicable’ for Institutional Review Board and Ethical review as this research work does not involve any humans or animals or biological material.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data will be made available for research purpose upon request.

Acknowledgments

We acknowledge the Copernicus Open Access Hub for providing freely available Sentinel-2 data. We also acknowledge M. Hamid Choudhary, GIS Centre The University of Punjab, Lahore Pakistan for helping in software and Field survey.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Godfray, H.C.J.; Beddington, J.R.; Crute, I.R.; Haddad, L.; Lawrence, D.; Muir, J.F.; Pretty, J.; Robinson, S.; Thomas, S.M.; Toulmin, C. Food Security: The Challenge of Feeding 9 Billion People. Science 2010, 327, 812–818. [Google Scholar] [CrossRef] [Green Version]
  2. Dawe, D. The contribution of rice research to poverty alleviation. In Redesigning Rice Photosynthesis to Increase Yield, Proceedings of the Workshop on the Quest to Reduce Hunger: Redesigning Rice Photosynthesis, Los Baños, Philippines, 30 November–3 December 1999; Sheehy, J.E., Mitchell, P.L., Hardy, B., Eds.; Elsevier: Amsterdam, The Netherlands, 2000; pp. 3–12. [Google Scholar]
  3. FAO. The State of Food and Agriculture (SOFA): Climate Change, Agriculture and Food Security; Food and Agriculture Organization of the United Nations (FAO): Rome, Italy, 2016. [Google Scholar]
  4. Bandumula, N. Rice Production in Asia: Key to Global Food Security. Proc. Natl. Acad. Sci. USA India Sect. B Biol. Sci. 2018, 88, 1323–1328. [Google Scholar] [CrossRef]
  5. PBS. Prices of Agricultural Commodities; Agriculture Statistics; Pakistan Bureau of Statistics (PBS): Islamabad, Pakistan, 2019; pp. 124–126. [Google Scholar]
  6. Wang, F.; Wang, F.; Zhang, Y.; Hu, J.; Huang, J.; Xie, J. Rice Yield Estimation Using Parcel-Level Relative Spectral Variables From UAV-Based Hyperspectral Imagery. Front. Plant Sci. 2019, 10, 453. [Google Scholar] [CrossRef] [Green Version]
  7. Gebbers, R.; Adamchuk, V.I. Precision Agriculture and Food Security. Science 2010, 327, 828–831. [Google Scholar] [CrossRef]
  8. Ji, Z.; Pan, Y.; Zhu, X.; Wang, J.; Li, Q. Prediction of crop yield using phenological information extracted from remote sensing vegetation index. Sensors 2021, 21, 1406. [Google Scholar] [CrossRef]
  9. Fang, H.; Liang, S.; Hoogenboom, G.; Teasdale, J.; Cavigelli, M. Corn-yield estimation through assimilation of remotely sensed data into the CSM-CERES-Maize model. Int. J. Remote Sens. 2008, 29, 3011–3032. [Google Scholar] [CrossRef]
  10. Lobell, D.B. The use of satellite data for crop yield gap analysis. Field Crop. Res. 2013, 143, 56–64. [Google Scholar] [CrossRef] [Green Version]
  11. Zhang, P.P.; Zhou, X.X.; Wang, Z.X.; Mao, W.; Li, W.X.; Yun, F.; Guo, W.S.; Tan, C.W. Using HJ-CCD image and PLS algorithm to estimate the yield of field-grown winter wheat. Sci. Rep. 2020, 10, 5173. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Savin, I.Y.; Isaev, V.A. Rice yield forecast based on satellite and meteorological data. Russ. Agric. Sci. 2010, 36, 424–427. [Google Scholar] [CrossRef]
  13. Shiu, Y.-S.; Chuang, Y.-C. Yield Estimation of Paddy Rice Based on Satellite Imagery: Comparison of Global and Local Regression Models. Remote Sens. 2019, 11, 111. [Google Scholar] [CrossRef] [Green Version]
  14. Peng, D.; Huang, J.; Li, C.; Liu, L.; Huang, W.; Wang, F.; Yang, X. Modelling paddy rice yield using MODIS data. Agric. For. Meteorol. 2014, 184, 107–116. [Google Scholar] [CrossRef]
  15. Gilardelli, C.; Stella, T.; Confalonieri, R.; Ranghetti, L.; Campos-Taberner, M.; García-Haro, F.J.; Boschetti, M. Downscaling rice yield simulation at sub-field scale using remotely sensed LAI data. Eur. J. Agron. 2019, 103, 108–116. [Google Scholar] [CrossRef]
  16. Ines, A.V.M.; Das, N.N.; Hansen, J.W.; Njoku, E.G. Assimilation of remotely sensed soil moisture and vegetation with a crop simulation model for maize yield prediction. Remote Sens. Environ. 2013, 138, 149–164. [Google Scholar] [CrossRef] [Green Version]
  17. Launay, M.; Guerif, M. Assimilating remote sensing data into a crop model to improve predictive performance for spatial applications. Agric. Ecosyst. Environ. 2005, 111, 321–339. [Google Scholar] [CrossRef]
  18. Caballero, I.; Fernández, R.; Escalante, O.M.; Mamán, L.; Navarro, G. New capabilities of Sentinel-2A/B satellites combined with in situ data for monitoring small harmful algal blooms in complex coastal waters. Sci. Rep. 2020, 10, 8743. [Google Scholar] [CrossRef]
  19. Basterrechea, D.A.; Parra, L.; Parra, M.; Lloret, J. A Proposal for Monitoring Grass Coverage in Citrus Crops Applying Time Series Analysis in Sentinel-2 Bands. In Proceedings of the 4th International Conference on Industrial IoT Technologies and Applications, Virtual Event; Springer: Cham, Switzerland, 11 December 2020; pp. 193–206. Available online: https://0-link-springer-com.brum.beds.ac.uk/chapter/10.1007/978-3-030-71061-3_12 (accessed on 30 September 2021).
  20. Phiri, D.; Simwanda, M.; Salekin, S.; Nyirenda, V.R.; Murayama, Y.; Ranagalage, M. Sentinel-2 data for land cover/use mapping: A review. Remote Sens. 2020, 12, 2291. [Google Scholar] [CrossRef]
  21. Zheng, H.; Cheng, T.; Yao, X.; Deng, X.; Tian, Y.; Cao, W.; Zhu, Y. Detection of rice phenology through time series analysis of ground-based spectral index data. Field Crop. Res. 2016, 198, 131–139. [Google Scholar] [CrossRef]
  22. Guo, C.; Zhang, L.; Zhou, X.; Zhu, Y.; Cao, W.; Qiu, X.; Cheng, T.; Tian, Y. Integrating remote sensing information with crop model to monitor wheat growth and yield based on simulation zone partitioning. Precis. Agric. 2018, 19, 55–78. [Google Scholar] [CrossRef]
  23. Murchie, E.H.; Yang, J.; Hubbart, S.; Horton, P.; Peng, S. Are there associations between grain-filling rate and photosynthesis in the flag leaves of field-grown rice? J. Exp. Bot. 2002, 53, 2217–2224. [Google Scholar] [CrossRef] [PubMed]
  24. Ntanos, D.A.; Koutroubas, S.D. Dry matter and N accumulation and translocation for Indica and Japonica rice under Mediterranean conditions. Field Crop. Res. 2002, 74, 93–101. [Google Scholar] [CrossRef]
  25. Ryu, C.; Suguri, M.; Umeda, M. Multivariate analysis of nitrogen content for rice at the heading stage using reflectance of airborne hyperspectral remote sensing. Field Crop. Res. 2011, 122, 214–224. [Google Scholar] [CrossRef] [Green Version]
  26. Horie, T.; Ohnishi, M.; Angus, J.F.; Lewin, L.G.; Tsukaguchi, T.; Matano, T. Physiological characteristics of high-yielding rice inferred from cross-location experiments. Field Crop. Res. 1997, 52, 55–67. [Google Scholar] [CrossRef]
  27. Yang, Q.; Shi, L.; Han, J.; Zha, Y.; Zhu, P. Deep convolutional neural networks for rice grain yield estimation at the ripening stage using UAV-based remotely sensed images. Field Crop. Res. 2019, 235, 142–153. [Google Scholar] [CrossRef]
  28. PMD-Pakistan Meteorological Department. Decadal Agromet Bulletin of Pakistan; National Agromet Centre (NAMC) Pakistan Meteorological Department: Islamabad, Pakistan, 2019; p. 8. [Google Scholar]
  29. Khaliq, T.; Gaydon, D.S.; Ahmad, M.-D.; Cheema, M.J.M.; Gull, U. Analyzing crop yield gaps and their causes using cropping systems modelling–A case study of the Punjab rice-wheat system, Pakistan. Field Crop. Res. 2019, 232, 119–130. [Google Scholar] [CrossRef]
  30. Qamar, W.; Younas, M.; Waseem, M. Price Fluctuations of Rice Crop in District Sheikhupura. J. Agric. Stud. 2019, 7, 227. [Google Scholar] [CrossRef]
  31. Transon, J.; D’Andrimont, R.; Maugnard, A.; Defourny, P. Survey of Hyperspectral Earth Observation Applications from Space in the Sentinel-2 context. Remote Sens. 2018, 10, 157. [Google Scholar] [CrossRef] [Green Version]
  32. Cohrs, C.W.; Cook, R.L.; Gray, J.M.; Albaugh, T.J. Sentinel-2 Leaf Area Index Estimation for Pine Plantations in the Southeastern United States. Remote Sens. 2020, 12, 1406. [Google Scholar] [CrossRef]
  33. Narin, O.G.; Abdikan, S. Monitoring of phenological stage and yield estimation of sunflower plant using Sentinel-2 satellite images. Geocarto. Int. 2020. [CrossRef]
  34. Thenkabail, P.S.; Smith, R.B.; De Pauw, E. Hyperspectral Vegetation Indices and Their Relationships with Agricultural Crop Characteristics. Remote Sens. Environ. 2000, 71, 158–182. [Google Scholar] [CrossRef]
  35. Rouse, J.W., Jr.; Haas, R.H.; Schell, J.A.; Deering, D.W. Monitoring Vegetation Systems in the Great Plains with ERTS; NASA Special Pub-351; NASA: Washington, DC, USA, 1974; pp. 309–317. [Google Scholar]
  36. Huete, A.R. A soil-adjusted vegetation index (SAVI). Remote Sens. Environ. 1988, 25, 295–309. [Google Scholar] [CrossRef]
  37. Liu, H.Q.; Huete, A. A feedback based modification of the NDVI to minimize canopy background and atmospheric noise. IEEE Trans. Geosci. Remote Sens. 1995, 33, 457–465. [Google Scholar] [CrossRef]
  38. Filella, I.; Penuelas, J. The red edge position and shape as indicators of plant chlorophyll content, biomass and hydric status. Int. J. Remote Sens. 1994, 15, 1459–1470. [Google Scholar] [CrossRef]
  39. Kawamura, K.; Watanabe, N.; Sakanoue, S.; Lee, H.-J.; Inoue, Y.; Odagawa, S. Testing genetic algorithm as a tool to select relevant wavebands from field hyperspectral data for estimating pasture mass and quality in a mixed sown pasture using partial least squares regression. Grassl. Sci. 2010, 56, 205–216. [Google Scholar] [CrossRef]
  40. Ullah, S.; Skidmore, A.K.; Ramoelo, A.; Groen, T.A.; Naeem, M.; Ali, A. Retrieval of leaf water content spanning the visible to thermal infrared spectra. SPRS J. Photogramm. Remote Sens. 2014, 93, 56–64. [Google Scholar] [CrossRef]
  41. Denham, M.C. Choosing the number of factors in partial least squares regression: Estimating and minimizing the mean squared error of prediction. J. Chemom. 2000, 14, 351–361. [Google Scholar] [CrossRef]
  42. Li, Y.; Guan, K.; Yu, A.; Peng, B.; Zhao, L.; Li, B.; Peng, J. Toward building a transparent statistical model for improving crop yield prediction: Modeling rainfed corn in the U.S. Field Crop. Res. 2019, 234, 55–65. [Google Scholar] [CrossRef]
  43. Gong, Z.; Kawamura, K.; Ishikawa, N.; Goto, M.; Wulan, T.; Alateng, D.; Yin, T.; Ito, Y. MODIS normalized difference vegetation index (NDVI) and vegetation phenology dynamics in the Inner Mongolia grassland. Solid Earth 2015, 6, 1185–1194. [Google Scholar] [CrossRef] [Green Version]
  44. Cai, Z.; Jönsson, P.; Jin, H.; Eklundh, L. Performance of Smoothing Methods for Reconstructing NDVI Time-Series and Estimating Vegetation Phenology from MODIS Data. Remote Sens. 2017, 9, 1271. [Google Scholar] [CrossRef] [Green Version]
  45. Laenoi, S.; Rerkasem, B.; Lordkaew, S.; Prom-u-thai, C. Seasonal variation in grain yield and quality in different rice varieties. Field Crop. Res. 2018, 221, 350–357. [Google Scholar] [CrossRef]
  46. Kuenzer, C.; Knauer, K. Remote sensing of rice crop areas. Int. J. Remote Sens. 2013, 34, 2101–2139. [Google Scholar] [CrossRef]
  47. Mosleh, M.K.; Hassan, Q.K.; Chowdhury, E.H. Application of Remote Sensors in Mapping Rice Area and Forecasting its Production: A Rev. Sensors 2015, 15, 769–791. [Google Scholar] [CrossRef] [Green Version]
  48. Panda, S.S.; Ames, D.P.; Panigrahi, S. Application of Vegetation Indices for Agricultural Crop Yield Prediction Using Neural Network Techniques. Remote Sens. 2010, 2, 673–696. [Google Scholar] [CrossRef] [Green Version]
  49. Mirasi, A.; Mahmoudi, A.; Navid, H.; Valizadeh Kamran, K.; Asoodar, M.A. Evaluation of sum-NDVI values to estimate wheat grain yields using multi-temporal Landsat OLI data. Geocarto. Int. 2019, 36, 1309–1324. [Google Scholar] [CrossRef]
  50. Arshad, M.; Ullah, S.; Khurshid, K.; Ali, A. Estimation of leaf water content from mid- and thermal-infrared spectra by coupling genetic algorithm and partial least squares regression. J. Appl. Remote Sens. 2018, 12, 022203. [Google Scholar] [CrossRef]
  51. Chang, K.-W.; Shen, Y.; Lo, J.-C. Predicting Rice Yield Using Canopy Reflectance Measured at Booting Stage. Agron. J. 2005, 97, 872–878. [Google Scholar] [CrossRef]
Figure 1. Map of the study area (sampling sites are marked as red square dots).
Figure 1. Map of the study area (sampling sites are marked as red square dots).
Agriculture 11 01026 g001
Figure 2. The pictorial view of different phenological or growth stages of rice in field conditions.
Figure 2. The pictorial view of different phenological or growth stages of rice in field conditions.
Agriculture 11 01026 g002
Figure 3. The RMSE and R2 plot. The RMSE is decreasing as the number of latent variable increases. After a certain number of latent variables, the decrease in RMSE was negligible and that was taken as the optimum number of variables for PLSR model development.
Figure 3. The RMSE and R2 plot. The RMSE is decreasing as the number of latent variable increases. After a certain number of latent variables, the decrease in RMSE was negligible and that was taken as the optimum number of variables for PLSR model development.
Agriculture 11 01026 g003
Figure 4. The framework of methodology followed for phenology-based rice mapping and yield estimation.
Figure 4. The framework of methodology followed for phenology-based rice mapping and yield estimation.
Agriculture 11 01026 g004
Figure 5. (a) The temporal profiles of vegetation index (NDVI) and (b) different growth stages of rice crop. The points are not equally spaced (subplot a) due to the unavailability of cloud free images of the study area (i.e., monsoon season; late July and early August).
Figure 5. (a) The temporal profiles of vegetation index (NDVI) and (b) different growth stages of rice crop. The points are not equally spaced (subplot a) due to the unavailability of cloud free images of the study area (i.e., monsoon season; late July and early August).
Agriculture 11 01026 g005
Figure 6. The analysis based on temporal NDVI, EVI, SAVI, and REP. The R2 increases and RMSE decreases with augmenting the number of variables unless it reaches saturation (dashed elliptical in Panels a,d,g,j). After certain number of latent variables, the decrease in RMSE was negligible and that was taken as the optimum number of variables for PLSR model development. The temporal profiles of vegetation indices are similar in shape except REP (Panels b,e,h,k). The regression coefficients lines show that sowing, late vegetative, flowering and ripening are important phases for predicting rice yields using PLSR. The measured vs. predicted (Panels c,f,i,l) manifest that yield was best predicted using temporal NDVI data (yielded highest R2 and lowest RMSE).
Figure 6. The analysis based on temporal NDVI, EVI, SAVI, and REP. The R2 increases and RMSE decreases with augmenting the number of variables unless it reaches saturation (dashed elliptical in Panels a,d,g,j). After certain number of latent variables, the decrease in RMSE was negligible and that was taken as the optimum number of variables for PLSR model development. The temporal profiles of vegetation indices are similar in shape except REP (Panels b,e,h,k). The regression coefficients lines show that sowing, late vegetative, flowering and ripening are important phases for predicting rice yields using PLSR. The measured vs. predicted (Panels c,f,i,l) manifest that yield was best predicted using temporal NDVI data (yielded highest R2 and lowest RMSE).
Agriculture 11 01026 g006
Figure 7. The distribution of rice yield varies spatially (ranging from 1.5 to 4.2 ton/ha) in the study area (a). The predicted yield (based on distribution map) shows strong relation with measured yield (b).
Figure 7. The distribution of rice yield varies spatially (ranging from 1.5 to 4.2 ton/ha) in the study area (a). The predicted yield (based on distribution map) shows strong relation with measured yield (b).
Agriculture 11 01026 g007
Figure 8. (a) The range and mean temporal profiles of Normalized Difference Vegetation Index (NDVI) spanning the entire growth period of rice crop. (b) The temporal profile of vegetation index (NDVI) is closely in-line with the findings of [47] (shown in plot “b”).
Figure 8. (a) The range and mean temporal profiles of Normalized Difference Vegetation Index (NDVI) spanning the entire growth period of rice crop. (b) The temporal profile of vegetation index (NDVI) is closely in-line with the findings of [47] (shown in plot “b”).
Agriculture 11 01026 g008
Table 1. Mathematical formulas of vegetation indices (NDVI, SAVI, EVI, and Red Edge Position).
Table 1. Mathematical formulas of vegetation indices (NDVI, SAVI, EVI, and Red Edge Position).
Vegetation Indices (VIs)References
N D V I = ρ ( N I R ) ρ ( R e d ) ρ ( N I R ) + ρ ( R e d ) Rouse et al. (1974) [35]
S A V I = 1 + L ρ ( N I R ) ρ ( R e d ) ρ ( N I R ) + ρ ( R e d ) + L
where L = 0.5, to minimize the brightness effect of soil
Huete (1988) [36]
E V I = G ρ ( N I R ) ρ ( R e d ) ρ ( N I R ) + C 1 × ρ ( R e d ) C 2 × ρ ( B l u e ) + L
where G = 2.5; L = 0.5 (Soil adjusted factor); C1 and C2 are constants to reduce aerosols effects.
Liu and Huete (1995) [37]
R e d   E d g e = ρ ( R e d ) + ρ ( R e d   E d g e 3 ) 2
R E P = 704 + 35   [ R e d   E d g e ρ ( R e d   E d g e 1 ) ρ ( R e d   E d g e 2 ) ρ ( R e d   E d g e 1 ) ]
where 704 and 35 represent interpolation constants that can be adjusted according to available band’s wavelength
Filella and Penuelas (1994) [38]
Table 2. Statistical description of the field measured rice yield with graphical displays of calibration and validation datasets.
Table 2. Statistical description of the field measured rice yield with graphical displays of calibration and validation datasets.
Dataset TypeSample Size (n)Minimum (ton/ha)Maximum (ton/ha)Mean (ton/ha)SD (ton/ha)CV (ton/ha)Graphical Distribution
Calibration963.064.153.700.310.083 Agriculture 11 01026 i001
Validation413.164.153.710.290.078 Agriculture 11 01026 i002
Table 3. Results of the PLSR applied to time series vegetation indices (NDVI, SAVI, EVI, and REP). Number of latent variables (in PLSR model), calibration R2, validation R2, calibration RMSE (RMSEC), and cross validation RMSE (RMSECV) are summarized.
Table 3. Results of the PLSR applied to time series vegetation indices (NDVI, SAVI, EVI, and REP). Number of latent variables (in PLSR model), calibration R2, validation R2, calibration RMSE (RMSEC), and cross validation RMSE (RMSECV) are summarized.
IndicesNo. of Latent Variables in PLSR ModelCalibration R2RMSEC (ton/ha)Validation R2RMSECV (ton/ha)
NDVI60.870.110.830.12
EVI60.850.120.800.14
SAVI60.840.120.790.14
REP50.700.160.620.17
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Nazir, A.; Ullah, S.; Saqib, Z.A.; Abbas, A.; Ali, A.; Iqbal, M.S.; Hussain, K.; Shakir, M.; Shah, M.; Butt, M.U. Estimation and Forecasting of Rice Yield Using Phenology-Based Algorithm and Linear Regression Model on Sentinel-II Satellite Data. Agriculture 2021, 11, 1026. https://0-doi-org.brum.beds.ac.uk/10.3390/agriculture11101026

AMA Style

Nazir A, Ullah S, Saqib ZA, Abbas A, Ali A, Iqbal MS, Hussain K, Shakir M, Shah M, Butt MU. Estimation and Forecasting of Rice Yield Using Phenology-Based Algorithm and Linear Regression Model on Sentinel-II Satellite Data. Agriculture. 2021; 11(10):1026. https://0-doi-org.brum.beds.ac.uk/10.3390/agriculture11101026

Chicago/Turabian Style

Nazir, Abid, Saleem Ullah, Zulfiqar Ahmad Saqib, Azhar Abbas, Asad Ali, Muhammad Shahid Iqbal, Khalid Hussain, Muhammad Shakir, Munawar Shah, and Muhammad Usman Butt. 2021. "Estimation and Forecasting of Rice Yield Using Phenology-Based Algorithm and Linear Regression Model on Sentinel-II Satellite Data" Agriculture 11, no. 10: 1026. https://0-doi-org.brum.beds.ac.uk/10.3390/agriculture11101026

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop