Next Article in Journal
Carbon–Heteroatom Bond Formation via Coupling Reactions Performed on a Magnetic Nanoparticle Bed
Previous Article in Journal
Intermolecular Interactions and In Vitro Performance of Methyl Anthranilate in Commercial Sunscreen Formulations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Chemometric Differentiation of Pistachios (Pistacia vera, Greek ‘Aegina’ Variety) from Two Different Harvest Years Using FTIR Spectroscopy and DRIFTS and Disk Techniques

Laboratory of Chemistry, Department of Food Science & Human Nutrition, Agricultural University of Athens, Iera Odos 75, 11855 Athens, Greece
*
Author to whom correspondence should be addressed.
Submission received: 12 August 2021 / Revised: 27 August 2021 / Accepted: 30 August 2021 / Published: 2 September 2021

Abstract

:
Food quality is a topic of utmost importance as more and more emphasis is placed on quality rather than quantity of products. Previous studies have pointed out the interaction of quality with the harvest year. In this study, 22 Pistacia vera (Greek ‘Aegina’ variety) samples (11 from 2017 and 11 from 2018) were differentiated using Fourier transform infrared spectroscopy (FTIR) and (a) diffuse reflectance infrared Fourier transform spectroscopy (DRIFTS) and (b) KBr/sample disk techniques. In both years, the pistachios trees’ growing followed standard cultivation methods and similar agronomic conditions. Two chemometric models were developed using partial least squares-discrimination analysis (PLS-DA). DRIFTS proved unable to statistically differentiate the samples (R2 = 0.96266, Q2 = 0.63152). On the contrary, the disk technique completely differentiated the pistachio samples (R2 = 0.99705, Q2 = 0.97719). The 1720–1800 cm−1 region mostly contributed to the discrimination. The disk-FTIR chemometric model is fast, robust, economical, and environmentally friendly for determining pistachio matrix quality.

1. Introduction

Pistachios (Pistacia vera) are valued all over the world and have been described as superfoods for their special organoleptic characteristics (color, intense aroma, and taste), their ability to provide a multitude of beneficial ingredients in small quantities, and the possibility of valorizing the nut as a whole. Pistachio kernels are consumed fresh and dried in the sun or mechanically either as a snack, as a raw material to produce various products, or as a confectionery industry ingredient [1,2]. Their composition is characterized by low carbohydrate content, protein content above 20% w/w, and lipid content around 50% w/w. They contain a significant number of bioactive antioxidants, such as flavonoids, stilbenes, tocopherols, carotenoids, and chlorophylls [3,4]. In comparison to other nuts, pistachios have higher levels of lutein, zeaxanthin, γ-tocopherol, vitamin K, dietary fibers, phytosterols, and carotenoids [5,6]. Numerous studies prove the beneficial effect of pistachios to human health. They are known for their antioxidant, anti-inflammatory, anti-cancer, and cardioprotective action. In addition, their antimicrobial, anti-ischemic, and immunoregulatory properties have attracted the attention of many researchers [3,7].
In recent years, the pursuit of food quality has been a crucial goal for the food industry and consumers [8]. Several instrumental analytical methodologies have been broadly applied to address quality issues. The majority of these are not suitable for everyday or extensive analyses since they require time-consuming sample preparation and specialized laboratory staff. In recent years, researchers have focused on the application of methods that provide a whole molecular fingerprint of the food matrix, such as spectroscopic (infrared (IR), Raman, nuclear magnetic resonance (NMR)) and chromatographic techniques (gas chromatography-mass spectrometry (GC-MS) and high performance liquid chromatography-mass spectrometry (HPLC-MS)) [9].
In particular, Fourier transform infrared (FTIR) spectroscopy has emerged as a promising analytical tool both on industrial and research scales. Its high competitiveness lies in its speed of analysis, low cost, and convenience of sample preparation [9]. FTIR spectroscopy has been applied to nuts, flours, oils, cakes and flakes, meat, and spirit beverages [10]. The IR spectrum includes information about functional groups of the chemical compounds in the complex matrix of food matrices, such as pistachios. Diffuse reflectance infrared Fourier transform spectroscopy (DRIFTS) and KBr/sample disk spectroscopy are two FTIR sampling techniques that became very popular in a very short duration, since they offer a short analysis time and minor sample preparation [11,12].
Typically, in the DRIFTS technique, the sample may be a powder or a rough surface solid through which the radiation is scattered. Infrared light is directed to a sample cup and can lead to a single reflection from the surface (specular reflection) or be reflected multiple times causing diffuse scattered light over a large area. DRIFTS accessories are designed to reject the radiation of specular reflection and to collect by a mirror as much as possible from the diffuse scattered light, subsequently measured by the detector [11,13]. The KBr/sample disk technique involves dilution of the sample in KBr, transportation of a mixture amount in a die, and the use of a hydraulic press to form a disk. By this technique, the diffuse scattered light is increased as the incident light more deeply penetrates the sample [14].
Studies with chemometrics data, such as FTIR, result in complex multivariate datasets, so that multivariate data analysis is required to investigate these complex datasets [15]. FTIR spectroscopy associated with different chemometric tools has been frequently applied to dairy products, honey, coffee, olive oil, and wine [9]. One of the most used methods for development of multivariate discrimination models is partial least squares discriminant analysis (PLS-DA) [16]. PLS-DA can identify which explanatory variables significantly contribute to the construction of PLS components and, consequently, have a high explanatory power on the response variable [17].
The food quality and, consequently, the chemical characteristics of food matrices can be affected by the year of harvest [18]. In the present study, DRIFT and KBr disk spectra of pistachio samples of the Greek ‘Aegina’ variety from two consecutive years of harvest were acquired, aiming to: (a) accurately estimate whether pistachios could be classified in two classes due to year-to-year variability and (b) compare which spectroscopic technique gives the best obtained PLS-DA model ascertained through R2 and Q2 indicators.

2. Materials and Methods

2.1. Pistachio Samples

A total of 22 Pistacia vera samples of the ‘Aegina’ variety equally obtained from 2 consecutive years of harvest were provided directly from pistachio producers across Greece (Aegina, Megara, Phthiotis, Trizina). The origin conditions (pistachio farmers, field coordinates, cultivation care, post-harvest way of drying) of the 11 samples derived from 2017 harvest year were the same with those of the 11 samples from 2018 harvest year. All samples were unshelled, and the kernels were subjected to pulverization with a food processor equipped with metal cutting blade and stainless steel container (Izzy, Greece), followed by particle size separation to obtain ground samples (pistachio kernel flours) of 500–800 μm. Prior to further analysis, the samples were stored in re-closable plastic bags in the dark and refrigerated (−20 °C) conditions as a way of avoiding degradation and improving oxidation stability.

2.2. Moisture Content Measurement

According to the AOAC Official Method 925.40, approximately 2 g from each pistachio kernel flour was placed in a ceramic cup and dried in an oven (102 °C) until constant weight. The moisture content was calculated based on the weight difference before and after drying.

2.3. DRIFTS Sample Analysis

A randomly selected quantity from pistachio kernel flour was placed on a flat surface where infrared radiation was directed to obtain DRIFT spectrum. For each sample, three spectra were collected. Each spectrum of the triplet came from a different part of the same sample (sub-sample). A Thermo Nicolet 6700 FTIR spectrophotometer (Thermo Electron Corporation, Madison, WI, USA) was used, equipped with a deuterated triglycine sulfate (DTGS) detector using a DRIFT Spectra Tech microcup accessory (Spectra-Tech Inc., Stamford, CT, USA) (3 mm diameter, 2 mm height). The spectra were collected as absorbance from 100 scans through the wavenumber region of 400–4000 cm1. The speed of the interferometer moving mirror was 0.3165 mm/s. A background spectrum was collected using FTIR grade KBr as a non-absorbing matrix powder (Sigma-Aldrich, Steinhein, Germany), prior to spectrum recording of each sample. After each measurement, microcup was cleaned with acetone and dried.

2.4. KBr Disk Sample Analysis

A KBr disk sample preparation involved thorough mixing of 200 mg dried KBr powder and 5 mg ground pistachio kernel flour. The mixture was transferred into a 13 mm die and formed a clear disk when pressed under high pressure using a 2- to 8-ton bench top hydraulic press. The spectra were recorded with the above spectrophotometer using an accessory appropriate for disk measurements. Measurements’ conditions (scans, wavenumber region, and mirror speed) were previously described. Background measurements were made against pure KBr disk which had no absorptions over the entire absorbance range. Triplicate KBr/sub-samples disks were made for each sample, and the spectrum of each one was recorded.

2.5. FTIR Data Processing

The FTIR data were processed using the Omnic 8.2.0.387 software (Thermo Fisher Scientific Inc., Madison, WI, USA). Both the DRIFT and disk spectra were smoothed using the Savitsky-Golay algorithm (5-point moving second-degree polynomial), and the baseline was corrected using the ‘automatic baseline correct function’ (second-degree polynomial, 20 iterations). Then, the average spectra of each sample were calculated and converted into Kubelka-Munk units. The Kubelka-Munk conversion compensated for some of the following undesirable effects: low intensity bands were increased relative to intense bands, and strong intensity bands had broader, rounder peak shapes. Finally, spectra absorbance scale was normalized.

2.6. Discriminant Analysis

Classification of the year of harvest was based on DRIFT and disk spectra of pistachio kernel flours. The 650–4000 cm1 spectral region was used. Before the development of the discriminant analysis, the homogeneity of the covariance matrices was ensured since the ratio of the two groups (2017, n = 11 and 2018, n = 11) was equal to or less than 1.5 [19]. Two chemometric models were developed using the partial least square-discriminant analysis (PLS-DA) statistical technique. One was based on the DRIFTS data and another on disk spectra. Each chemometric model was examined using cross-validation and permutation tests. According to Field (2009), the classification ability of the samples is assessed with greater reliability through cross-validation [20]. The statistical analysis was performed using MetaboAnalyst 5.0 software.

3. Results

3.1. Moisture Content Measurement

The pistachios were dried by the pistachio growers either in the sun or mechanically. The moisture level was found to be between 5 and 7% wet basis (w.b.) for all samples. Drying nuts at moisture levels below 11% w.b. is important for both safety and taste. A safe level of moisture is defined as a level that does not support the growth of fungi, and for shelled pistachios, this ranges between 2.2 and 8.2% w.b. at 21 °C [21].

3.2. FTIR Analysis

Table 1 presents the most characteristic peaks of the pistachio kernel flours’ DRIFT and disk spectra attributed to the respective modes of vibration. Figure 1 and Figure 2 display the representative spectra of pistachio kernel flours using the DRIFTS and disk technique, respectively, from two harvest years. The peaks at 3006, 2928, and 2856 cm1 and the deformation vibrations in the range 1300–1500 cm1 generally arose from lipid content [12,22]. It was observed that between the 2017 and 2018 harvests, the correlation of these peaks’ heights or areas changed. In a recent study, it was found that the oil content is associated with the year of harvest. Therefore, the differences in the respective spectral region are related to the different oil contents [23]. Accordingly, the spectrum range 1500–1700 cm1 was defined as proteinic with peaks at 1660 and 1550 cm1 attributed to the amide I and amide II vibration modes, respectively. The region of amides I and II is associated with the secondary structure of proteins and the protein content. The protein content depends on the harvest period, the temperature, and the rain rate [24,25,26], a fact which is confirmed in the present study. The spectral region from 1300 to 900 cm1 was connected to oligo and polysaccharides’ ring vibrations, while absorption bands between 900 and 600 cm1 were generally caused by aromatic ring vibrations [12,22].
From the large and complex FTIR datasets, important features of the samples and relevant information for the creation of models could only be extracted with chemometric analysis [9].

3.3. Multivariate Statistical Analysis

Using either DRIFTS or the disk technique, due to the impossibility of the optical comparability between the collected spectra from year to year, it was necessary to conduct multivariate statistical analysis to investigate whether samples differed in a statistically significant manner and could be classified according to the year of harvest.
Preprocessing steps of FTIR data followed to diminish the effect that unrelated factors had on the intensity of absorbance peaks [12]. A first filtering was based on the interquartile range (IQR), which represented the 25th percentile and the 75th percentile of data distribution. Thus, 25% of the lowest and 25% of the highest data were not included in the model construction. Thereafter, as the number of independent variables (FTIR wavenumbers) ranged between 500 and 1000, 25% of data that were unlikely to be of use when modeling the data were filtered [27]. The remaining 75% of the source data were normalized to bring them all to the same baseline, so as to be able to compare data of different scales. Additionally, normalization inconsistencies between data were smoothed out, improving the effectiveness and the performance of the algorithms. Hence, the cube root transformation was applied. Subsequently, data Pareto scaling was useful and necessary. The data were mean centered and divided by the square root of the standard deviation of each independent variable.
After data manipulation, PLS modeling was developed to transform the raw data into a new set of data by extracting a set of latent variables (latent factors or principal components) that had the optimal spectral information and thus the best predictive power. The number of principal components was decided automatically when the predicted residual error sum of squares (PRESS) values reached a minimum or levelled off. Adding more factors resulted in over-fitted calibration models. The resulting variable importance in the projection (VIP) scores of the developed PLS models indicated which wavenumbers were more active for the discrimination, in other words mainly changing as intensity changed between the consecutive years [12]. In accordance, t-tests proved to which independent variables the statistically significant difference was mainly due.
PLS approaches were tested by using the ‘leave one out’ cross-validation method to show how well the developed PLS models performed by quantifying each calibration sample as if it were a validation one. In this way, a recalculation of the model was made. This method is indicated in cases with a number of samples less than 100. The performance of the final PLS models was compared in terms of the R2 parameter known as the ‘goodness of fit’ or explained variation and the Q2 parameter termed as ‘goodness of prediction’ or predicted variation. The R2 and Q2 values, which indicate the model fit and predictability, respectively, range between 0 and 1. The R2 index is a measure of the explanatory power of the main components of the model. A PLS-DA model with a high value of R2 is regarded as providing a good fit to the data and proves that the selected number of main components is sufficient for the explanatory power of the model. The Q2 index is calculated as a measure of the correct validation of the model by expressing the cumulative contribution of the selected principal components in the predictive quality of the model. A Q2 value from 0.5 to 0.9 indicates good predictability, stability, and reliability of the model, while one greater than 0.9 is considered to indicate excellent predictability, stability, and reliability. A large discrepancy between Q2 and R2 values indicates a non-objective model, dependent on the presence of the specific dataset that created it [28,29]. Furthermore, model performance was tested by conducting permutation tests. Permutation tests assumed that there was no difference among the two groups that were formed based on the year of harvest, so the labels of the samples were randomly permuted, and a new classification model was calculated [30,31].

3.3.1. DRIFTS Discriminant Analysis

Applying the PLS-DA on the entire DRIFT spectrum of samples indicated that pistachio kernel flours could not be completely separated between the two years of harvest (p-value > 0.05) (Figure 3). As shown in Figure 4, five principal components resulted in higher values of R2 and Q2, described in detail in Table 2.
To give a measure of the classification’s statistical significance (p-value), permutation tests were conducted [32]. The observed p-value of Figure 5 was higher than 0.05, reinforcing the conclusion of Figure 3 that there was no statistically significant difference between DRIFT spectra from year to year.

3.3.2. Disks Discriminant Analysis

PLS-DA analysis of the total wavenumber range proved a clear sample distinction according to the year of harvest (Figure 6). Five principal components were the optimal number for classification, accounting for as much of the variation explained as possible, while high R2 and Q2 values were already given by the three principal components (Figure 7 and Table 3).
The statistical significance of the obtained PLS-DA model was evaluated with permutation testing (Figure 8). A separation between the two groups significant from a statistical point of view (p-value < 0.05) was evident when the indicator with the red mark was completely to the right [33].
PLS-DA ranked the independent variables using the VIP scores at p = 0.05 (Figure 9). For evaluation of year-to-year variability, data from the two consecutive harvests were compared using a t-test (Figure 10), which showed that almost the entire spectrum played an important role in the identification of differences between the two classes. Statistically significant wavenumbers to the discrimination are shown with a pink color. The absorbance range of 1720–1800 cm−1, with the peak at 1750 cm−1 included, had the most contributory VIP variables in the discrimination having the highest VIP scores. The colored boxes on the right of Figure 9 indicate the relative intensity of the corresponding wavenumber in each group under study.

4. Discussion

Food quality is an issue of increasingly high concern to society and to all stakeholders involved in food production. In this sense, one of the products in which quality labels are mostly useful are pistachios. Food quality is inextricably linked to the chemical profile of food, which can be affected by several factors including the year of harvest [18]. A recent study related to pistachio oils identified statistically significant differences in terms of quality and nutritional value between crops of two consecutive years of harvest. This year-to-year differentiation of pistachio oils was possibly attributed to environmental factors [23].
In light of these results, a further investigation was carried out, not for pistachio oils but for pistachio samples of the Greek ‘Aegina’ variety collected from four different Greek regions. It was difficult to obtain standard results for pistachios cultivated in different locations and with different agricultural practices [18]. To overcome this problem, each sample of the 2017 harvest year (11 samples in total) was derived from the same farmer and the same field with each corresponding sample of the 2018 harvest season (11 samples in total).
FTIR spectroscopy is quick, environmentally friendly, and not complex and can be applied in routine analysis and official pistachio quality control [9,10]. FTIR spectra were obtained for each sample in triplicate using DRIFTS and the disks technique. The performance assessment of PLS-DA models obtained with two different types of FTIR analysis (DRIFTS and disks) was achieved with coefficients R2 and Q2 as a statistical measure of the model fitting. The disk acquired spectra in combination with the use of chemometric tools such as PLS-DA showed a clear tendency of discrimination (p-value < 0.05) between pistachios from the two different years with R2 = 0.99705 and Q2 = 0.97719. From the results of this model, it could be stated that the 1720–1800 cm−1 region had the highest contribution to this classification. However, the constructed PLS-DA model using the DRIFT spectra could not predict the year of harvest from the set of independent variables (p-value > 0.05) with R2 = 0.96266 and Q2 = 0.63152.
The DRIFTS quantitative spectral analysis is difficult, as the intensity of the light scattered is strongly dependent on the refractive index, particle size, density, and homogeneity of the sample [34]. In order to obtain a high-quality DRIFT spectrum, the sample must be well homogenized, its particle size must be small and uniform so that a spectrum with narrower bandwidths and more accurate zones of intensity is received, and the placement of the sample on a flat surface for measurement must not be excessively compact to maximize IR beam penetration [35].
On the other hand, in the disk technique, the sample was diluted with KBr, which enhances the contribution of the scattered light and minimizes the specular reflection. The specular reflection causes changes in band intensity, shape, and, in some cases, band inversions (Restrahlen bands). The KBr/sample mixture minimizes the negative effects of specular reflection. Generally, KBr is very hygroscopic, and if it is not well dried, it may result in bands at 3440 (OH stretch), 1630 (OH bend), and 560 cm−1 (OH wag) that affect the ability to interpret these spectral regions. During disk construction, excessive pressure along with the presence of water can change the hydration state and crystallinity [11]. In the current study, the complete drying of KBr was achieved, which made it possible to cope with this issue.

5. Conclusions

In conclusion, the disk-FTIR is a highly sensitive spectroscopic technique that allowed the collection of spectra from pulverized pistachio samples with minimal sample preparation. Results showed a complete discrimination between pistachios from the two different years of harvest. The proposed disk-FTIR spectroscopic chemometric model is fast, accurate, economical, and environmentally friendly.

Author Contributions

Conceptualization, L.V. and C.S.P.; methodology, L.V. and C.S.P.; software, L.V.; validation, L.V. and C.S.P.; formal analysis, L.V.; investigation, L.V. and C.S.P.; resources, L.V. and C.S.P.; data curation, L.V. and C.S.P.; writing—original draft preparation, L.V. and C.S.P.; writing—review and editing, L.V. and C.S.P.; visualization, L.V. and C.S.P.; supervision, C.S.P.; project administration, L.V. and C.S.P. Both authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Acknowledgments

This research work was supported by the Hellenic Foundation for Research and Innovation (HFRI) and the General Secretariat for Research and Technology (GSRT), under the HFRI PhD Fellowship grant (GA. no. 1313).

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Catalán, L.; Alvarez-Ortí, M.; Pardo-Giménez, A.; Gómez, R.; Rabadán, A.; Pardo, J.E. Pistachio oil: A review on its chemical composition, extraction systems, and uses. Eur. J. Lipid Sci. Technol. 2016, 119, 1600126. [Google Scholar] [CrossRef]
  2. Salvador, M.D.; Ojeda-Amador, R.M.; Fregapane, G. Virgin Pistachio (Pistachia vera L.) Oil. In Fruit Oils: Chemistry and Functionality, 1st ed.; Ramadan, M.F., Ed.; Springer: Cham, Switzerland, 2019; pp. 181–197. [Google Scholar]
  3. Fallico, B.; Ballistreri, G.; Arena, E.; Tokuşoğlu, Ő. Nut Bioactives: Phytochemicals and Lipid Based Components of Almonds, Hazelnuts, Peanuts, Pistachios, and Walnuts. In Fruit and Cereal Bioactives: Sources, Chemistry, and Applications; Tokusoglu, Ö., Hall III, C.A., Eds.; CRC Press: Boca Raton, FL, USA, 2011; pp. 198–201. [Google Scholar]
  4. Martínez, M.L.; Fabani, M.P.; Baroni, M.V.; Huaman, R.N.M.; Ighani, M.; Maestri, D.M.; Wunderlin, D.; Tapia, A.; Feresin, G.E. Argentinian pistachio oil and flour: A potential novel approach of pistachio nut utilization. J. Food Sci. Technol. 2016, 53, 2260–2269. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Mahmoudi, R.; Norian, R.; Katiraee, F.; Pajohi-Alamoti, M.R.; Emami, S.J. Occurrence of aflatoxin B1 in pistachio nuts during various preparing processes: Tracing from Iran. J. Mycol. Res. 2014, 1, 1–5. [Google Scholar]
  6. Fogacci, F.; Cicero, A.F.G.; Derosa, G.; Rizzo, M.; Veronesi, M.; Borghi, C. Effect of pistachio on brachial artery diameter and flow-mediated dilatation: A systematic review and meta-analysis of randomized, controlled-feeding clinical studies. Crit. Rev. Food Sci. Nutr. 2019, 59, 328–335. [Google Scholar] [CrossRef] [PubMed]
  7. Rajaei, A.; Barzegar, M.; Mobarez, A.M.; Sahari, M.A.; Esfahani, Z.H. Antioxidant, anti-microbial and antimutagenicity activities of pistachio (Pistachia vera) green hull extract. Food Chem. Toxicol. 2010, 48, 107–112. [Google Scholar] [CrossRef]
  8. Pérez-Ràfols, C.; Subirats, X.; Serrano, N.; Díaz-Cruz, J.M. New discrimination tools for harvest year and varieties of white wines based on hydrophilic interaction liquid chromatography with amperometric detection. Talanta 2019, 201, 104–110. [Google Scholar] [CrossRef] [PubMed]
  9. Mendes, E.; Duarte, N. Mid-Infrared Spectroscopy as a Valuable Tool to Tackle Food Analysis: A Literature Review on Coffee, Dairies, Honey, Olive Oil and Wine. Foods 2021, 10, 477. [Google Scholar] [CrossRef]
  10. Sujka, K.; Koczoń, P.; Ceglińska, A.; Reder, M.; Ciemniewska-Żytkiewicz, H. The Application of FT-IR Spectroscopy for Quality Control of Flours Obtained from Polish Producers. J. Anal. Methods Chem. 2017, 2017, 4315678. [Google Scholar] [CrossRef] [PubMed]
  11. Larkin, P. Instrumentation and Sampling Methods. In Infrared and Raman Spectroscopy: Principles and Spectral Interpretation; Larkin, P., Ed.; Elsevier: Amsterdam, The Netherlands, 2011; pp. 27–54. [Google Scholar]
  12. Kaya-Celiker, H. Mid-Infrared Spectral Characterization of Aflatoxin Contamination in Peanuts. Ph.D. Thesis, Virginia Polytechnic Institute and State University, Blacksburg, VA, USA, 2012. [Google Scholar]
  13. Reh, C. In-line and off-line FTIR measurements. In Instrumentation and Sensors for the Food Industry, 2nd ed.; Kress-Rogers, E., Brimelow, C.J.B., Eds.; Woodhead Publishing in Food Science and Technology: Sawston, UK, 2001; pp. 213–232. [Google Scholar]
  14. Dent, G. Preparation of Samples for IR Spectroscopy as KBr Disks. IJVS 1996, 1, 1–2. [Google Scholar]
  15. Szymańska, E.; Saccenti, E.; Smilde, A.K.; Westerhuis, J.A. Double-check: Validation of diagnostic statistics for PLS-DA models in metabolomics studies. Metabolomics 2012, 8, 3–16. [Google Scholar] [CrossRef] [Green Version]
  16. Barker, M.; Rayens, W. Partial least squares for discrimination. J. Chemom. 2003, 17, 166–173. [Google Scholar] [CrossRef]
  17. Bastien, P.; Vinzi, V.E.; Tenenhaus, M. PLS generalised linear regression. Comput. Stat. Data Anal. 2005, 48, 17–46. [Google Scholar] [CrossRef]
  18. Prolla, I.R.D.; Barbosa, R.G.; Veeck, A.P.L.; Augusti, P.R.; da Silva, L.P.; Ribeiro, N.D.; Emanuelli, T. Cultivar, harvest year, and storage conditions affecting nutritional quality of common beans (Phaseolus vulgaris L.). Ciênc. Tecnol. Aliment. Camp. 2010, 30, 96–102. [Google Scholar] [CrossRef] [Green Version]
  19. Stevens, P.J. Applied Multivariate Statistics for the Social Sciences, 3rd ed.; Lawrence Erlbaum: Mahwah, NJ, USA, 1996. [Google Scholar]
  20. Field, A. Discovering Statistics Using SPSS, 3rd ed.; Sage Publications Ltd.: London, UK, 2009. [Google Scholar]
  21. Kader, A.A.; Heintz, C.M.; Labavitch, J.M.; Rae, H.L. Studies related to the description and evaluation of pistachio nut quality [Genotype, production area, maturity, moisture content, degree of shell staining, and storage]. J. Am. Soc. Hortic. Sci. 1982, 107, 812–816. [Google Scholar]
  22. Kaya-Celiker, H.; Mallikarjunan, P.K.; Kaaya, A. Mid-infrared spectroscopy for discrimination and classification of Aspergillus spp. contamination in peanuts. Food Control 2015, 52, 103–111. [Google Scholar] [CrossRef]
  23. Valasi, L.; Kokotou, M.G.; Pappas, C.S. GC-MS, FTIR and Raman spectroscopic analysis of fatty acids of Pistacia vera (Greek variety “Aegina”) oils from two consecutive harvest periods and chemometric differentiation of oils quality. Food Res. Int. 2021, 148, 110590. [Google Scholar] [CrossRef]
  24. Esmaeilpour, A.; Shakerardekani, A. Effects of early harvest times on nut quality and physiological characteristics of pistachio (Pistacia vera) trees. Fruits 2018, 73, 110–117. [Google Scholar] [CrossRef]
  25. Alae-Carew, C.; Nicoleau, S.; Bird, F.A.; Hawkins, P.; Tuomisto, H.L.; Haines, A.; Dangour, A.D.; Scheelbeek, P.F.D. The impact of environmental changes on the yield and nutritional quality of fruits, nuts and seeds: A systematic review. Environ. Res. Lett. 2020, 15, 023002. [Google Scholar] [CrossRef] [PubMed]
  26. Manners, R.; Varela-Ortega, C.; van Etten, J. Protein-rich legume and pseudo-cereal crop suitability under present and future European climates. Eur. J. Agron. 2020, 113, 125974. [Google Scholar] [CrossRef]
  27. Hackstadt, A.J.; Hess, A.M. Filtering for increased power for microarray data analysis. BMC Bioinform. 2009, 10, 11. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. Lee, B.-J.; Zhou, Y.; Lee, J.S.; Shin, B.K.; Seo, J.-A.; Lee, D.; Kim, Y.-S.; Choi, H.-K. Discrimination and prediction of the origin of Chinese and Korean soybeans using Fourier transform infrared spectrometry (FT-IR) with multivariate statistical analysis. PLoS ONE 2018, 13, e0196315. [Google Scholar] [CrossRef] [PubMed]
  29. XLStat 2020. Available online: https://www.xlstat.com (accessed on 2 August 2021).
  30. Lindgren, F.; Hansen, B. Model validation by permutation tests: Applications to variable selection. J. Chemom. 1996, 10, 521–532. [Google Scholar] [CrossRef]
  31. Westerhuis, J.A.; van Velzen, E.J.J.; Hoefsloot, H.C.J.; Smilde, A.K. Discriminant Q2 (DQ2) for improved discrimination in PLSDA models. Metabolomics 2008, 4, 293–296. [Google Scholar] [CrossRef] [Green Version]
  32. Pesarin, F.; Salmaso, L. Permutation Tests for Complex Data: Theory, Applications and Software; Wiley: Chichester, UK, 2010. [Google Scholar] [CrossRef]
  33. Barberini, L.; Noto, A.; Saba, L.; Palmas, F.; Fanos, V.; Dessì, A.; Zavattoni, M.; Fattuoni, C.; Mussap, M. Multivariate data validation for investigating primary HCMV infection in pregnancy. Data Brief 2016, 9, 220–230. [Google Scholar] [CrossRef] [Green Version]
  34. Sun, D.W. Infrared Spectroscopy for Food Quality Analysis and Control; Academic Press: Cambridge, MA, USA, 2008. [Google Scholar]
  35. Lercher, J.A.; Jentys, A. Infrared and Raman Spectroscopy for Characterizing Zeolites. In Introduction to Zeolite Science and Practice, 3rd ed.; Čejka, J., van Bekkum, H., Corma, A., Schüth, F., Eds.; Elsevier: Amsterdam, The Netherlands, 2007; Volume 168, pp. 435–476. [Google Scholar] [CrossRef]
Figure 1. Representative diffuse reflectance infrared Fourier transform (DRIFT) spectra of pistachio kernel flours from the same producer, but from two harvest years.
Figure 1. Representative diffuse reflectance infrared Fourier transform (DRIFT) spectra of pistachio kernel flours from the same producer, but from two harvest years.
Appliedchem 01 00006 g001
Figure 2. Representative disk spectra of pistachio kernel flours from the same producer, but from two harvest years.
Figure 2. Representative disk spectra of pistachio kernel flours from the same producer, but from two harvest years.
Appliedchem 01 00006 g002
Figure 3. Partial least squares-discrimination analysis (PLS-DA) 2D scores plot for the discrimination model of DRIFT spectra according to the year of harvest (MetaboAnalyst).
Figure 3. Partial least squares-discrimination analysis (PLS-DA) 2D scores plot for the discrimination model of DRIFT spectra according to the year of harvest (MetaboAnalyst).
Appliedchem 01 00006 g003
Figure 4. PLS-DA cross-validation for the discrimination model of DRIFT spectra according to the year of harvest (MetaboAnalyst). Red asterisk shows the optimal value of Q2.
Figure 4. PLS-DA cross-validation for the discrimination model of DRIFT spectra according to the year of harvest (MetaboAnalyst). Red asterisk shows the optimal value of Q2.
Appliedchem 01 00006 g004
Figure 5. PLS-DA permutation test for the discrimination model of DRIFT spectra according to the year of harvest (MetaboAnalyst) (permutation numbers:100).
Figure 5. PLS-DA permutation test for the discrimination model of DRIFT spectra according to the year of harvest (MetaboAnalyst) (permutation numbers:100).
Appliedchem 01 00006 g005
Figure 6. PLS-DA 2D scores plot for the discrimination model of disk spectra according to the year of harvest (MetaboAnalyst).
Figure 6. PLS-DA 2D scores plot for the discrimination model of disk spectra according to the year of harvest (MetaboAnalyst).
Appliedchem 01 00006 g006
Figure 7. PLS-DA cross-validation for the discrimination model of disk spectra according to the year of harvest (MetaboAnalyst). In this Figure, the optimal Q2 value is highlighted with the use of a red asterisk.
Figure 7. PLS-DA cross-validation for the discrimination model of disk spectra according to the year of harvest (MetaboAnalyst). In this Figure, the optimal Q2 value is highlighted with the use of a red asterisk.
Appliedchem 01 00006 g007
Figure 8. PLS-DA permutation test for the discrimination model of disk spectra according to the year of harvest (MetaboAnalyst) (permutation numbers:100).
Figure 8. PLS-DA permutation test for the discrimination model of disk spectra according to the year of harvest (MetaboAnalyst) (permutation numbers:100).
Appliedchem 01 00006 g008
Figure 9. PLS-DA importance features of disk spectra according to the year of harvest (MetaboAnalyst).
Figure 9. PLS-DA importance features of disk spectra according to the year of harvest (MetaboAnalyst).
Appliedchem 01 00006 g009
Figure 10. t-test of disk spectra according to the year of harvest (MetaboAnalyst).
Figure 10. t-test of disk spectra according to the year of harvest (MetaboAnalyst).
Appliedchem 01 00006 g010
Table 1. Peak interpretation of pistachio kernel flours’ (Pistacia vera, variety ‘Aegina’) diffuse reflectance infrared Fourier transform (DRIFT) and disk spectra [12,22].
Table 1. Peak interpretation of pistachio kernel flours’ (Pistacia vera, variety ‘Aegina’) diffuse reflectance infrared Fourier transform (DRIFT) and disk spectra [12,22].
Wavenumbers (cm−1)Functional GroupsAbbreviationsVibration Modes
3200–3600O-Hv(O-H)stretching
N-H of the amide groupv(Ν-H)stretching
3006cis=C-Hv(=C-H)stretching
2928C-H of -CH2-vas(CH2)stretching (asymmetric)
2856C-H of -CH2-vs(CH2)stretching (symmetric)
1750C=O of estersv(C=O)stretching
1660C=O, C-N of the amide Iv(C=O), ν(C-N)stretching
1550N-H, C-O of the amide IIδ(N-H), δ(C-O)bending
C-C, C-N of the amide IIν(C-C), ν(C-N)stretching
1465C-H of -CH2-δs(C-H)bending (scissoring)
1416cis =C-Hρ(=C-H)bending (rocking)
1365C-H of -CH3δs(C-H)bending (symmetric)
1240C-O of esters
of triglycerides
vas(C-O)stretching (asymmetric)
C-H of -CH3δs(C-H)bending (symmetric)
1165C-Ovas(C-O)stretching (asymmetric)
-CH2-δ(CH2)bending
1096C-Ov(C-O)stretching
725aromatic ring-deformation
Table 2. PLS-DA model performance according to the year of harvest for DRIFT spectra (MetaboAnalyst).
Table 2. PLS-DA model performance according to the year of harvest for DRIFT spectra (MetaboAnalyst).
Measure1 PC2 PC3 PC4 PC5 PC
Accuracy0.727270.772730.863640.954550.90909
R20.319490.622540.876410.946660.96266
Q20.0854990.388410.428740.604330.63152
Table 3. PLS-DA model performance according to the year of harvest for disks spectra (MetaboAnalyst).
Table 3. PLS-DA model performance according to the year of harvest for disks spectra (MetaboAnalyst).
Measure1 PC2 PC3 PC4 PC5 PC
Accuracy0.863640.954551.01.01.0
R20.703400.975180.977640.994180.99705
Q20.645440.871790.958530.969010.97719
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Valasi, L.; Pappas, C.S. Chemometric Differentiation of Pistachios (Pistacia vera, Greek ‘Aegina’ Variety) from Two Different Harvest Years Using FTIR Spectroscopy and DRIFTS and Disk Techniques. AppliedChem 2021, 1, 62-74. https://0-doi-org.brum.beds.ac.uk/10.3390/appliedchem1010006

AMA Style

Valasi L, Pappas CS. Chemometric Differentiation of Pistachios (Pistacia vera, Greek ‘Aegina’ Variety) from Two Different Harvest Years Using FTIR Spectroscopy and DRIFTS and Disk Techniques. AppliedChem. 2021; 1(1):62-74. https://0-doi-org.brum.beds.ac.uk/10.3390/appliedchem1010006

Chicago/Turabian Style

Valasi, Lydia, and Christos S. Pappas. 2021. "Chemometric Differentiation of Pistachios (Pistacia vera, Greek ‘Aegina’ Variety) from Two Different Harvest Years Using FTIR Spectroscopy and DRIFTS and Disk Techniques" AppliedChem 1, no. 1: 62-74. https://0-doi-org.brum.beds.ac.uk/10.3390/appliedchem1010006

Article Metrics

Back to TopTop