Next Article in Journal
Disturbances of the Thermosphere and the Ionosphere during a Meteorological Storm
Previous Article in Journal
Spatial-Temporal Changes of Methane Content in the Atmosphere for Selected Countries and Regions with High Methane Emission from Rice Cultivation
Previous Article in Special Issue
Real-Time Low-Cost Personal Monitoring for Exposure to PM2.5 among Asthmatic Children: Opportunities and Challenges
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Applications and Limitations of Quantifying Speciated and Source-Apportioned VOCs with Metal Oxide Sensors

1
Environmental Engineering, University of Colorado Boulder, Boulder, CO 80309, USA
2
Mechanical Engineering, University of Colorado Boulder, Boulder, CO 80309, USA
*
Author to whom correspondence should be addressed.
Submission received: 4 September 2021 / Revised: 14 October 2021 / Accepted: 20 October 2021 / Published: 22 October 2021

Abstract

:
While low-cost air quality sensor quantification has improved tremendously in recent years, speciated hydrocarbons have received little attention beyond total lumped volatile organic compounds (VOCs) or total non-methane hydrocarbons (TNMHCs). In this work, we attempt to use two broad response metal oxide VOC sensors to quantify a host of speciated hydrocarbons as well as smaller groups of hydrocarbons thought to be emanating from the same source or sources. For sensors deployed near oil and gas facilities, we utilize artificial neural networks (ANNs) to calibrate our low-cost sensor signals to regulatory-grade measurements of benzene, toluene, and formaldehyde. We also use positive matrix factorization (PMF) to group these hydrocarbons along with others by source, such as wet and dry components of oil and gas operations. The two locations studied here had different sets of reference hydrocarbon species measurements available, helping us determine which specific hydrocarbons and VOC mixtures are best suited for this approach. Calibration fits on the upper end reach above R2 values of 0.6 despite the parts per billion (ppb) concentration ranges of each, which are magnitudes below the manufacturer’s prescribed detection limits for the sensors. The sensors generally captured the baseline trends in the data, but failed to quantitatively estimate larger spikes that occurred intermittently. While compounds with high variability were not suited for this method, its success with several of the compounds studied represents a crucial first step in low-cost VOC speciation. This work has important implications in improving our understanding of the links between health and environment, as different hydrocarbons will have varied consequences in the human body and atmosphere.

1. Introduction

1.1. The Need for Low-Cost Speciated Hydrocarbon Data

While numerous studies have utilized low-cost sensors to study generalized VOCs or TNMHCs in addition to methane [1,2,3,4], few have attempted to characterize specific hydrocarbons other than methane in outdoor monitoring experiments. These current characterizations have limited utility; it is helpful to know when and where hydrocarbon emissions are significant, but not knowing the exact chemical composition leaves lingering questions about source attribution as well as impacts on climate and human health, since some hydrocarbons are more damaging to people and the environment than others [5,6]. Additionally, since VOC sources can be exceptionally small or transient, outdoor VOC hotspot detection is an up-and-coming area in exposure assessment [7,8]. Likewise, VOC source attribution is critical not only for individuals to understand their exposure risk, but also in community planning and mitigation efforts. Since myriad VOC sources, including oil and gas operations, may exist within a community, it is crucial to disentangle them to determine which emit the highest concentrations or most harmful compounds so that risks can be addressed by local organizations in order of importance [9,10,11,12]. This also has important implications on secondary pollutants generated in the troposphere, as the exact hydrocarbon makeup of emissions will determine their subsequent reaction pathways and byproducts [13].
While low-cost sensors are available for speciated hydrocarbons such as formaldehyde (HCHO), several weaknesses have prevented them from becoming more widely used. As many hydrocarbon-specific sensors have been developed for indoor use [14,15,16,17], years of research and trial and error will be required to determine which—if any—are suitable for long-term ambient monitoring in outdoor environments. Using a formaldehyde-specific sensor also closes the door to studying other types of hydrocarbons. If there are 30 hydrocarbons of interest, sensor packages do not currently have the capability to run 30 species-selective sensors all at once; selecting a few means neglecting the rest. For these reasons, we have opted to use two generalized VOC sensors to quantify a host of speciated hydrocarbons as well as groups of hydrocarbons. This method is imperfect; most commercially available sensors struggle with detection limits higher than the observed concentrations of individual VOCs and lack selectivity towards certain hydrocarbons [18]. However, this approach still allows us to view hydrocarbons in totality rather than only quantifying a select few.
At the regulatory level, summa cannisters and gas chromatography (GC) are typically used to determine the concentrations of speciated VOCs to a high degree of accuracy. Field-deployable continuous mass spectrometry data used in this work have associated uncertainties in the parts per billion range if not the mere parts per trillion (ppt). While these techniques deliver high-quality data, they are expensive to own and operate, with monthly operating costs typically totaling in the thousands [19,20]. Thus, utilizing this data only for sensor calibration purposes rather than entire deployments would greatly reduce operating expenses. Likewise, at the current instrument price point, multiple measurement locations cannot be established. Using low-cost sensors instead, we gain the ability to measure speciated hydrocarbons at multiple locations throughout an area of interest, highlighting localized differences on the relevant spatial and temporal scales.

1.2. VOC Quantification Using Low-Cost Sensors

One previous study used the same two generalized VOC sensors—the Figaro TGS 2600 and 2602—to quantify speciated VOC data [21]. As multivariate linear regression has been frequently used to match low-cost sensor signals to reference concentrations [1,2,3,18,21,22,23,24,25], Collier-Oxandale and co-workers used this method to quantify benzene as an individual hydrocarbon as well as aromatics as a group (benzene, toluene, C8, and C9). Using one of the pod datasets explored in this work, R2 values between the reference and sensor benzene signals reached 0.72 for calibration and 0.67 for validation, explaining nearly three-fourths of the variability in the species. In Collier-Oxandale’s iteration, carbon dioxide data from a reference instrument were incorporated into the multivariate linear regression along with the low-cost sensor signals to improve fits [21]. However, since additional reference signals are infrequently available, we will focus our analysis on achieving similar fits without the aid of any additional reference signals, relying solely on the sensors. We believe this is a more realistic approach as we step away from frequent, expensive calibration campaigns. Instead, we explore a less commonly used machine learning technique to improve fits. We also consider a different hydrocarbon grouping technique, this time by source as described in Section 2.3 rather than by physical structure as employed previously by Collier-Oxandale and co-workers.
Other previous low-cost sensor works used both multivariate linear regression and artificial neural networks (ANNs) to quantify several more commonly studied trace gases, including ozone, methane, carbon monoxide, carbon dioxide, and nitrogen dioxide in the ambient environment [4,26]. ANNs routinely outperformed linear models, and the varying input sensor measurements seemed to have a greater impact on the goodness of fit than the parameter tweaking performed. Early stopping also proved to be a valuable tool in preventing overfitting [26]. ANNs have also been successful in quantifying speciated VOCs from metal oxide sensors in a laboratory environment [27,28,29]. This work will be among the first at the intersection of these two categories: using ANNs to quantify speciated VOCs in an outdoor environment, opting for a field normalization technique rather than a chamber calibration.

1.3. Application to Oil and Gas Data

The sensor locations utilized in this work are all near oil and gas facilities on Colorado’s Front Range, making this one of the main sources of hydrocarbons in each area. As each facility pulls from the Wattenburg Oilfield [30], the chemical composition of associated hydrocarbon gas mixture may include similarities at each location. However, as the extraction and refining activities at each site vary, site to site differences in emissions from the same oilfield are likely. As further discussed in the instrumentation section, the hydrocarbons studied at each site varied as the reference measurement methods varied at each. Thus, our comparisons are further muddled as only two of the same pollutants were available at each site, with the rest remaining unique to their deployments.

2. Methods

Here, we make use of low-cost sensor data collected at regulatory monitoring sites near oil and gas facilities during two distinct campaigns. In the summer of 2014, we set up our monitors at a NASA-operated facility measuring VOCs as part of their Front Range Air Pollution and Photochemistry Experiment (FRAPPE) for approximately one month. Years later in 2020, we co-located our monitors at another VOC measurement site managed by the Colorado Department of Public Health and Environment (CDPHE) for approximately a month towards the end of the summer. In this work, we also apply our calibration fits from the CDPHE site to a week of field data collected on the grounds of an oil and gas facility in Eastern Colorado. Figure 1 shows each of the pod locations and durations with respect to oil and gas wells in the area as provided by the Colorado Oil and Gas Conservation Commission (COGCC) [30].

2.1. Instrumentation

Low-cost air quality monitors dubbed U-Pods and Y-Pods [3,23], two generations of the same pod platform, were used for data collection. Both are built in-house by the Hannigan lab and include a suite of environmental and gas-phase sensors, including temperature, pressure, CO2, and two metal oxide hydrocarbon sensors (TGS 2600 and 2602, Figaro USA Inc., Arlington Heights, IL, USA). At the FRAPPE site in Erie, CO, the pod was mounted to the railing of a 1-story trailer, approximately 1 m away from the reference instrument inlet [21]. At the CDPHE site in Greeley, CO, USA, pods were stacked on top of a container on the roof of the trailer within a meter of the reference instrument inlet. Note that during the colocation, the container was knocked over by the wind, so by the end of the colocation, the pods were not as close to the inlet as desired. For this reason, we have elected to show the data from only one pod, which we believe was the best positioned despite the stack toppling over.
Reference-grade mass spectrometry instruments were also utilized at each calibration site. For the FRAPPE colocation in Erie, CO, USA, a proton transfer reaction mass spectrometry (PTRMS) was operated as part of NASA FRAPPE [31]. At the site operated by CDPHE in Greeley, CO, USA a gas chromatography mass spectrometry (GCMS) instrument was utilized for speciated hydrocarbon measurements. Note that while minute-average data were available from the FRAPPE campaign, the data resolution for the CDPHE data was approximately one measurement per 45 min; we re-timed this to hourly based on the median timestamp and used hourly averages for our sensors as well.

2.2. Artificial Neural Networks

Based on previous success in using ANNs to quantify pollutants with low-cost sensors [26], we focused our efforts on this machine learning technique, which has not been used previously for this specific hydrocarbon application. When using multivariate linear regression to quantify pollutants such as methane and generalized VOCs previously, interaction terms among sensors proved beneficial [1,2,21]. For instance, the ratio or product of the two VOC sensors has been added as an additional term to better quantify methane. Trial and error was used to determine the best combination of input factors, starting with electrical signal: raw voltage in mV, converted resistance in mΩ, or normalized resistance (R/R0) as recommended by the manufacturer. Then, additional sensor signals such as CO2 and interaction terms among the two metal oxide sensors, temperature, and humidity were added to determine the best combination. Finally, ANN parameters such as the learning rate and number of epochs were tuned using trial and error based on literature recommendations [4,26,28]. Data were split into calibration and validation randomly using a three-fold algorithm, and early stopping was used by default to help to prevent overfitting.

2.3. Positive Matrix Factorization

In addition to quantifying individual hydrocarbons, we sought to group hydrocarbons by their source. This aids in our understanding of which pollutants are emitted by the same source and may also improve fits; summed hydrocarbons will have higher concentrations than individual species, raising the total concentration closer to the sensors’ prescribed detection limits. These source attribution efforts afford the relevant communities or agencies the ability to address hazards in order of importance. For instance, a previous PMF analysis near an oil and gas facility in Los Angeles resulted in VOCs split into 6 main factors, with compounds emanating from: combustion, aged motor vehicle emissions, fresh motor vehicle emissions, biogenic, industrial manufacturing, and natural gas [32]. This analysis aided in community understanding of anthropogenic and biogenic air quality risks in the area.
Although certain compounds may be emitted from more than one source (e.g., styrene can be found in both industrial and oil and gas emissions), source attribution in this manner affords us a more compartmentalized view of often overwhelming and tangled webs of VOC species present in a single area. In this aim, we utilized the EPA PMF 5.0 tool, a mathematical receptor model, which essentially categorizes input concentrations by their likely sources and contributions [33]. PMF is governed by Equation (1).
Equation (1): PMF Governing Equation, where Xij is the input data matrix of ambient concentrations (i samples and j compounds), p is the number of factors as designated by the user, g is each factor’s contribution, f is the chemical profile, and eij is the residual error.
X i j = k = 1 p g i k f k j + e i j
For each location, we opted for a three-factor analysis, assuming one for each meaningful source in the area: wet oil and gas activity, dry oil and gas activity, and miscellaneous, which could represent emissions from traffic or natural sources (i.e., trees) depending on the location. Note that separate analyses were conducted for each site, and the species within each group varied depending on the reference pollutants available at each site. Missing values and those below the model detection limit were replaced with half the detection limit for analysis purposes. While uncertainties were provided for the FRAPPE data, an uncertainty of 15% of each measurement was assumed for the CDPHE data.

3. Results

3.1. Calibration, Best-Fitting ANN Parameters

In developing sensor calibrations for each pollutant using ANNs, we found that varying the inputs to the ANN, such as the sensors, interactions between sensors, and electrical signals (e.g., voltage, resistance), was just as impactful as parameter tuning. Interestingly, raw voltage produced the best fits for all the pollutants studied except for Greeley’s biogenic factor pollutant group, which had the lowest reference concentrations. The ratio of the calculated resistance to the resistance of the sensor in clean air, referred to as R/R0, worked the best for this group. The sensor manufacturer recommends using R/R0 in all cases [34,35], yet it was only helpful here when the concentrations were the furthest below the prescribed detection limit. This suggests that the term may not be needed in general. This has been demonstrated in other studies [2,36], but may be useful in cases such as this where the desired concentration is much lower than the sensors are designed to quantify.
Note that several different species use the exact same ANN input data, similarly to how the same linear multivariable equation has been used for both methane and TNMHCs in the past [1,2]. However, the same pollutants or groups of pollutants at the two different sites typically had different inputs and parameters that produced the best results, likely due to the different sets of compounds comprising each group at the two sites. For instance, one group contained benzene and toluene, the only pollutants available at both sites. However, since the rest of the compounds in the oil and gas groups varied, their ANN input signals and parameter tuning were different. Similarly, the remaining two factors entirely comprised different compounds at each site, so the inputs and tuning were understandably mixed. Even when studied alone, benzene and toluene required different ANNs at each site, which may have been influenced by the different time averaging at each, as the minutely FRAPPE data were inherently more precise than the hourly CDPHE data. The inputs and parameters for each species or groups of species are shown in Table 1.
The sensor manufacturer lists several target gases for each of the VOC sensors used. Only two of our target gases made appearances in their literature: isobutane is listed for the light VOC sensor (VOC1), while toluene is listed for the heavy VOC sensor (VOC2) [34,35]. While we will discuss isobutane in Section 3.4, it is worth noting that both toluene models performed best with the addition of a heavy VOC sensor interaction term, which is in line with the manufacturer’s usage recommendations. Likewise, the oil and gas group contained toluene, and thus, an additional VOC interaction term was required as well. The term used here represents the ratio of heavy VOCs to total VOCs.

3.2. Calibration, PMF Analysis to Group VOCs

Based on our three-factor analysis, factor fingerprints were determined for each species, i.e., what percentage of their concentrations could be attributed to each source. Then, we placed each compound into one of the three groups based on whether at least roughly 60% of the species concentration could be explained by that factor. The factor fingerprints are shown in the right panels of Figure 2, and the final group selections on the left. The constituents of each group were summed to create three distinct groups, and each group was then used to generate ANN calibration models.
For the FRAPPE data, one factor (“FRAPPE oil and gas”) consisted of the aromatic compounds, benzene, toluene, C8, and C9, and was considered the aromatics or BTEX group. While these compounds may be emitted from traffic [37] based on the location, we are considering this primarily as the liquid phase of oil and gas emissions [38], although both sources likely contributed. The next group (“combustion”) consisted of aldehydes (formaldehyde and acetaldehyde) and acetone, which are all generated through fossil fuel combustion processes [39]. While traffic could have contributed to combustion concentrations, flaring is the more likely source in this area, and these represent gas-phase oil and gas emissions. Lastly, the final group (“FRAPPE natural gas”) contained methanol alone, which is likewise emitted from natural gas [40]. It is important to note that the eight hydrocarbons available for analysis at this site were far from exhaustive, and these were selected for analysis by NASA due to their roles as ozone precursors [41]. Although other compounds were undoubtedly emitted from other sources, we are only able to characterize these select few, most of which have links to oil and gas activity. Thus, all three of our sources are connected to oil and gas, while biogenic and other emissions expected in a rural area have been overlooked.
At CDPHE’s reference site in Greeley, a much larger panel of 21 hydrocarbons was available. While this is still not an exhaustive selection of every possible hydrocarbon emitted in this area, the reference data afforded us a more complete understanding of sources in the area besides those associated with natural gas. Similar to FRAPPE, one group (“Greeley oil and gas”) comprised the full set of aromatics or BTEX compounds, including benzene, toluene, ethylbenzene, and multiple xylenes, in addition to one other compound, n-nonane, primarily representing liquid-phase natural gas emissions [42]. Another group (“Greeley natural gas”) includes a large variety of alkanes, many of which are the components of natural gas and also represent oil and gas emissions [43]. The final factor (“biogenic”) represents biogenic compounds such as isoprene, which is emitted from plants [44]. Note that n-octane appeared to be evenly split between the oil and gas and natural gas groups; since neither held a majority, it was not included in the remainder of our analyses to avoid overlap among the separate concentrations we fit neural networks to. For clarity, each of these groups will be referred to as its likely source from this point forward, e.g., the aromatic groups will be referred to as “oil and gas”. Bear in mind that the two groups present at both sites (“oil and gas”, “natural gas”) represent entirely different sets of compounds at each site, sharing their name only due to PMF-aided source attribution rather than chemical structure.

3.3. Calibration for Individual VOCs, Benzene and Toluene

Although the species available for analysis at the two locations varied widely, two compounds were available at each: benzene and toluene. The comparison of the calibration fit for each, at each site, is shown in Figure 3; the quality of the fit may be indicative of the influence of time resolution on calibration. Both compounds displayed better fits at Greeley, where hourly data were available, than at the FRAPPE site, for which minute-average data were used. A previous study showed that higher time averaging and, therefore, sparser data points can cause calibration models to overfit [36]. However, in this case, the large spikes of both compounds may have contributed more to the poor FRAPPE fits rather than the time averaging. When the minutely data from FRAPPE were hourly averaged, the fit for benzene improved only slightly (calibration R2 = 0.30, validation R2 = 0.22). Likewise, when the FRAPPE toluene data were hourly averaged, the calibration and validation folds showed less variability, but fit worse overall (calibration R2 = 0.38, validation R2 = 0.36). This suggests that the time averaging used is not the reason for the different fits for the same pollutants at the two different sites.
The key similarity between the fits for these two individual species, as well as the rest of the pollutants studied, was the ability of the sensors to capture baseline trends in the reference concentration data while struggling to model intermittent peaks. The FRAPPE reference data for these two compounds display more frequent spikes of relatively high concentrations as compared to the rest of the data collected. Since all our models struggled with peak finding, the lower R2 values and random error are likely being driven primarily by the consistent underpredicting of large peaks.
Without using additional reference signals as in Collier-Oxandale et al. [21], the median calibration and validation fits for benzene were significantly reduced. While the FRAPPE benzene and toluene ANN fits need further refinement to be reliable enough for wider use, the promising fits seen at Greeley show that this method may be a strong enough start for quantifying speciated VOCs of interest.
To ensure the significance of our results, we tested the null hypothesis that the median Pearson’s R across all folds of calibration data for each compound was not equal to zero at the 0.05 significance level. For the FRAPPE data, based on minutely averaged samples (20,327 ≤ n ≤ 23,425 for each dataset), all compounds and groups of compounds were found to be statistically significant. For the Greeley data, which were hourly averaged (n = 371 for each dataset), all compounds and groups of compounds were likewise found to have R values statistically different from zero.

3.4. Calibration of Grouped VOCs Using PMF

As expected, since the summed species in the groups had higher overall concentrations than their constituents alone, fits were generally better. Since the groups comprised different compounds at each site, this is not a clear-cut comparison but rather a discussion of two separate sets of quasi-speciated compounds. The correlations among groups at each site were first examined (see Supplementary Figure S1) to ensure that consistencies among fits were not merely caused by similarities in the input data.
As seen in Figure 4, the species with the fewest and smallest spikes had the best fits, as all our models struggled to capture peaks in the reference concentration data. FRAPPE’s combustion and natural gas groups had slightly higher concentration ranges than other compounds and saw very few spikes above baseline, and as such, the models were able to capture more than half of the variability in the pollutants. FRAPPE’s oil and gas data, which included benzene and toluene, likely struggled due to the same issues that plagued these two pollutants on their own.
Some groups at Greeley fit better than others, but overfitting was an issue across the board, perhaps due to the hourly time averaging resulting in fewer data points in the ANN model development (~400 points per pollutant) as compared to FRAPPE (~1000+). The Greeley oil and gas group, which also contained benzene and toluene, displayed fits not unlike those seen for its individual constituents. Since the natural gas and biogenic groups were entirely different from their FRAPPE counterparts, our results varied and do not provide a clear comparison. The Greeley natural gas group represents a host of alkanes and experienced significantly more bias than any other compound or group of compounds, suggesting that the sensors themselves may have struggled to pick up on compounds in this group. Although isobutane, one of the members of this group, is listed by the manufacturers as a target gas for the light VOC sensor, reference concentrations for the individual compound had a median of 1.5 ppb, and the compound was one of 11 included in the group; concentrations were likely not significant enough to be singled out by the sensor.
The biogenic group experienced the worst calibration and validation fits out of all at Greeley. Based on the PMF analysis, this group comprised a mere 0.2% of all the total VOCs quantified; it was not a major contributor, and the small concentrations there were difficult to fit. The highest spikes for the biogenic group’s three compounds (isoprene, 1-2-4-trimethyl-benzene, and n-decane) combined totaled less than 1 ppb, so it is unsurprising that this group had the least successful fits.

3.5. Calibration of Formaldehyde

As a case study, we implemented ANN fits for one additional species only available at the FRAPPE site, formaldehyde, which was included in the natural gas group at the site (aldehydes). This species was of particular interest due to its role in acid rain formation [45] and toxicity to humans and animals [46]. Due to the lack of large spikes in the data, this compound’s calibration and validation fits hovered just below 0.5 for R2, with our model explaining approximately half the variability in the compound. Bias and random error were also relatively minor. Since formaldehyde was only available during the FRAPPE campaign, we cannot draw comparisons to any other datasets. However, our ability to model the compound at this one site is promising for future applications. These results are shown in Figure 3.

3.6. Application to Field Data

As a case study, we applied the calibration ANNs generated for the oil and gas and natural gas groups at the Greeley reference site to field data collected in Wiggins, CO, USA, as marked in Figure 1. This step is crucial as the calibration data studied are sometimes prone to overfitting, and likewise, small differences in environmental conditions (i.e., temperature, humidity) between co-location and deployment phases may contribute to less-than-ideal fits in the field. The pod was placed approximately 3 m west of oil and gas equipment; note that the prevailing wind direction in the area was northerly at the time. We elected to apply the fits for these two groups only for several reasons. Firstly, since concentrations of individual species were estimated to be very small in this area, using the groups of multiple species summed would allow for slightly higher concentrations to be fitted. Secondly, with the oil and gas factor representing BTEX compounds, this might represent liquids present close to oil and gas equipment [47]. Likewise, the natural gas group might represent gases associated with the pumping and metering equipment mere meters from the pod [48,49,50]. The results of each are shown in Figure 5.
While most trace gases studied demonstrate a diurnal effect where concentrations rise and fall as the boundary layer changes height throughout the day [51], these quasi-speciated compounds rise during the day, but return to different baselines each night. This may have been influenced by the colocation data, since the reference data for benzene exhibited some baseline shifting throughout the week of colocation (see Figure 4).
Upon examining the raw data collected by the pods, substantial baseline shifting is observed across all sensors. Large swings in temperature and humidity are typical of the spring season in Colorado, but given the remaining sensors’ sensitivity to temperature and humidity, may have had a larger effect on results than anticipated. The light VOC sensor (VOC1) in particular appears to mirror the trends in humidity, suggesting that the increasing light VOC baseline was due to environmental conditions rather than the presence of pollutants. While the CO2 and heavy VOC (VOC2) sensors exhibit more typical patterns, they likewise fail to return to the same baseline each night.
These trends in the raw data suggest that the drift experienced when applied to the field data was primarily driven by unstable environmental conditions, which in turn affected each of the sensors. In future analyses, we will aim to deploy in environments with more stability to better characterize the utility of the ANN method independent of unforeseen driving factors. Nevertheless, this analysis demonstrates that speciated VOC ANNs may be suitable for longer-term environmental monitoring, although baseline shifting techniques may still be necessary for reliability in the future. Concentration ranges are reasonable given the location, and typical diurnal patterns are still observed regardless of baseline drift.

4. Discussion

4.1. Comparison of Two Locations

Several key differences between the two datasets from the two sites may have driven the inconsistencies between the fits for each pollutant or group. The observed discrepancies were driven primarily by the species available at each site rather than differences in the oilfield, production, or other relevant nearby sources. Likewise, the differences in fits between the two may have been driven by the metal oxide sensors’ ability to recognize the different pollutants as the concentrations at each site did not vary widely (see Figure 6). Different concentrations and time averaging may have also contributed to fits across the two sites being less than consistent, as hourly data are prone to overfitting, which explains the worse validation fits at the Greeley site.
Some consistencies among the two sites and different sets of pollutants highlight the strengths and weaknesses of our methods. While baseline trends were generally captured, larger spikes were harder for the ANNs to model, resulting in worse fits for compounds experiencing spikes and better fits for those without.

4.2. Broader Implications

VOC source attribution has often been muddled due to the large range of compounds encompassed by the TNMHC category. In a previous low-cost sensor work concerning oil and gas development in Los Angeles, CA, USA, it was not abundantly clear whether hydrocarbon emissions were emanating from oil and gas facilities or roadway traffic—or to what extent each played a role [2]. In knowing the exact makeup of VOC emissions—or groups whose source can be deduced—VOC readings from low-cost sensors will be much more impactful in source attribution.
The implications of determining speciated and quasi-speciated hydrocarbons from more widely distributed sensors cannot be understated, as VOCs are a major precursor to ozone. Recent studies have attempted to model the sensitivity among VOCs, nitrogen oxides (NOx), and ozone to better characterize the non-linear reactions generating ozone [52,53]. The ability to quantify a variety of speciated hydrocarbons at different points in an area of interest, especially near roadways, could illuminate why reducing concentrations of precursors does not necessarily reduce ozone formation [13].
Furthermore, knowing the exact makeup of hydrocarbon emissions will greatly improve our ability to estimate the associated health and environmental risks. The BTEX compounds, which have been linked to oil and gas emissions, are known to be toxic and in some cases carcinogenic [5,6]. Many other VOCs have demonstrated links to adverse health effects, such as eye irritation, headaches, respiratory inflammation, and the exacerbation of pre-existing conditions such as asthma [54,55]. Children, pregnant people, and the elderly face particularly high risks of having adverse health reactions to elevated VOC concentrations from oil and gas [56,57,58]. In Greeley, our data were collected on the grounds of an elementary school near oil and gas activities; even the small concentrations detected here have the potential for health implications for sensitive groups [59]. Thus, understanding the composition of hydrocarbon emissions in this area is of great importance for assessing health risks. Likewise, certain hydrocarbons are stronger climate forcers than others [60]; the better our assessment of the associated risks, the more impactful our mitigation strategies can be.

4.3. Limitations and Lessons Learned

The most pressing shortcoming of this work is the detection limits associated with the metal oxide sensors used. Figaro states their lower detection limit as 1 ppm [34,35], which is orders of magnitude larger than any of the reference concentrations seen during our data collection. While our method is still able to explain a portion of the variability of each compound, higher pollutant concentrations would make for more reliable results and presumably better ANN fits. Our sensor deployment nearby oil and gas was not the most suitable based on the concentration ranges seen, but this approach might be better suited for fence line monitoring where higher baseline concentrations might be observed [2]. Individual VOC emissions rarely top this threshold, but other applications with higher concentrations than those observed here such as wildfire plumes [61] might be more acceptable. However, the issues surrounding spike finding would need to be addressed to use this method successfully.
Another major limitation is the difficulty our models had in representing concentration spikes. While our baseline fits appear reliable for each of the compounds studied, large spikes are typically of greater interest to community members based on the associated health effects. While we continue to develop methods to quantify spikes specifically, in the meantime, we will consider any measurements above baseline concentration as spikes for community monitoring purposes. We may not be able to adequately measure exact spike concentrations at the present, but being able to warn stakeholders of the presence of a spike may be sufficient until a better method is achieved.
It is worth noting that higher concentrations may only be required during the colocation phase of the deployment to attain better ANN fits. Based on prior pod research, assuming a decent colocation is carried out, calibration fits can be applied to field deployments in varying locations with vastly different temperature, humidity, and concentration spaces [2]. Thus, the application may not need to have abnormally high VOC concentrations so long as those experienced during the colocation were sufficient to achieve reasonable ANN fits. Perfect results will not be achieved with sensors developed neither for this purpose nor concentration range, but these results represent an important first step in making speciated hydrocarbon data available in ambient outdoor monitoring. Based on the utility of VOC sensor interaction terms in our ANN models, we believe that the use of additional VOC sensors may further our ability to quantify specific hydrocarbons or hydrocarbons from specific sources in the future. In addition to the generalized “heavy VOC” and “light VOC” sensors, a mid-range VOC sensor or one specific for aromatics, for instance, might bolster our abilities even further.

5. Conclusions

While this method is imperfect, it has great potential for researchers to quantify specific hydrocarbons on highly localized spatial and temporal resolutions. Source mixtures with low variability or a dominant source appear best suited for this method; additional measures will likely be needed to quantify less-than-ideal mixtures. Baseline concentrations developed using our ANN calibration method appear reliable in some cases, while spike finding will prove to be more challenging. PMF may be a valuable tool in connecting individual compounds to emission sources when their concentrations are too small to be studied individually. Recent studies have quantified TNMHCs with R2 values hovering around the 0.5 threshold [1,2] for outdoor experiments using a similar field normalization technique. Our ability to now quantify specific hydrocarbons rather than all these lumped together to a similar degree of accuracy represents an improvement in the field and may be further bolstered by low-cost separation techniques upstream of sensors in the future. Continuing to look forward, this type of tool will better inform us of the human exposure and environmental hazards posed by hydrocarbon emissions.

Supplementary Materials

The following are available online at https://0-www-mdpi-com.brum.beds.ac.uk/article/10.3390/atmos12111383/s1, Figure S1: Colored correlation matrices for FRAPPE & Greeley PMF groups.

Author Contributions

Conceptualization, M.H.; Formal analysis, K.O.; Visualization, K.O.; Writing—original draft, K.O.; Writing—review & editing, M.H. All authors have read and agreed to the published version of the manuscript.

Funding

Publication of this article was funded by the University of Colorado Boulder Libraries Open Access Fund.

Data Availability Statement

Reference data is available online at https://www-air.larc.nasa.gov/cgi-bin/ArcView/discover-aq.co-2014?GROUND-PLATTEVILLE=1 (see data from Armin Whisthaler, 21 October 2021) and https://www.colorado.gov/airquality/tech_doc_repository.aspx (see “Extraction Vetting (Bella Romero)”, 21 October 2021) for FRAPPE and Greeley respectively. Y-Pod data is available upon request; please contact the first author.

Acknowledgments

Thank you to CDPHE and all those involved in the NASA FRAPPE campaign, especially Ashely Collier-Oxandale and Hannah Halliday. Big thanks to Evan Coffey and Daniel Bonn for their help with the pod colocations.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Collier-Oxandale, A.; Wong, N.; Navarro, S.; Johnston, J.; Hannigan, M. Using gas-phase air quality sensors to disentangle potential sources in a Los Angeles neighborhood. Atmos. Environ. 2020, 233, 117519. [Google Scholar] [CrossRef]
  2. Okorn, K.; Jimenez, A.; Collier-Oxandale, A.; Johnston, J.; Hannigan, M. Characterizing methane and total non-methane hydrocarbon levels in Los Angeles communities with oil and gas facilities using air quality monitors. Sci. Total Environ. 2021. [Google Scholar] [CrossRef] [PubMed]
  3. Piedrahita, R.; Xiang, Y.; Masson, N.; Ortega, J.; Collier, A.; Jiang, Y.; Li, K.; Dick, R.P.; Lv, Q.; Hannigan, M.; et al. The next generation of low-cost personal air quality sensors for quantitative exposure monitoring. Atmos. Meas. Tech. 2014, 7, 3325–3336. [Google Scholar] [CrossRef] [Green Version]
  4. Vikram, S.; Collier-Oxandale, A.; Ostertag, M.H.; Menarini, M.; Chermak, C.; Dasgupta, S.; Rosing, T.; Hannigan, M.; Griswold, W.G. Evaluating and improving the reliability of gas-phase sensor system calibrations across new locations for ambient measurements and personal exposure monitoring. Atmos. Meas. Tech. 2019, 12, 4211–4239. [Google Scholar] [CrossRef] [Green Version]
  5. Adgate, J.L.; Goldstein, B.D.; McKenzie, L.M. Potential public health hazards, exposures and health effects from unconventional natural gas development. Environ. Sci. Technol. 2014, 48, 8307–8320. [Google Scholar] [CrossRef]
  6. Glass, D.; Gray, C.; Adams, G.; Manuell, R.; Bisby, J. Validation of exposure estimation for benzene in the Australian petroleum industry. Toxicol. Ind. Health 2001, 17, 113–127. [Google Scholar] [CrossRef] [PubMed]
  7. Wu, X.; Fan, Z.; Zhu, X.; Jung, K.H.; Ohman-Strickland, P.; Weisel, C.P.; Lioy, P.J. Exposures to volatile organic compounds (VOCs) and associated health risks of socio-economically disadvantaged population in a “hot spot” in Camden, New Jersey. Atmos. Environ. 2012, 57, 72–79. [Google Scholar] [CrossRef] [Green Version]
  8. Chen, D.; Liu, R.; Lin, Q.; Ma, S.; Li, G.; Yu, Y.; Zhang, C.; An, T. Volatile organic compounds in an e-waste dismantling region: From spatial-seasonal variation to human health impact. Chemosphere 2021, 275, 130022. [Google Scholar] [CrossRef] [PubMed]
  9. Aklilu, Y.; Cho, S.; Zhang, Q.; Taylor, E. Source apportionment of volatile organic compounds measured near a cold heavy oil production area. Atmos. Res. 2018, 206, 75–86. [Google Scholar] [CrossRef]
  10. Hildenbrand, Z.L.; Mach, P.M.; McBride, E.M.; Dorreyatim, M.N.; Taylor, J.T.; Carlton, D.D.; Meik, J.M.; Fontenot, B.E.; Wright, K.C.; Schug, K.A.; et al. Point source attribution of ambient contamination events near unconventional oil and gas development. Sci. Total. Environ. 2016, 573, 382–388. [Google Scholar] [CrossRef] [PubMed]
  11. Wilde, S.E.; Dominutti, P.A.; Allen, G.; Andrews, S.J.; Bateson, P.; Bauguitte, S.j.-B.; Burton, R.R.; Colfescu, I.; France, J.; Hopkins, J.R.; et al. Speciation of VOC emissions related to offshore North Sea oil and gas production. Atmos. Chem. Phys. 2021, 21, 3741–3762. [Google Scholar] [CrossRef]
  12. Maka, P.; Liggio, J.; Leithead, A.; Wentzell, J.; Stroud, C.; Soares, J.; Akingunola, A.; Zhang, J.; Moran, M.D.; Li, S.M. Source Attribution of VOCs in the Canadian Oil Sands using Hierarchical Clustering. In AGU Fall Meeting Abstracts; American Geophysical Union Fall Meeting: San Francisco, CA, USA, 2019. [Google Scholar]
  13. Gao, H.O.; Niemeier, D.A. The impact of rush hour traffic and mix on the ozone weekend effect in southern California. Transport. Rsrch. Part D Transp. Environ. 2007, 12, 83–98. [Google Scholar] [CrossRef]
  14. Honeycutt, W.T.; Ley, M.T.; Materer, N.F. Precision and Limits of Detection for Selected Commercially Available, Low-Cost Carbon Dioxide and Methane Gas Sensors. Sensors 2018, 19, 3157. [Google Scholar] [CrossRef] [Green Version]
  15. van den Broek, J.; Cerrejon, D.K.; Pratsinis, S.E.; Güntner, A.T. Selective formaldehyde detection at ppb in indoor air with a portable sensor. J. Hazard. Mater. 2020, 399, 123052. [Google Scholar] [CrossRef]
  16. Lin, C.F.; Zan, H.W.; Lu, C.J.; Meng, H.F.; Soppera, O. A low-cost miniaturized colorimetric sensor with vertically-stacked semi-transparent finger-type organic photo detector for formaldehyde sensing. Org. Electron. 2019, 73, 115–121. [Google Scholar] [CrossRef]
  17. Chi, C.Y.; Chen, H.I.; Chen, W.C.; Chang, C.H.; Liu, W.C. Formaldehyde sensing characteristics of an aluminum-doped zinc oxide (AZO) thin-film-based sensor. Sens. Actuators B Chem. 2018, 255, 3017–3024. [Google Scholar] [CrossRef]
  18. Spinelle, L.; Gerboles, M.; Villani, M.G.; Aleixandre, M.; Bonavitacola, F. Field calibration of a cluster of low-cost available sensors for air quality monitoring. Part A: Ozone and nitrogen dioxide. Sens. Actuators B Chem. 2015, 215, 249–257. [Google Scholar] [CrossRef]
  19. University of Florida Health: Center for Environmental & Human Toxicology, College of Veterinary Medicine, Analytical Services Pricing. Available online: https://toxicology.vetmed.ufl.edu/core-laboratories/analytical-services-pricing/ (accessed on 15 July 2021).
  20. U.S. Department of Defense. Environmental Security Technology Certification Program, ESTCP Cost and Performance Report; U.S. Department of Defense: Arlington, VA, USA, 2015.
  21. Collier-Oxandale, A.; Thorson, J.; Halliday, H.; Milford, J.; Hannigan, M. Understanding the ability of low-cost MOx sensors to quantify ambient VOCs. Atmos. Meas. Tech. Discuss. 2019, 12, 1441–1460. [Google Scholar] [CrossRef] [Green Version]
  22. Sadighi, K.; Coffey, E.; Polidori, A.; Feenstra, B.; Lv, Q.; Henze, D.; Hannigan, M. Intra-urban spatial variability of surface ozone in Riverside, CA: Viability and validation of low-cost sensors. Atmos. Meas. Tech. 2018, 11, 1777–1792. [Google Scholar] [CrossRef] [Green Version]
  23. Collier-Oxandale, A.; Casey, J.G.; Piedrahita, R.; Ortega, J.; Halliday, H.; Johnston, J.; Hannigan, M. Assessing a low-cost methane sensor quantification system for use in complex rural and urban environments. Atmos. Meas. Tech. 2018, 11, 3569–3594. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Cheadle, L.; Deanes, L.; Sadighi, K.; Casey, J.G.; Collier-Oxandale, A.; Hannigan, M. Quantifying Neighborhood-Scale Spatial Variations of Ozone at Open Space and Urban Sites in Boulder, Colorado Using Low-Cost Sensor Technology. Sensors 2017, 17, 2072. [Google Scholar] [CrossRef] [Green Version]
  25. Casey, J.G.; Hannigan, M. Testing the performance of field calibration techniques for low-cost gas sensors in new deployment locations: Across a county line and across Colorado. Atmos. Meas. Tech. 2018, 11, 6351–6378. [Google Scholar] [CrossRef] [Green Version]
  26. Casey, J.; Collier-Oxandale, A.; Hannigan, M. Performance of artificial neural networks and linear models to quantify 4 trace gas species in an oil and gas production region with low-cost sensors. Sens. Actuators B Chem. 2019, 283, 504–514. [Google Scholar] [CrossRef]
  27. Huang, J.R.; Gu, C.P.; Meng, F.L.; Li, M.Q.; Liu, J.H. Detection of volatile organic compounds by using a single temperature-modulated SnO2 gas sensor and artificial neural network. Smart Mater. Struct. 2007, 16, 701. [Google Scholar] [CrossRef]
  28. Sobanski, T.; Szczurek, A.; Licznerski, B.W. Application of sensor array and artificial neural network for discrimination and qualification of benzene and ethylbenzene. In Proceedings of the 24th International Spring Seminar on Electronics Technology, Concurrent Engineering in Electronic Packaging, Calimanesti-Caciulata, Romania, 5–9 May 2001; pp. 150–153. [Google Scholar]
  29. Chen, Z.; Zheng, Y.; Chen, K.; Li, H.; Jian, J. Concentration Estimator of Mixed VOC Gases Using Sensor Array with Neural Networks and Decision Tree Learning. IEEE Sens. J. 2017, 17, 1884–1892. [Google Scholar] [CrossRef]
  30. Colorado Oil & Gas Conservation Commission: Interactive Map. Available online: https://cogcc.state.co.us/maps.html#/gisonline (accessed on 1 July 2021).
  31. Halliday, H.S.; Thompson, A.M.; Wisthaler, A.; Blake, D.R.; Hornbrook, R.S.; Mikoviny, T.; Müller, M.; Eichler, P.; Apel, E.C.; Hills, A.J. Atmospheric benzene observations from oil and gas production in the Denver-Julesburg Basin in July and August 2014. JGR Atmospheres 2016, 121, 11055–11074. [Google Scholar] [CrossRef] [Green Version]
  32. Johnston, J.; Okorn, K.; van Horne, Y.O.; Jimenez, A. Changes in neighborhood air quality after idling of an urban oil production site. Environ. Sci. Process. Impacts 2021, 23, 967–980. [Google Scholar] [CrossRef]
  33. Reff, A.; Eberly, S.I.; Bhave, P.V. Receptor modeling of ambient particulate matter data using positive matrix factorization: Review of existing methods. J. Air Waste Manage. Assoc. 2007, 57, 146–154. [Google Scholar] [CrossRef] [Green Version]
  34. Figaro, Product Information: TGS 2600–for the Detection of Air Contaminants. 2021. Available online: https://www.figarosensor.com/product/docs/tgs2600_product%20information%28fusa%29_rev05.pdf (accessed on 21 October 2021).
  35. Figaro, Product Information: TGS 2602–for the Detection of Air Contaminants. 2021. Available online: https://www.figarosensor.com/product/docs/tgs2602_product%20information%28fusa%29_rev05.pdf (accessed on 21 October 2021).
  36. Okorn, K.; Hannigan, M. Improving Air Pollutant Metal Oxide Sensor Quantification Practices through: An Exploration of Sensor Signal Normalization, Multi-Sensor and Universal Calibration Model Generation, and Physical Factors Such as Co-Location Duration and Sensor Age. Atmosphere 2021, 12, 645. [Google Scholar] [CrossRef]
  37. Bono, R.; Bugliosi, E.H.; Schilirò, T.; Gilli, G. The Lagrange Street story: The prevention of aromatics air pollution during the last nine years in a European city. Atmos. Environ. 2001, 35, 1. [Google Scholar] [CrossRef]
  38. Gross, S.A.; Avens, H.J.; Banducci, A.M.; Sahmel, J.; Panko, J.M.; Tvermoes, B.E. Analysis of BTEX groundwater concentrations from surface spills associated with hydraulic fracturing operations. J. Air Waste Manag. Assoc. 2013, 63, 4. [Google Scholar] [CrossRef] [Green Version]
  39. Altemose, B.; Gong, J.; Zhu, T.; Hu, M.; Zhang, L.; Cheng, H.; Zhang, L.; Tong, J.; Kipen, H.M.; Ohman-Strickland, P.; et al. Aldehydes in relation to air pollution sources: A case study around the Beijing Olympics. Atmos. Environ. 2015, 109, 61–69. [Google Scholar] [CrossRef] [Green Version]
  40. Aasberg-Petersen, K.; Nielsen, C.S.; Dybkjær, I.; Perregaard, J. Large Scale Methanol Production from Natural Gas; ResearchGate, Full Text; Haldor Topsøe: Lyngby, Denmark, 2008. [Google Scholar]
  41. NOAA Earth System Research Laboratory: Chemical Sciences Division, FRAPPÉ/DISCOVER-AQ 2014. Available online: https://csl.noaa.gov/groups/csl7/measurements/2014frappe/ (accessed on 1 July 2021).
  42. Correa, S.M.; Arbilla, G.; Marques, M.R.C.; Oliveira, K.M.P.G. The impact of BTEX emissions from gas stations into the atmosphere. Atmos. Pollut. Res. 2012, 3, 163–169. [Google Scholar] [CrossRef] [Green Version]
  43. AMohamed, A.E.-S.; Panigrahy, S.; Sahu, A.B.; Bourque, G.; Curran, H.J. An experimental and kinetic modeling study of the auto-ignition of natural gas blends containing C1–C7 alkanes. Proc. Combust. Inst. 2021, 38, 1. [Google Scholar]
  44. Sharkey, T.D.; Wiberley, A.E.; Donohue, A.R. Isoprene Emission from Plants: Why and How. Ann. Bot. 2008, 101, 5–18. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  45. Kieber, R.J.; Rhines, M.F.; Willey, J.D.; Avery, G.B. Rainwater formaldehyde: Concentration, deposition and photochemical formation. Atmos. Environ. 1999, 33, 22. [Google Scholar] [CrossRef]
  46. Songur, A.; Ozen, O.A.; Sarsilmaz, M. The Toxic Effects of Formaldehyde on the Nervous System. Rev. Environ. Contam. Toxicol. 2010, 203, 105–118. [Google Scholar]
  47. Verma, D.K.; Johnson, D.M.; McLean, J.D. Benzene and Total Hydrocarbon Exposures in the Upstream Petroleum Oil and Gas Industry. Am. Ind. Hyg. Assoc. 2000, 61, 2. [Google Scholar]
  48. Thompson, C.R.; Hueber, J.; Helmig, D. Influence of oil and gas emissions on ambient atmospheric non-methane hydrocarbons in residential areas of Northeastern Colorado. Elem. Sci. Anth. 2014, 3, 000035. [Google Scholar] [CrossRef] [Green Version]
  49. Gilman, J.B.; Lerner, B.M.; Kuster, W.C.; de Gouw, J. Source signature of volatile organic compounds from oil and natural gas operations in northeastern Colorado. Environ. Sci. Technol. 2013, 47, 1297–1305. [Google Scholar] [CrossRef]
  50. Peischl, J.; Ryerson, T.; Brioude, J.; Aikin, K.; Andrews, A.; Atlas, E.; Blake, D.; Daube, B.; de Gouw, J.; Dlugokencky, E. Quantifying sources of methane using light alkanes in the Los Angeles basin, California. J. Geophys. Res. Atmos. 2013, 118, 4974–4990. [Google Scholar] [CrossRef] [Green Version]
  51. Schindler, T.L. Never at Rest: The air over Los Angeles. 2013. Available online: https://svs.gsfc.nasa.gov/4077 (accessed on 11 June 2021).
  52. Irie, H.; Yonekawa, D.; Damiani, A.; Hoque, H.M.S.; Sudo, K.; Itahashi, S. Utilizing Continuous Multi-Component MAX-DOAS Observations for the Near-Surface Ozone Sensitivity Diagnosis at Chiba and Tsukuba, Japan for 2013–2019. In AGU Fall Meeting Abstracts; AGU Fall Meeting: San Francisco, CA, USA, 2020. [Google Scholar]
  53. Vohra, K.; Marais, E.; Lu, G.; Bloss, W.; Zhu, L.; Eskes, H.; De Smedt, I. Assessing Surface Ozone-NOX-VOC Sensitivity in Major Indian Cities Using High Resolution TROPOMI Observations; AGU Fall Meeting: San Francisco, CA, USA, 2020. [Google Scholar]
  54. Rasmussen, S.G.; Ogburn, E.L.; McCormack, M.; Casey, J.A.; Bandeen-Roche, K.; Mercer, D.G.; Schwartz, B.S. Association between unconventional natural gas development in the Marcellus Shale and asthma exacerbations. JAMA Intern. Med. 2016, 176, 1334–1343. [Google Scholar] [CrossRef]
  55. Delfino, R.J.; Jr, H.G.; Linn, W.S.; Pellizzari, E.D.; Hu, Y. Asthma symptoms in Hispanic children and daily ambient exposures to toxic and criteria air pollutants. Environ. Health Perspect. 2003, 111, 647–656. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  56. Johnston, J.E.; Lim, E.; Roh, H. Impact of upstream oil extraction and environmental public health: A review of the evidence. Sci. Total Environ. 2019, 657, 187–199. [Google Scholar] [CrossRef] [PubMed]
  57. Yermukhanova, L.; Zhexenova, A.; Izimbergenova, G.; Turebaev, M.; Bekbauova, A.; Zumabekov, E. Immunodeficiency states in persons residing in the oil-producing regions of Kazakhstan. Res. J. Med. Sci. 2017, 11, 16–18. [Google Scholar]
  58. Brosselin, P.; Rudant, J.; Orsi, L.; Leverger, G.; Baruchel, A.; Bertrand, Y.; Nelken, B.; Robert, A.; Michel, G.; Margueritte, G.; et al. Acute childhood leukaemia and residence next to petrol stations and automotive repair garages: The ESCALE study (SFCE). Occup. Environ. Med. 2009, 66, 598–606. [Google Scholar] [CrossRef] [PubMed]
  59. D’Andrea, M.A.; Reddy, G.K. Health Risks Associated with Benzene Exposure in Children: A Systematic Review. Glob. Pediatr. Health 2018, 5, 2333794X18789275. [Google Scholar] [CrossRef]
  60. Ferrero, L.; Sangiorgi, G.; Perrone, M.G.; Rizzi, C.; Cataldi, M.; Markuszewski, P.; Pakszys, P.; Makuch, P.; Petelski, T.; Becagli, S.; et al. Chemical Composition of Aerosol over the Arctic Ocean from Summer ARctic EXpedition (AREX) 2011–2012 Cruises: Ions, Amines, Elemental Carbon, Organic Matter, Polycyclic Aromatic Hydrocarbons, n-Alkanes, Metals, and Rare Earth Elements. Atmosphere 2019, 10, 54. [Google Scholar] [CrossRef] [Green Version]
  61. Evtyugina, M.; Calvo, A.I.; Nunes, T.; Alves, C.; Fernandes, A.P.; Tarelho, L.; Vicente, A.; Pio, C. VOC emissions of smouldering combustion from Mediterranean wildfires in central Portugal. Atmos. Environ. 2013, 64, 339–348. [Google Scholar] [CrossRef]
Figure 1. Map of colocation and deployment locations relative to oil wells.
Figure 1. Map of colocation and deployment locations relative to oil wells.
Atmosphere 12 01383 g001
Figure 2. Individual pollutant contributions and factor fingerprints for FRAPPE (A) and Greeley (B).
Figure 2. Individual pollutant contributions and factor fingerprints for FRAPPE (A) and Greeley (B).
Atmosphere 12 01383 g002
Figure 3. FRAPPE (A) and Greeley (B) speciated colocation timeseries (colored; reference data in black) along with boxplots for folds of calibration (light) and validation (dark): R2, random error (root mean square error, or RMSE), and bias (mean bias error or MBE). Estimate uncertainties are overlayed on the timeseries.
Figure 3. FRAPPE (A) and Greeley (B) speciated colocation timeseries (colored; reference data in black) along with boxplots for folds of calibration (light) and validation (dark): R2, random error (root mean square error, or RMSE), and bias (mean bias error or MBE). Estimate uncertainties are overlayed on the timeseries.
Atmosphere 12 01383 g003
Figure 4. FRAPPE (A) and Greeley (B) source-apportioned colocation timeseries (colored; reference data in black) along with boxplots for folds of calibration (light) and validation (dark): R2, random error (root mean square error, or RMSE), and bias (mean bias error, or MBE). Estimate uncertainties are overlayed on the timeseries.
Figure 4. FRAPPE (A) and Greeley (B) source-apportioned colocation timeseries (colored; reference data in black) along with boxplots for folds of calibration (light) and validation (dark): R2, random error (root mean square error, or RMSE), and bias (mean bias error, or MBE). Estimate uncertainties are overlayed on the timeseries.
Atmosphere 12 01383 g004
Figure 5. Field applications of the ANNs (A); raw pod data used to generate these estimates (B).
Figure 5. Field applications of the ANNs (A); raw pod data used to generate these estimates (B).
Atmosphere 12 01383 g005
Figure 6. Reference concentrations of individual and grouped pollutants by site.
Figure 6. Reference concentrations of individual and grouped pollutants by site.
Atmosphere 12 01383 g006
Table 1. ANN inputs and parameters per location and pollutant. For the sensor inputs, t is temperature in degrees Celsius, h is absolute humidity, VOC1 and VOC2 are the two VOC sensors, CO2 is the CO2 sensor, and Te is the elapsed time.
Table 1. ANN inputs and parameters per location and pollutant. For the sensor inputs, t is temperature in degrees Celsius, h is absolute humidity, VOC1 and VOC2 are the two VOC sensors, CO2 is the CO2 sensor, and Te is the elapsed time.
DatasetSpeciesElectrical InputSensor InputsHidden LayersHidden Layer SizeBackpropagation MethodEpochsPerformance Goalµ
FRAPPEOil and GasVoltaget, h, VOC1, VOC2, Te, VOC2/(VOC1 + VOC2)45Bayesian Regularization10001 × 10−100.5
CombustionVoltaget, h, VOC1, VOC2, Te, CO2, VOC1/(VOC1 + VOC2)110Levenberg-Marquardt1001 × 10−100.01
Natural GasVoltaget, h, VOC1, VOC2, Te, CO2, VOC1/VOC2, ln(t)×h110Levenberg-Marquardt1001.00 × 10−100.1
BenzeneVoltaget, h, VOC1, VOC2, Te, CO2110Levenberg-Marquardt1001 × 10−100.005
TolueneVoltaget, h, VOC1, VOC2, Te, VOC1/VOC2, VOC2 t47Bayesian Regularization10001 × 10−100.5
HCHOVoltaget, h, VOC1, VOC2, Te, CO2, VOC1/(VOC1 + VOC2)110Levenberg-Marquardt1001 × 10−100.005
GreeleyOil and GasVoltaget, h, VOC1, VOC2, Te, CO2, VOC2/(VOC1 + VOC2)110Levenberg-Marquardt1001 × 10−100.05
Natural GasVoltaget, h, VOC1, VOC2, Te, CO2, VOC1/VOC215Bayesian Regularization10000.010.5
BiogenicR/R0t, h, VOC1, VOC2, Te, VOC2/(VOC1 + VOC2), VOC1 t15Bayesian Regularization10000.010.5
BenzeneVoltaget, h, VOC1, VOC2, Te, CO2, VOC1/VOC2110Levenberg-Marquardt1001 × 10−100.005
TolueneVoltaget, h, VOC1, VOC2, Te, VOC1/VOC2, VOC2 t110Levenberg-Marquardt1001 × 10−100.01
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Okorn, K.; Hannigan, M. Applications and Limitations of Quantifying Speciated and Source-Apportioned VOCs with Metal Oxide Sensors. Atmosphere 2021, 12, 1383. https://0-doi-org.brum.beds.ac.uk/10.3390/atmos12111383

AMA Style

Okorn K, Hannigan M. Applications and Limitations of Quantifying Speciated and Source-Apportioned VOCs with Metal Oxide Sensors. Atmosphere. 2021; 12(11):1383. https://0-doi-org.brum.beds.ac.uk/10.3390/atmos12111383

Chicago/Turabian Style

Okorn, Kristen, and Michael Hannigan. 2021. "Applications and Limitations of Quantifying Speciated and Source-Apportioned VOCs with Metal Oxide Sensors" Atmosphere 12, no. 11: 1383. https://0-doi-org.brum.beds.ac.uk/10.3390/atmos12111383

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop