Next Article in Journal
Comparison between Modelled and Measured Magnetic Field Scans of Different Planar Coil Topologies for Stress Sensor Applications
Next Article in Special Issue
Comparative Performance Analysis of Support Vector Machine, Random Forest, Logistic Regression and k-Nearest Neighbours in Rainbow Trout (Oncorhynchus Mykiss) Classification Using Image-Based Features
Previous Article in Journal
Comparison of Tagging Technologies for Safeguards of Copper Canisters for Nuclear Spent Fuel
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Internet of Things System for Underground Mine Air Quality Pollutant Prediction Based on Azure Machine Learning

by
ByungWan Jo
and
Rana Muhammad Asad Khan
*
Department of Civil and Environmental Engineering, Hanyang University, 222 Wangsimni-ro, Seongdong-gu, Seoul 04763, Korea
*
Author to whom correspondence should be addressed.
Submission received: 20 February 2018 / Revised: 15 March 2018 / Accepted: 16 March 2018 / Published: 21 March 2018
(This article belongs to the Special Issue Artificial Intelligence and Machine Learning in Sensors Networks)

Abstract

:
The implementation of wireless sensor networks (WSNs) for monitoring the complex, dynamic, and harsh environment of underground coal mines (UCMs) is sought around the world to enhance safety. However, previously developed smart systems are limited to monitoring or, in a few cases, can report events. Therefore, this study introduces a reliable, efficient, and cost-effective internet of things (IoT) system for air quality monitoring with newly added features of assessment and pollutant prediction. This system is comprised of sensor modules, communication protocols, and a base station, running Azure Machine Learning (AML) Studio over it. Arduino-based sensor modules with eight different parameters were installed at separate locations of an operational UCM. Based on the sensed data, the proposed system assesses mine air quality in terms of the mine environment index (MEI). Principal component analysis (PCA) identified CH4, CO, SO2, and H2S as the most influencing gases significantly affecting mine air quality. The results of PCA were fed into the ANN model in AML studio, which enabled the prediction of MEI. An optimum number of neurons were determined for both actual input and PCA-based input parameters. The results showed a better performance of the PCA-based ANN for MEI prediction, with R2 and RMSE values of 0.6654 and 0.2104, respectively. Therefore, the proposed Arduino and AML-based system enhances mine environmental safety by quickly assessing and predicting mine air quality.

1. Introduction

The harsh and confined working conditions in underground coal mines (UCMs), have led to the listing of the mining industry as the most dangerous profession [1]. In recent years, the adoption of sophisticated regulations has greatly reduced mine accidents; yet, hundreds of miners lose their lives every year. According to the Mine Safety and Health Administration (MSHA), faulty equipment, negligence of labor towards explosions, structure failure, and gas accumulation are the most common causes of underground mine accidents [2]. During the economic year of 2014, in the salt range coal mine in Punjab, Pakistan, more than 35% of accidents occurred due to the accumulation of toxic gases [3]. Therefore, for the safety of workers and the mine itself, it is extremely important to continuously and accurately monitor the mine environment.
In recent years, advancements in the fields of wireless sensor network (WSNs), radio frequency identification (RFID), and cloud computing have led the way toward the development of internet of things (IoT) in the areas of Smart Grids, e-health services, home automation, and environment monitoring. Similarly, with reference to UCMs, the introduction of WSNs and the concept of smart monitoring have enabled accurate, cost-effective, and reliable real-time environmental monitoring. These technologies not only allow monitoring of inaccessible places but also assist in automatic event detection, control, and remote information sharing [4]. For instance, an efficient warning system [5] based on WSN has been introduced for early fire detection in a bord-and-pillar coal mine with capabilities of determining fire direction. Zhang et al. [6] integrated WSN and a cable monitoring system for multi-parameter environmental monitoring at Shangwan Coal Mine, Erdos, China. This system has two operational modes—periodic inspection and interruption service—and its feasibility for real mining conditions has been proven. Similarly, Issac [7] integrated WSN and ambient intelligence to detect the presence of toxic gases in a mine environment and to respond accordingly. This system introduced mobile sensing for the safety of underground mines. Recently, Jo and Khan [8] introduced an open-source, cost-effective, Arduino-based IoT system for early-warning and event reporting in UCMs. Another comprehensive and energy efficient monitoring system [9] based on WSN for early-event detection has been proposed to enhance safety in the challenging environment of UCMs. In addition to all these, detailed literature, focusing on the implementation of WSNs for mine environment monitoring, can be found in [9].
Along with accurate and continuous monitoring, mine air quality assessment and prediction can play a vital role in enhancing mine safety. This would not only reduce miners’ contact with bad air but also would allow efficient control of mine ventilation. However, the complex and non-linear behaviors of air quality variables are beyond the capabilities of a simple mathematical prediction formula [10]. In this regard, various statistical tools, such as ARIMA, multiple linear regression (MLR), artificial neural network (ANN), fuzzy time series, and principal component analysis (PCA) have shown highly satisfactory results for accurately assessing air quality and forecasting pollutant concentrations [11,12,13,14]. Concentrations of toxic gases are the major controlling factors of air quality; therefore, determination of the type and number of pollutants present in a specific environment is of extreme importance. In this regard, the statistical tool PCA has shown a high capability for identifying influencing pollutants, and ANN has enabled more accurate predictions. Therefore, in the harsh environment of UCM, a hybrid approach of PCA and ANN may be able to efficiently identify pollutants, along with giving accurate predictions.
This study aims to develop a reliable, cost-effective, and efficient IoT system for air quality monitoring and assessment in underground mines with the additional features of pollutant identification and air quality forecasting, based on Azure machine learning (AML) cloud computing. This study uses economical clones of Arduino-based sensor modules to monitor eight different air quality parameters (temperature, humidity, CO2, CH4, CO, SO2, H2S, and NO2), applies PCA to identify the most influential variables, and uses ANN to define a prediction model for mine air quality. The key contributions of this study are as follows:
(1)
Arduino-based sensor modules for environmental monitoring in UCM,
(2)
Mine environment index (MEI),
(3)
Use of AML platform for mine air quality prediction,
(4)
Identification of the most influential pollutants present in the mine environment and modelling of mine air quality for an accurate prediction of MEI.
The remainder of this paper is structured as follows: Section 2 summarizes the literature review related to PCA and ANN. Section 3 describes the architecture of the proposed system, discusses the selection of sensors, and introduces the AML. Section 4 focuses on a hybrid approach that integrates PCA and ANN, and it defines the parameters for model building. Section 5 and Section 6 illustrate the sensors’ calibration and system installation. The results of the proposed system are discussed in Section 7. Finally, conclusions are drawn in the last section, followed by the limitations and future considerations.

2. Literature Review

In recent decades, various scientific studies [15,16] have used multivariate statistical approaches, such as cluster analysis (CA), PCA, factor analysis (FA), and discriminant analysis for solving environmental and air quality issues. However, based on the eigenvalues solution, PCA is the most prevailing and simplest technique. Specifically, in air quality problems, it has been used alone or in combination with other approaches. For instance, [17,18] used PCA and CA to figure out the seasonal variations and spatial distribution of PM10 and O3 in the open air. Similarly, Juneng et al. [19] analyzed PM10 concentrations all over Malaysia using rotated PCA. Moreover, PCA, in combination with an enrichment factor, has successfully been implemented in the assessment of the air quality of an indoor charcoal cooking restaurant; it identified the particle fraction of PM2.5 as a possible source of pollution [20]. Therefore, in the present study, PCA is used to identify major pollutant sources present in the mine environment.
Recently, ANN has shown great potential in the fields of engineering, industrial process control, medicines, computing, risk management, and marketing [21]. Several air quality studies [10,22] have utilized ANN to simulate PM10 concentrations, air quality prediction, and other environmental issues. These applications clearly indicate the high capability of ANN to accurately predict in complex environments. Cigzoglu and Kisi [23] accurately predicted air pollution in Istanbul, Turkey using feedforward backpropagation combined with a radial basis function algorithm. In regard to the mining engineering, ANN is not a new concept. An initial example for the adoption of ANN [24] in the mining industry is the real-time control of mineral processing plants. Edwards et al. [25,26] collected data from a smoke sensor installed in a mine to accurately identify the combustibles present in the mine environment. Similarly, Karacan [27,28] conducted a series of modelling, simulation, and real experiments using a neural network to accurately predict the methane gas concentration and automatically control mine ventilation. For air quality in mines, Dixon et al. [29] applied ANN to the gas monitoring data and forecasted the concentration of methane gas inside the mine environment. Similarly, Park et al. [30] simulated ANN for the prediction of the PM10 concentration in metropolitan subway stations in Seoul. They found the prediction accuracy of ANN to be between 60–80% relative to measured values. They also described the effect of the architecture and depth of subway stations on the ANN results. Conclusively, ANN is a valuable technique for enhancing safety in mines through its ability to predict air quality and allow the automatic control of mine ventilation. A member from the family of ANN is multi-layered perception (MLP); this has proved its ability for prediction using time series [31]. It enables easy extraction of precise information from complicated databases. MLP has shown great potential for solving complex environmental problems. For instance, Ramedani et al. [32] used relative humidity, temperature, sunshine duration, and amount of precipitation as input variables of MLP-ANN for predicting global solar radiations. Thus, it can be expected that ANN, including MLP, will be able to provide accurate prediction of air quality in the complex environment of UCMs.

3. Proposed System Design

3.1. System Architecture

The proposed system has been specifically designed for air quality monitoring and assessment in UCMs. Figure 1 shows the basic architecture of the proposed system. The main frame comprises data acquisition, data transmission, data processing for air quality assessment and prediction, and finally, services for information sharing and intelligent control of mine ventilators. In this system, sensing units are based on sensor modules attached to an Arduino UNO (ATmega1280, Atmel, San Jose, CA, USA) [33]. Two sensing units make up the sensor nodes (SNs), which capture air quality related data and transmit this data to the base station via ZigBee. The base station runs AML, which operates as a platform as a service (PaaS). The air quality model extracts pollutant types and predicts air quality depending upon the concentrations of pollutants. Thus, this system enables AML-based decision making and autonomous control. Details of each component of the proposed IoT system are described in upcoming sections.

3.2. System Hardware

Sensor Node: The basic function of a SN is to sense and measure air quality parameters inside the mine environment. Usually, SNs are comprised of sensor modules, a microcontroller, and wireless transmitters. In this study, SN is based on an Arduino UNO microcontroller. Arduino is an open-source, low-cost, and low-power controller which is run without interfacing with a computer because of its ability to load script [34]. It has 2 kB of RAM and 32 kB of program memory and operates at 5 V DC. Arduino UNO can be easily programmed into the Arduino development environment (IDE).
Selection of appropriate sensors for monitoring a mine environment is a relatively complicated issue and it demands consideration of several factors, such as measurement range, accuracy, and sensitivity. The operational temperature range of DTH11 is from −40 °C to 120 °C with ±2 °C and it has the ability to measure humidity of 20–80% with ±5% accuracy. It operates at 3.3–5 V; 2.5 mA is the maximum current usage. Usually, the temperature in a working coal mine varies between 15 °C and 45 °C, and the humidity lies within the specific range of DTH11. These potentials make DTH11 highly suitable for its use in UCMs. Common gases found in UCMs are CH4, CO2, CO, NO2, H2S, and SO2. This study utilizes MQ-4, MQ-9, MQ-811, MQ-136, and MiCS-2714 sensor modules to monitor the concentration of various gases. Among these, most of the sensor modules are metal oxide (tin oxide (SnO2)) based and respond well to volatile gas molecules; thus, they are more reliable and efficient for gas monitoring. In addition, sensor modules, either for gas monitoring or for temperature measurement, are cost-effective, low-power, stable, and are environment ineffective operational clones of the Arduino board. Generally, the gas concentration and sensor resistance have following relationship:
R s = A ( c ) α
where Rs is the sensor resistance, A is a constant, c is the gas concentration and α is the slope of the Rs curve. Table 1 summarizes the detection principles and specifications of each sensor module. The circuit diagram of the sensor modules attached to the Arduino UNO is shown in Figure 2. As this study has been designed to monitor eight different air quality parameters; however, the limited capacity of Arduino UNO offers a great challenge in handling large number of sensor modules. Therefore, by keeping in mind the limited capacity of the Arduino UNO and to make the proposed network economical, each SN was divided into two units. Each unit was mounted and programmed with XBee shield and a specific number of sensor modules.
Communication Protocol: The ZigBee wireless communication protocol has been previously shown to have high transmissivity, stability, ultra-low power, and high communication in underground mines [35]. Therefore, this study used the ZigBee 802.15.4 protocol with a bandwidth of 2.4 GHz. The XBee module (24.38 mm × 27.61 mm) was connected in series with Arduino UNO. This module has a nominal range of 30 m in open air. The ZigBee data throughput was 250 kbps, and it had a receiver sensitivity of −92 dBm, with an error drop-down transmission of ±25. Generally, longer and narrower dimensions of mine tunnels impose difficulties for WSN topology. This study uses cluster topology, in which each cluster is comprised of number of SNs and a cluster head. Figure 3 shows the visual aspects of Arduino Uno, Xbee module, and sensor modules.
Base Station: The base station functions as a data logger; it collects the raw data, arranges it and pushes the arranged data to AML. In this case, the base station is a PC server (Intel® Xeon E5420 2.5 GHz with 8 Gb RAM) with Windows 7 (Microsoft, Redmond, WA, USA) as the operating system.

3.3. Azure Machine Learning for Cloud Computing

Cloud services in combination with machine learning (ML) are constantly helping organizations to grow their businesses by providing massive storage and data processing ability to allow the discovery of useful patterns and trends. With the enormous number of growing datasets, the traditional libraries for the ML are becoming insufficient. Therefore, the present study utilized AML for the development, training, and testing of the mine air quality ANN model. AML is the R programming, based on an open, drag-and-drop and collaborative platform, mainly used by IT professionals, developers, and the public. It is a comprehensive set of cloud computing services equipped with learning modules, data pre-processing, statistical tools, a SQL server, and an API to launch the model in the application [36]. The main objective of adopting this ontology is to provide a simple method for data storage, computation, streaming, and sharing services for diversified IoT applications. The screenshot of the prepared model is shown in Figure 4.

4. Methods and Models

4.1. Mine Environment Index (MEI)

Various indices, such as the air quality index (AQI) [37], air pollution index (API) [38], and indoor air quality index (IAIQ) [39] have been introduced as key tools for easy and quick assessment of air quality in various environments and to predict pollutant concentrations. Among these indices, AQI, introduced by United Stated Environmental Protection Agency (US-EPA), is the most widely adopted index for the representation of open air environments. The constitutive components of this index are CO, SO2, PM10, O3, and NO2, which are commonly present in open air. As people spend 90% of their time in indoor environments [40], therefore, some researchers have also introduced indoor air quality indices. Compared to open air and indoor ambience, the environment in underground coal mines is relatively harsh, confined, and toxic because of the presence of gases such as CO, CO2, CH4, SO2, NO2, and H2S, which are emitted from coal beds during excavation. Thus, the indices defined for open or indoor air quality are insufficient to fully represent underground mine air quality. There should be an index available that gives a true representation of the mine environment and can readily assesses mine air quality.
However, despite extensive research on ventilation systems for underground mines, the mining industry and underground structures still lack such an index for the true representation of air quality. Therefore, for quick assessment and easy interpretation of mine air quality, this study introduces the mine environment index (MEI). This index is coined from two individual indices: the mine air quality index (MAQI) and the thermal comfort index (TCI). MAQI relies on the concentration of air pollutants, while TCI is mainly concerned with comfortable working conditions, such as temperature and humidity. MAQI has been assigned a weighting of 0.7 because of its major contribution to the mine environment, and a weighting of 0.3 has been given to TCI. Thus,
MEI = 0.7 ( MAQI ) + 0.3 ( TCI )
MAQI has been defined in a similar manner as to AQI, but it has different variables. Its representative equation is the same as that of AQI for open-air and is given as
MAQI p = ( MAQI Hi MAQI Lo ) ( BP Hi BP Lo ) × ( C p BP Lo ) + I Lo
where, MAQIP is the index value for pollutant p, CP is the input concentration of a given pollutant p, BPHi is the higher breakpoint that is ≥CP, BPLo is the lower breakpoint that is ≤CP, MAQIHi is the index breakpoint value corresponding to BPHi, and MAQILo is the index breakpoint value corresponding to BPLo. The MAQI values have been categorized into five status categories: very good, good, moderate, poor, and very poor. The limiting values for each category are summarized in Table 2.
Regarding its application in mines, the TCI is the thermal comfort level of miners, and it is highly dependent on the ambient temperature and humidity. It can be given as
TCI = 42.379 + ( 2.0491523 × T ) + ( 10.14333127 × r h ) ( 0.22475541 × T × r h ) ( 6.83783 × 10 3 × T 2 ) ( 5.481717 × 10 2 × r h 2 ) + ( 1.22874 × 10 3 × T 2 × r h ) + ( 8.5282 × 10 4 × T × r h 2 ) ( 1.99 × 10 6 × T 2 × r h 2 )
where, T is the temperature measured in °F and rh is the relative humidity. The breakpoints for MEI are summarized in Table 3.

4.2. Data Pre-Processing

Prior to implementing any statistical approach, it is necessary to pre-process the collected data so that substantial characteristics of the sensors’ responses can be extracted to produce features for further processing. In the present study, transformation, as an initial step of pre-processing, was carried out using z-scale transformation with a mean of 0 and a standard deviation of 1, given as
z i j = ( x i j μ ) / σ
zij is the jth value of variable i, xij is the jth observation of variable i, μ is the mean and σ is the standard deviation. z-scale transformation ensures the equal weight of variables for any statistical process. This transformation also homogenizes the distribution variance and reduces the probability of any error arising because of the different sizes of data sets [43]. Finally, the sphericity and correlations between various air pollutants were determined using Bartlett’s test with a high significance level of (p < 0.0001) and a threshold limit of 0.5.

4.3. PCA Modelling

One of the most prevailing and valuable statistical approaches is PCA, which compresses and transforms m-dimension data into a new dataset of n-dimensions, where n < m. It uncovers the potential structure of the set of variables without losing important information. PCA transforms a large data set of interrelated variables into new and uncorrelated variables, known as principal components (PCs). The PCs are orthogonal, uncorrelated and have a linear relationship with variables of the original dataset, given as
P C p = w 1 p x 1 + w 2 p x 2 + ... + w n p x n
where the notation for the pth PC for the overall n number of data is PCp, Wnp is the regression coefficient weight determined by PCA, and xn is the adjusted matrix. The extraction of PCs usually occurs in the gradual decreasing order of their variance. In order to determine the optimal number of PCs, several approaches exist: Broken Stick rule, Velicer’s partial correlation procedure, Cattell’s scree test, and cumulative percentage of variance. More specific to the air quality problem, Kaiser’s criteria [44] of eigenvalue-one is the most commonly used method for the selection of optimal numbers of PCs. PCA has shown a high capability for identifying the most significant air pollutant. Therefore, this study used PCA along with Kaiser’s approach to determine the most significant pollutant present in the mine environment.

4.4. MLP-ANN Modelling

ANN is the most widely accepted information processing system in artificial intelligence, intended as a generalized mathematical model of the brain system [45]. ANNs are made up of several interconnected neurons and have the capability to change their structure based on internal or external data. ANN can be trained for non-linear and complex data, which makes it highly suitable for the faster interpolation of predictions, clustering, and classification, as compared to digital computer systems. The prediction results in ANN are highly dependent on the number of input variables and assigned weights of each neuron [46]. A few studies [47] have reported the use of PCA outputs as input variables for MLP-ANN and have proved the validity of this approach in decision making. Similarly, in this study, the outputs of PCA were used as inputs for the MLP-ANN model to predict the air quality accurately. A MLP-ANN network was fed with PCA results for the identification of pollutant sources that significantly affect the MEI.
In the network of MLP-ANN, the first layer is the input layer, responsible for the information collection, error removal, and transmission of data to the ANN structure. The second layer is the hidden layer, with arbitrary numbers of neurons and several layers. The number of hidden layers in the present study was set to two. In this network, the neurons activate during the functions of feedforward and backward propagation providing the connections between the layers. Each neuron interacts with other neurons depending upon its ability to make connections. Based on the interaction of neutrons, each neuron is assigned a weight. The most common interactive function is the sigmoid function for the transmission of information between layers. Hidden layers receive and transmit the data between input and output layers. The hidden layer hands over the data to the next layer until the output layer is reached, to provide the output. Finally, the process outputs, after processing the collected data, are provided to the output layer through nodes in each layer connected to each other. The used MLP-ANN architecture with inputs from PCA to determine the MEI is shown in Figure 5.
The accuracy and training capacities of the MLP-ANN model are highly dependent on the selection of the optimum number of neurons. If more neurons are present, the model will be over fitted and converge quickly; if too few neurons are present, the model will be less accurate and not trained properly [48]. In the present case, the MLP-ANN model was designed in AML with weights determined by the randomization function. The model was trained using 70% of the entire data set using the Levenberg-Marquardt (LM) training algorithm [49], and the remaining 30% of the data set was used for testing. This training algorithm was adopted because of its high speed, high efficiency, and because it precisely trains the network with a standardized range: 0–1. For model learning, a widely accepted approach for environmental studies, gradient descent with momentum back propagation (0.5), was adopted. A goal of 0.001 for the mean absolute error (MAE) or 1000 epoch was set; training was continued until the model fulfilled either of these conditions. Thus, MAE and epoch numbers were the stopping criteria for training. The same process was repeated by varying the number of neurons to determine the optimal number of neurons in the hidden layer. The characteristics of MLP-ANN training are summarized in Table 4.
The optimum number of hidden neurons were determined with a hit and trail approach. The output values determined in the forward phase were transmitted back from the hidden layer, and the weights of each node adjusted themselves accordingly to minimize error, relative to original values. The performance of MLP-ANN was monitored using MAE, root mean square error (RMSE), relative absolute error (RAE), relative square error (RSE), and coefficient of determination (R2).
MAE = i = 1 N | y exp y c a l | n
RMSE = ( 1 N i = 1 N ( y exp y c a l ) 2 ) 0.5
RAE = i = 1 n | y exp y c a l | i = 1 n | y ¯ y c a l |
RSE = i = 1 n ( y exp y c a l ) 2 i = 1 n ( y ¯ y c a l ) 2
R 2 = i = 1 n ( y p i y o m ) 2 i = 1 n ( y p i y o m ) 2 + i = 1 n ( y o i y p i ) 2
where yoi is the observed value, ypi is the estimated value, and yom is the average of observed values.

5. Calibration of Sensors and Sensor Nodes

In this study, all the sensors were calibrated in ambient air conditions, but our discussion is limited to the CO2 sensor. Calibration of each gas sensor was carried out on a laboratory scale under normal environmental conditions (25 °C) with the help of a sealed gas test box, SR3 (235 mm × 180 mm × 210 mm). This box can be air tightened, and it has been specially designed for the calibration of gas sensors. Before calibration, the box was opened in normal air conditions and an attached mixing fan was turned on for 4 min. Afterwards, the box lid was tightened, and CO2 was introduced into the box with the help of a syringe. The syringe was filled with 99.95% pure CO2, extracted from the gas cylinder using a syringe adaptor. After injecting CO2, the mixing fan was turned on for 1 min. Before recording the readings from the sensor, a time lapse of 45 s was given, as the response time of the CO2 sensor is 30 s. The sensor readings were recorded for 2 min, and the lid was opened in conjunction with turning on the mixing fan for 3 min.
The Arduino-based developed SNs were validated using a commercial instrument Aeroqual-900 (Aeroqual, Avondale, Auckland, New Zealand) with additional sensors of temperature and humidity. For this purpose, the prepared SNs and commercial equivalents were placed in a glass sealed container of 170 cm× 90 cm× 50 cm at normal room temperature (25 °C). The readings were recorded for more than three hours and regression plots of sensed data in comparison with commercial equivalents are shown in Figure 6. Regression plots of all gases showed a linear relationship between the injected concentration of gases and the readings shown by the sensors. In most of the cases, the coefficient of determination was found to be greater than 0.95, except in the case of the MiCS-2714 sensor (0.90), which was still in the acceptable range. The means and standard deviations of the SNs along with their commercial equivalents are summarized in Table 5. The table demonstrates that the means and standard deviations of almost all of the parameters are comparable. These values of means and standard deviations indicate that the developed nodes have same responses as those of commercial equivalents; therefore, the readings from prepared nodes are reliable for use in UCMs.

6. Test Bed

The mine tunnel used for implementation of the current system is the main roadway of an operating UCM. The dimensions of these mine tunnels were 1.8 m × 2.12 m, and almost all tunnels were supported with wooden planks at spacings of 1 m (Figure 7b). The units of SN programmed for XBee shield and sensor modules were attached to the rooftop center of the mine tunnel opening. Data transmission from both units was synchronized, based on the consensus synchronization algorithm. All of the SNs were attached to cluster heads via the ZigBee protocol. Finally, the sensed data were collected at the base station installed at the mine office. This base station arranges all the data into the specific format required by the AML. The AML model pre-processes the data to determine any missing values, trains itself by splitting the data, and then predicts the air quality.
The system’s installation was completed on 26 July 2016. Its first time start up took 90 s, and 1.5 kHz was the refresh rate. For the initial two days, the data collection rate was set at 15 min. Thus, after two days, there were 3104 samples collected, with 97 readings for each sensor per day. In the entire dataset, there were only 15 missing values, indicating the reliability of the data collection method. Among this data set, 2484 readings were used to train the model, and testing was carried out on 605 samples. At the completion of training and sensor evaluation, an extensive monitoring program was started on 29 July 2016 at midnight. Currently, this system predicts MEI recalling datasets from longer than a month.

7. Results and Discussion

7.1. Air Pollution Source Identification

The observed value of chi-square, 7.9 × 102 (p < 0.05, df = 28), obtained from Bartlett’s test showed that the air quality test meets the sphericity assumption. This test also indicated that the variables were correlated and not orthogonal; thus, PCA can allow the interpretation of the involved parameters with a much lower number of components, compared to the original number of variables. The input to PCA was sensor readings of eight different variables. PCA extraction gave principle components (PCs). The selection of the most significant components from these eight PCs was carried out on the basis of having an eigenvalue greater than 1. At this stage, the components having eigenvalues less than 1 were neglected because of being redundant to more important factors. The eigenvalues and percentages of variance of each extracted component are summarized by Table 6. This table and scree plot (Figure 8) clearly indicates that the first two components have eigenvalues greater than 1 and their cumulative variance is 79.370%, whereby, 65.61% was explained by the first component only, and 13.76% was explained by the second component. Therefore, it is easy to say that the most effective component is the first PC followed by the second based on their eigenvalues.
In order to completely understand and to accurately interpret the data, PCs were rotated using varimax rotation. Table 7 shows the eigenvalues of PCs after rotation. The relative significance and components’ structures have been optimized with the adoption of rotation. After rotation, the variance percentages of the first PC reduced by more than 3%, from 65.610% to 62.894%. On the contrary, the variance percentages of the second PC increased, indicating the major contribution of the first PC; this shows that the PC1 is more correlated with the data relative to PC2.
In this study, the vari-factors with absolute values greater than 0.85, indicating relatively strong loadings, were set as threshold limiting values. Table 7 indicates five out of eight components which satisfy the condition of the 0.85 threshold limit value. These components are the major pollution factors in the air of this mine. These pollutants are CH4, CO2, SO2 and CO along with temperature. In these pollutants, PC1 contributes 65.16% to the loading factors of temperature, CH4, CO2, CO, and SO2. These influencing factors are mostly related to the common gases present in mine environment of UCMs. Among these gases, CH4 and SO2 are generally confined by the coal bed in the form of gas pockets. During the excavation of coal, these pockets burst out and release gases into mine air. The major cause of CO2 in the mine environment may be because of the breathing activity of workers as well as the exhaust from any diesel operated machinery. On the other hand, CO is only the loading factor with a contributing factor from PC2. Table 8 shows the correlations of both selected PCs.

7.2. Prediction Results

In order to develop the MLP-ANN model, network structures with various numbers of neurons were tested, using both original and PCA extracted components as inputs. In these MLP-ANN structures, the optimum number of neurons were determined based on MAE, RMSE, RAE, RSE, and R2. The number of neurons was gradually increased, one-by-one, and errors were determined against each number of neurons. In the case of the original data sets, trials were initially started with eight neurons and this number was increased until the minimum number of errors was indicated. In the case of PCs, the trials were initially started with three neurons which were gradually increased to determine the optimum number of neurons. In the MLP-ANN process, the sigmoid transfer function provided the optimal activation function. A validation model with different numbers of neurons for both original and PCA extracted components as inputs is summarized in Table 9. For the original data set, the optimum number of neurons to give the minimum error was 18, and for the PCA extracted data sets, the optimum number of neurons was 6.
The performance of the proposed model was compared with models defined by multi-linear regression (MLR), PCA-MLR, and ANN. For all of these models, the MAE, RMSE, RAE, and RSE were compared, as the error values close to zero indicate a better model. On the other hand, the accuracy of the model was checked by calculating the coefficient of determination (R2). In these tests, the value of the model with high accuracy was close to one. Hence, the accuracy of model can vary depending upon the time interval required for prediction. In the case of underground mines, there are several limitations; therefore, it is necessary to predict gas concentration. The best prediction model was PCA-ANN, with MAE, RMSE, RAE, and RSE having values of 0.1519, 0.2104, 0.7619 and 0.6818 respectively. Moreover, the coefficient of determination, R2, was 0.6654; this is illustrated in Table 10.
PCA improved the accuracy of the linear regression model approximately by 2.1%, and 16.9% in the case of ANN. The results show that the PCA in combination with MLP-ANN improves the prediction accuracy of mine air pollutants. Moreover, the PCA extracts important information about the major pollutants present in the mine environment. Thus, the application of PCA is helpful for prediction studies specifically related to air quality.

8. Conclusions

In recent years, the computing power of cloud services has revolutionized the solution of complex and nonlinear data problems. Air pollutants present in UCMs have always showed non-linearity, ultimately causing uncertain predictions and lower reliability for early warnings. Therefore, this paper introduced an IoT-based mine air quality monitoring, assessment, and forecasting system that utilizes AML cloud computing to predict air quality and has the potential to widely enhance underground mine safety through early warnings. The system was installed in an operating underground coal mine, and it advocated an air quality assessment model to determine MEI.
The following are the conclusions:
(i)
In this system, IoT-sensors were used to monitor mine environment related parameters, and the limiting values of each parameter were defined separately to determine the mine’s air quality in terms of MEI. The calibration of the prepared SNs with regression constants was always greater than 95% for almost every parameter, which confirmed the reliability of system.
(ii)
Two different MLP-ANN models were designed in AML studio: one for PCA applied outputs to the dataset and other one was for original dataset.
(iii)
This system effectively alleviated the non-linear behavior of mine environment variables, such as temperature, humidity, CO2, CH4, CO, SO2, and H2S by pre-processing the sensors’ data. The parameters’ data was uploaded into AML studio as input data sets to the MLP-ANN model. Bartlett’s test confirmed the co-relations and non-orthogonality of the monitored variables. The PCA results indicated four mine gases (CH4, SO2, CO, and H2S) as having the most significant influences on mine air quality. Multi-layer perceptron from the family of ANN accurately predicted the MEI. As the accuracy and efficiency of the MLP-ANN model is highly dependent on the input parameters and hidden layer neurons numbers, the optimum number of hidden layer neurons was determined by observing minimum error.
(iv)
The proposed ANN-PCA model is 14.8%, and approximately 3%, more accurate, compared to linear regression and ANN models, respectively. This study suggests that an appropriately trained MLP-ANN model can effectively forecast MEI. Moreover, Azure Machine Learning enabled quick data processing with easy web service, based on an easy graphical user interface.
Despite the test results indicating accurate forecasting, some limitations in this study still exist. These limitations are the harsh environment of underground mines, data privacy, and the integration of multi-sensors’ outputs. Moreover, the present study only considered eight air quality parameters and ignored parameters which may affect the mine environment more severely. This study relied on the concentrations of air pollutants over time, while the other factors related to the forecasting effectiveness were ignored. We propose the following future directions: Firstly, determination of the complex non-linear behavior of pollutants’ concentrations demands a more precise hybrid model which would enhance early-warnings. Secondly, high pollutant concentrations are the major contributing factors to air quality; therefore, development of a model for forecasting peak air pollutant concentrations is required.

Acknowledgments

We are thankful to Jun Ho Jo, Taewoon Kim, and Omer Javaid for their continuous efforts in the completion and writing assistance of this paper.

Author Contributions

ByungWan Jo conceived the idea, provided the materials and helped with the technical aspects. Rana Muhammad Asad Khan implemented the proposed system, collected data, run various data analysis on the collected data, and wrote this paper.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Wang, H.; Cheng, Y.; Yuan, L. Gas outburst disasters and the mining technology of key protective seam in coal seam group in the Huainan coalfield. Nat. Hazards 2013, 67, 763–782. [Google Scholar] [CrossRef]
  2. MSHA. Accident/Illness Investigations Procedures. Available online: https://arlweb.msha.gov/READROOM/HANDBOOK/PH11-I-1.pdf (accessed on 19 February 2018).
  3. Annual Report, 2011 of Chief Inspector of Mines, Punjab. Available online: https://cim.punjab.gov.pk/system/files/Annual%20Report-14.pdf (accessed on 19 February 2018).
  4. Vural, S.; Ekici, E. Analysis of hop-distance relationship in spatially random sensor networks. In Proceedings of the 6th ACM International Symposium on MOBILE Ad Hoc Networking and Computing, Urbana-Champaign, IL, USA, 25–28 May 2005; pp. 320–331. [Google Scholar]
  5. Bhattacharjee, S.; Roy, P.; Ghosh, S.; Misra, S.; Obaidat, M.S. Wireless sensor network-based fire detection, alarming, monitoring and prevention system for Bord-and-Pillar coal mines. J. Syst. Softw. 2012, 85, 571–581. [Google Scholar] [CrossRef]
  6. Zhang, Y.; Yang, W.; Han, D.; Kim, Y.-I. An integrated environment monitoring system for underground coal mines—Wireless sensor network subsystem with multi-parameter monitoring. Sensors 2014, 14, 13149–13170. [Google Scholar] [CrossRef] [PubMed]
  7. Osunmakinde, I.O. Towards safety from toxic gases in underground mines using wireless sensor networks and ambient intelligence. Int. J. Distrib. Sens. Netw. 2013, 9. [Google Scholar] [CrossRef]
  8. Jo, B.W.; Khan, R.M.A. An event reporting and early-warning safety system based on the internet of things for underground coal mines: A case study. Appl. Sci 2017, 7, 925. [Google Scholar]
  9. Minhas, U.I.; Naqvi, I.H.; Qaisar, S.; Ali, K.; Shahid, S.; Aslam, M.A. A WSN for monitoring and event reporting in underground mine environments. IEEE Syst. J. 2017. [Google Scholar] [CrossRef]
  10. Mutalib, S.N.S.A.; Juahir, H.; Azid, A.; Sharif, S.M.; Latif, M.T.; Aris, A.Z.; Zain, S.M.; Dominick, D. Spatial and temporal air quality pattern recognition using environmetric techniques: A case study in Malaysia. Environ. Sci. Process. Impacts 2013, 15, 1717–1728. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  11. Biancofiore, F.; Busilacchio, M.; Verdecchia, M.; Tomassetti, B.; Aruffo, E.; Bianco, S.; Di Tommaso, S.; Colangeli, C.; Rosatelli, G.; Di Carlo, P. Recursive neural network model for analysis and forecast of PM10 and PM2.5. Atmos. Pollut. Res. 2017, 8, 652–659. [Google Scholar] [CrossRef]
  12. Cobourn, W.G. An enhanced PM2.5 air quality forecast model based on nonlinear regression and back-trajectory concentrations. Atmos. Environ. 2010, 44, 3015–3023. [Google Scholar] [CrossRef]
  13. Domańska, D.; Wojtylak, M. Application of fuzzy time series models for forecasting pollution concentrations. Expert Syst. Appl. 2012, 39, 7673–7679. [Google Scholar] [CrossRef]
  14. Pearce, J.L.; Beringer, J.; Nicholls, N.; Hyndman, R.J.; Tapper, N.J. Quantifying the influence of local meteorology on air quality using generalized additive models. Atmos. Environ. 2011, 45, 1328–1336. [Google Scholar] [CrossRef]
  15. Harrison, R.M.; Deacon, A.R.; Jones, M.R.; Appleby, R.S. Sources and processes affecting concentrations of PM10 and PM2. 5 particulate matter in Birmingham (UK). Atmos. Environ. 1997, 31, 4103–4117. [Google Scholar] [CrossRef]
  16. Grivas, G.; Chaloulakou, A.; Kassomenos, P. An overview of the PM10 pollution problem, in the metropolitan area of Athens, Greece. Assessment of controlling factors and potential impact of long range transport. Sci. Total Environ. 2008, 389, 165–177. [Google Scholar] [CrossRef] [PubMed]
  17. So, K.; Wang, T. On the local and regional influence on ground-level ozone concentrations in Hong Kong. Environ. Pollut. 2003, 123, 307–317. [Google Scholar] [CrossRef]
  18. Shah, M.H.; Shaheen, N. Annual and seasonal variations of trace metals in atmospheric suspended particulate matter in Islamabad, Pakistan. Water Air Soil Pollut. 2008, 190, 13–25. [Google Scholar] [CrossRef]
  19. Juneng, L.; Latif, M.T.; Tangang, F.T.; Mansor, H. Spatio-temporal characteristics of PM10 concentration across Malaysia. Atmos. Environ. 2009, 43, 4584–4594. [Google Scholar] [CrossRef]
  20. Taner, S.; Pekey, B.; Pekey, H. Fine particulate matter in the indoor air of barbeque restaurants: Elemental compositions, sources and health risks. Sci. Total Environ. 2013, 454, 79–87. [Google Scholar] [CrossRef] [PubMed]
  21. Palade, V.; Neagu, C.-D.; Patton, R.J. Interpretation of trained neural networks by rule extraction. In Proceedings of the International Conference on Computational Intelligence, Dortmund, Germany, 1–3 October 2001; pp. 152–161. [Google Scholar]
  22. Sousa, S.; Martins, F.; Alvim-Ferraz, M.; Pereira, M.C. Multiple linear regression and artificial neural networks based on principal components to predict ozone concentrations. Environ. Model. Softw. 2007, 22, 97–103. [Google Scholar] [CrossRef]
  23. Cigizoglu, H.K.; Kisi, Ö. Methods to improve the neural network performance in suspended sediment estimation. J. Hydrol. 2006, 317, 221–238. [Google Scholar] [CrossRef]
  24. Hodouin, D.; Thibault, J.; Flamemt, F. Artificial neural networks: An emerging technique to model and control mineral processing plants. In Proceedings of the 120th Annual Meeting of the Society of Mining, Metallurgy and Exploration Inc., Denver, CO, USA, 25–28 February 1991. [Google Scholar]
  25. Edwards, J.; Friel, G.; Franks, R.; Lazzara, C.; Opferman, J. Mine fire source discrimination using fire sensors and neural network analysis. In Proceedings of the 2000 Technical Meeting of Central States Section of The Combustion Institute, Indianapolis, IN, USA, 16–18 April 2000; pp. 207–211. [Google Scholar]
  26. Edwards, J.; Franks, R.; Friel, G.; Lazzara, C.; Opferman, J. Real-time neural network application to mine fire-nuisance emissions discrimination. In Proceedings of the 10th US/North American Mine Ventilation Symposium, Anchorage, AK, USA, 16–19 May 2004; pp. 425–431. [Google Scholar]
  27. Karacan, C.Ö. Modeling and prediction of ventilation methane emissions of US longwall mines using supervised artificial neural networks. Int. J. Coal Geol. 2008, 73, 371–387. [Google Scholar] [CrossRef]
  28. Karacan, C.Ö. Forecasting gob gas venthole production performances using intelligent computing methods for optimum methane control in longwall coal mines. Int. J. Coal Geol. 2009, 79, 131–144. [Google Scholar] [CrossRef]
  29. Dixon, D.; Özveren, C.; Sapulek, A.; Tuck, M. The Application of Neural Networks to Underground Methane Prediction; Society for Mining, Metallurgy, and Exploration, Inc.: Littleton, CO, USA, 1995. [Google Scholar]
  30. Park, S.; Kim, M.; Kim, M.; Namgung, H.-G.; Kim, K.-T.; Cho, K.H.; Kwon, S.-B. Predicting PM10 concentration in Seoul metropolitan subway stations using artificial neural network (ANN). J. Hazard. Mater. 2018, 341, 75–82. [Google Scholar] [CrossRef] [PubMed]
  31. Voyant, C.; Nivet, M.L.; Paoli, C.; Muselli, M.; Notton, G. Meteorological Time Series Forecasting Based on MLP Modelling Using Heterogeneous Transfer Functions. J. Phys. Conf. Ser. 2015, 274, 012064. [Google Scholar] [CrossRef]
  32. Ramedani, Z.; Omid, M.; Keyhani, A. Modeling solar energy potential in a Tehran province using artificial neural networks. Int. J. Green Energy 2013, 10, 427–441. [Google Scholar] [CrossRef]
  33. Arduino Uno. Available online: https://www.arduino.cc/en/Guide/Introduction (accessed on 19 February 2018).
  34. Salamone, F.; Belussi, L.; Danza, L.; Ghellere, M.; Meroni, I. Design and development of nEMoS, an all-in-one, low-cost, web-connected and 3D-printed device for environmental analysis. Sensors 2015, 15, 13012–13027. [Google Scholar] [CrossRef] [PubMed]
  35. Moridi, M.A.; Kawamura, Y.; Sharifzadeh, M.; Chanda, E.K.; Jang, H. An investigation of underground monitoring and communication system based on radio waves attenuation using ZigBee. Tunn. Undergr. Space Technol. 2014, 43, 362–369. [Google Scholar] [CrossRef]
  36. Familiar, B.; Barnes, J. Business in Real-Time Using Azure IoT and Cortana Intelligence Suite; Apress: Berkeley, CA, USA, 2017. [Google Scholar]
  37. EPA. Air Quality Index: A Guide to Air Quality and Your Health. Washington, DC, USA, 2009. Available online: https://nepis.epa.gov/Exe/ZyPURL.cgi?Dockey=P100BZIZ.TXT (accessed on 20 March 2018).
  38. Ashirae Standard. Ventilation for Acceptable Indoor Air Quality. Available online: https://www.mintie.com/assets/pdf/education/ASHRAE%2062.1-2007.pdf (accessed on 20 March 2018).
  39. World Health Organization. WHO Guidelines for Indoor Air Quality: Selected Pollutants; World Health Organization: Geneva, Switzerland, 2010; Available online: https://wedocs.unep.org/handle/20.500.11822/8676 (accessed on 20 March 2018).
  40. EPA. Green Building. Available online: https://archive.epa.gov/greenbuilding/web/html/ (accessed on 18 February 2018).
  41. Center for Disease Control and Prevention. Statistics: All Mining. Available online: https://www.cdc.gov/niosh/mining/statistics/allmining.html (accessed on 18 February 2018).
  42. Moridi, M.A.; Kawamura, Y.; Sharifzadeh, M.; Chanda, E.K.; Wagner, M.; Jang, H.; Okawa, H. Development of underground mine monitoring and communication system integrated ZigBee and GIS. Int. J. Min. Sci. Technol. 2015, 25, 811–818. [Google Scholar] [CrossRef]
  43. Simeonov, V.; Einax, J.; Stanimirova, I.; Kraft, J. Environmetric modeling and interpretation of river water monitoring data. Anal. Bioanal. Chem. 2002, 374, 898–905. [Google Scholar] [PubMed]
  44. Kaiser, H.F. The application of electronic computers to factor analysis. Educ. Psychol. Meas. 1960, 20, 141–151. [Google Scholar] [CrossRef]
  45. Gholami, M.; Cai, N.; Brennan, R. An artificial neural network approach to the problem of wireless sensors network localization. Robot. Comput. Integr. Manuf. 2013, 29, 96–109. [Google Scholar] [CrossRef]
  46. Chaloulakou, A.; Grivas, G.; Spyrellis, N. Neural network and multiple regression models for PM10 prediction in Athens: A comparative assessment. J. Air Waste Manag. Assoc. 2003, 53, 1183–1190. [Google Scholar] [CrossRef] [PubMed]
  47. Zekić-Sušac, M.; Šarlija, N.; Pfeifer, S. Combining PCA analysis and artificial neural networks in modelling entrepreneurial intentions of students. Croat. Oper. Res. Rev. 2013, 4, 306–317. [Google Scholar]
  48. Cybenko, G. Approximation by superpositions of a sigmoidal function. Math. Control Signals Syst. 1989, 2, 303–314. [Google Scholar] [CrossRef]
  49. Marquardt, D.W. An algorithm for least-squares estimation of nonlinear parameters. J. Soc. Ind. Appl. Math. 1963, 11, 431–441. [Google Scholar] [CrossRef]
Figure 1. System Architecture comprising Arduino as a microcontroller for air quality monitoring in mines, based on Azure Machine Learning (AML), for air quality prediction.
Figure 1. System Architecture comprising Arduino as a microcontroller for air quality monitoring in mines, based on Azure Machine Learning (AML), for air quality prediction.
Sensors 18 00930 g001
Figure 2. Circuit diagram of each sensor modules attached to the Arduino UNO based sensor nodes (SN) (a) attached MQ4, MQ136, and MQ9, (b) attached MiCs2714, MQ-811, and DTH11.
Figure 2. Circuit diagram of each sensor modules attached to the Arduino UNO based sensor nodes (SN) (a) attached MQ4, MQ136, and MQ9, (b) attached MiCs2714, MQ-811, and DTH11.
Sensors 18 00930 g002
Figure 3. (a) Front pictorial aspect of Arduino UNO, (b) back-side pictorial aspect of Arduino UNO, (c) 2.4 GHz Zigbee shield for Arduino, and (d) visual aspect of MQ136, MQ4, MQ9, MQ811, MiCs2714, and DHT11.
Figure 3. (a) Front pictorial aspect of Arduino UNO, (b) back-side pictorial aspect of Arduino UNO, (c) 2.4 GHz Zigbee shield for Arduino, and (d) visual aspect of MQ136, MQ4, MQ9, MQ811, MiCs2714, and DHT11.
Sensors 18 00930 g003
Figure 4. Prepared artificial neural network (ANN) model in Azure Machine Learning (AML) Studio.
Figure 4. Prepared artificial neural network (ANN) model in Azure Machine Learning (AML) Studio.
Sensors 18 00930 g004
Figure 5. Network architecture of the multilayer perception-artificial neural network (MLP-ANN) for the mine environment index (MEI).
Figure 5. Network architecture of the multilayer perception-artificial neural network (MLP-ANN) for the mine environment index (MEI).
Sensors 18 00930 g005
Figure 6. Correlation and regression plots of sensor nodes 1 and 2 for the attached sensor modules, (a,b) temperature, (c,d) humidity, (e,f) CH4, (g,h) CO2, (i,j) CO, (k,l) H2S, (m,n) SO2, and (o,p) NO2, in comparison to their commercial equivalents.
Figure 6. Correlation and regression plots of sensor nodes 1 and 2 for the attached sensor modules, (a,b) temperature, (c,d) humidity, (e,f) CH4, (g,h) CO2, (i,j) CO, (k,l) H2S, (m,n) SO2, and (o,p) NO2, in comparison to their commercial equivalents.
Sensors 18 00930 g006
Figure 7. (a) Mine portal opening, (b) a closed entry of the mine with installed wooden support.
Figure 7. (a) Mine portal opening, (b) a closed entry of the mine with installed wooden support.
Sensors 18 00930 g007
Figure 8. Scree plot for PCA with normalized eigenvalues.
Figure 8. Scree plot for PCA with normalized eigenvalues.
Sensors 18 00930 g008
Table 1. Sensors used for experimentation and their characteristics.
Table 1. Sensors used for experimentation and their characteristics.
CharacteristicsSensor Module
MQ9MQ4MQ811DTH11MQ136MiCs2714
Sensor typeMOSMOSMOSNegative Temperature coefficient (NTC) sensorMetal Oxide Semiconductor (MOS)MOS and Microelectro-mechanical systems (MEMS)
Target gasCOCH4CO2Temperature and humiditySO2 and H2SNO2
Typical detection range10–1000 ppm200–1000 ppm350–10,000 ppm20–90% RH *, 0–50 °C0–100 ppm (SO2 and H2S)0.05–5 ppm
Accuracy±5%±5%±5%±5% RH, ±2 °C temperature±5%±5%
Sensitive materialSnO2SnO2SnO2--SnO2--
SensitivityHighHighHigh±0.7 °C3 ppm0.25 ppm
Respond time20 s20 s20 s6 s (hum) and 10 s (temp)30 s30 s
ManufacturerHanwei electronic Co.Hanwei electronics Co.Hanwei electronics Co./Sandbox electronicsAosong electronics Co.Microelectr-onics Ltd.SGX sensor Tech.
* Relative humidity.
Table 2. Breakpoints of various gases for mine air quality index (MAQI) and its categories [41,42].
Table 2. Breakpoints of various gases for mine air quality index (MAQI) and its categories [41,42].
ConditionGases (ppm)
NO2COSO2H2SCH4CO2
Very good0–10–120–2.50–30–10000–2000
Good1.1–2.013–222.6–4.03.1–51001–20002001–3000
Moderate2.1–3.023–304.1–6.05.1–12.92001–40003001–4000
Poor3.1–431–496.1–8.013–204001–50004001–5000
Very poor>4>50>8>20>5000>5000
Table 3. Break points for the mine environmental index (MEI).
Table 3. Break points for the mine environmental index (MEI).
MEIMAQITCI
0–50Very goodMostly comfortable
51–100GoodComfortable
101–200ModerateNeutral
201–300PoorDiscomfort
301–500Very poorLeast comfort
Table 4. Training parameters for the multilayer perceptron (MLP) model.
Table 4. Training parameters for the multilayer perceptron (MLP) model.
Training ParametersValue
Sample:
Training samples: 2484
Testing samples: 605
Missing values: 15
3104
Input parameters8
Hidden neuronsFlexible
Output neurons1
Performance MAE, RMSE, RAE, RSE, R2
Goal0.001
Learning rate0.01
Momentum constant0.5
Table 5. Mean and standard deviations of nodes for gas module calibration.
Table 5. Mean and standard deviations of nodes for gas module calibration.
ParametersNode No. 1Node No. 2Commercial Equivalent
MeanSTDEVMeanSTDEVMeanSTDEV
Temperature (°C)26.421.0926.41.1226.361.06
Humidity (%)62.72.0862.932.3162.982.13
CH4 (ppm)545.6623.5454325.99544.824.68
CO2 (ppm)684.4681.76689.4680.78689.283.47
CO (ppm)22.241.4922.141.5922.251.51
H2S (ppm)35.610.4435.210.8834.9310.51
SO2 (ppm)19.24.2220.465.04204.52
NO2 (ppm)2.330.162.270.262.280.219
Table 6. Initial eigenvalues and total variance explained.
Table 6. Initial eigenvalues and total variance explained.
ComponentInitial EigenvaluesExtraction Sums of Squared LoadingsRotation Sums of Squared Loadings
TotalVar (%)Cum (%)TotalVar (%)Cum (%)TotalVar (%)Cum (%)
15.24965.61065.6105.24965.61065.6105.03162.89462.894
21.10113.76079.3701.10113.76079.3701.31816.47779.370
30.7989.97789.348
40.4245.30594.653
50.1682.10296.755
60.1231.54298.297
70.0861.07299.369
80.0510.631100.000
Var = variance, Cum = cumulative.
Table 7. Rotated component matrix and possible source identification.
Table 7. Rotated component matrix and possible source identification.
Component
12
Temperature0.8730.171
Humidity−0.7390.162
CH40.9080.203
CO20.824−0.084
CO0.0660.960
NO20.7720.129
SO20.8680.414
H2S0.9300.324
Table 8. Component transformation matrix for selected PCs.
Table 8. Component transformation matrix for selected PCs.
Component12
10.9730.229
2−0.2290.973
Table 9. Model validation with various neurons based on original dataset as input and PCA extracted components as input.
Table 9. Model validation with various neurons based on original dataset as input and PCA extracted components as input.
Model CharacteristicsNo. of Hidden NodesMAERMSERAERSER2
Original parameters80.20600.28760.67900.58280.4371
90.19040.27760.66490.58770.4372
100.18850.27270.66010.59360.4463
110.18970.26880.65790.61120.4737
120.17690.26330.65860.61490.4850
130.17340.26280.64440.61840.5165
140.17250.25760.64250.62280.5371
150.17040.25410.63960.62730.5526
160.16920.25060.63930.63650.5584
170.16910.24480.63400.63390.5806
180.16470.24300.63170.6240.5862
190.17680.25680.64570.62500.5749
200.18590.26180.63330.63000.5849
PCA extracted components30.1824230.2508380.788080.7010390.608961
40.1697450.2335750.7758040.6967840.633216
50.1619980.2237790.7638090.6872940.652706
60.15190.21040.76190.68180.6654
70.1574510.2288640.7626940.6950960.624904
80.163920.2436480.7867980.7023370.597663
Table 10. Performance indicators for MEI prediction models.
Table 10. Performance indicators for MEI prediction models.
ModelMAERMSERAERSER2
LR *0.18290.26530.61630.45470.5252
PCA-LR0.1790.2400.69950.49370.6063
ANN0.16470.24300.63170.6420.5862
Proposed model0.15190.21040.76190.68180.6654
* linear regression.

Share and Cite

MDPI and ACS Style

Jo, B.; Khan, R.M.A. An Internet of Things System for Underground Mine Air Quality Pollutant Prediction Based on Azure Machine Learning. Sensors 2018, 18, 930. https://0-doi-org.brum.beds.ac.uk/10.3390/s18040930

AMA Style

Jo B, Khan RMA. An Internet of Things System for Underground Mine Air Quality Pollutant Prediction Based on Azure Machine Learning. Sensors. 2018; 18(4):930. https://0-doi-org.brum.beds.ac.uk/10.3390/s18040930

Chicago/Turabian Style

Jo, ByungWan, and Rana Muhammad Asad Khan. 2018. "An Internet of Things System for Underground Mine Air Quality Pollutant Prediction Based on Azure Machine Learning" Sensors 18, no. 4: 930. https://0-doi-org.brum.beds.ac.uk/10.3390/s18040930

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop