Next Article in Journal
High Strength and Deformation Mechanisms of Al0.3CoCrFeNi High-Entropy Alloy Thin Films Fabricated by Magnetron Sputtering
Next Article in Special Issue
An Intuitionistic Evidential Method for Weight Determination in FMEA Based on Belief Entropy
Previous Article in Journal
Symmetry, Entropy, Diversity and (Why Not?) Quantum Statistics in Society
Previous Article in Special Issue
A Method for Detecting Dynamic Mutation of Complex Systems Using Improved Information Entropy
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Improving the Performance of Storage Tank Fault Diagnosis by Removing Unwanted Components and Utilizing Wavelet-Based Features

School of Electrical, Electronics and Computer Engineering, University of Ulsan, Ulsan 680-749, Korea
*
Author to whom correspondence should be addressed.
Submission received: 10 January 2019 / Revised: 24 January 2019 / Accepted: 3 February 2019 / Published: 4 February 2019
(This article belongs to the Special Issue Entropy-Based Fault Diagnosis)

Abstract

:
This paper proposes a reliable fault diagnosis model for a spherical storage tank. The proposed method first used a blind source separation (BSS) technique to de-noise the input signals so that the signals acquired from a spherical tank under two types of conditions (i.e., normal and crack conditions) were easily distinguishable. BSS split the signals into different sources that provided information about the noise and useful components of the signals. Therefore, an unimpaired signal could be restored from the useful components. From the de-noised signals, wavelet-based fault features, i.e., the relative energy (REWPN) and entropy (EWPN) of a wavelet packet node, were extracted. Finally, these features were used to train one-against-all multiclass support vector machines (OAA MCSVMs), which classified the instances of normal and faulty states of the tank. The efficiency of the proposed fault diagnosis model was examined by visualizing the de-noised signals obtained from the BSS method and its classification performance. The proposed fault diagnostic model was also compared to existing techniques. Experimental results showed that the proposed method outperformed conventional techniques, yielding average classification accuracies of 97.25% and 98.48% for the two datasets used in this study.

1. Introduction

Spherical storage tanks contain fluids, such as liquids and compressed gases, and are widely used in different industries (e.g., the petrochemical, chemical, and aerospace industries) [1,2]. These tanks can be used to store highly hazardous substances that can contaminate the environment easily. However, cracks can develop on the surface of these tanks due to aging, harsh environments, etc. Such cracks can threaten the integrity of the tanks. If a crack on the surface of a tank grows, and the openings of cracks become sufficiently wide, the contained hazardous substance can leak through the cracks and cause air and water pollution [3]. To avoid accidents, it is necessary to manage storage tanks responsibly and periodically inspect them. The traditional procedure for detecting cracks or corrosion in storage tanks is usually carried out on an annual basis via the following steps. First, the storage tank is turned off, emptied, and cleaned. Next, maintenance staff manually identifies any defects, such as cracks or leaks in the storage tank, using either an ultrasonic B-scan or magnetic flux leakage methods. However, the main disadvantages of these processes are the high cost and time-consuming nature.
In recent years, existing techniques that analyze the vibration acceleration signal, motor stator current, and acoustic emission (AE) signal have been widely used in diagnosing bearings [4,5] and commutator motors [6]. Out of these techniques, AE-based techniques are more effective for low speed rotational machines and storage tanks since they are sensitive to the low energy acoustic emissions [7,8]. Acoustic emissions (AEs) are elastic waves generated within a material. The use of acoustic emissions as an alternative tool to detect and locate seeded defects and corrosion in/on the surface of storage tanks, as well as other types of containers working under the pressure of vapor, liquid, or gas like pressure vessels, gas pipelines, and so on, has been adopted by researchers [9,10,11]. The main advantage of this method is that it does not severely interrupt the production process; devices only have to be turned off for a short-term inspection. Moreover, the cost of this method is relatively low compared with traditional methods, which helps businesses save time and money, while minimizing environmental damage. For these reasons, we used the AE signals acquired from the storage tank to determine its condition in this paper. However, due to the complications of the surrounding environment, as well as the limitations of the AE data acquisition device (DAQ), the recorded AE signals were contaminated with noise. As the surrounding environment became more complicated, the noise had a more significant effect on the distribution of the collected signals. Unlike the predictable fault signal, noise sources can produce almost any kind of signal and easily mask the useful signals that characterize the state of a device. These interfering sources can affect signals in both the time and frequency domains; therefore, features extracted from noise-buried signals do not resemble the features of noise-free signals. Moreover, these noise-contaminated features also have a negative impact on the performance of the classifier. To overcome this drawback, we suggested a de-noising method to isolate the original signal from the noise sources.
Traditional de-nosing methods usually rely on frequency characteristics to filter the signal; however, such methods are often prone to obvious disadvantages when used to de-noise non-stationary signals with broadband noise [12]. A state-of-the-art technique that surpasses traditional de-noising methods is the wavelet threshold de-noising method. Nevertheless, this method still does not produce optimal results due to the problem of overlapping frequencies and its difficulty in selecting an appropriate threshold [13]. Similarly, another notable method is empirical mode decomposition (EMD), which has significant noise-reduction ability, but runs into issues related to mode mixing and deficient enveloping [14]. In other words, contemporary de-noising methods are not especially efficient in isolating noise from short-time transient signals. The AE signal collected from sensors is the combination of a useful signal, which is directly related to the health status of a specific object, and diverse exterior and interior interference sources. However, diagnostic systems only utilize characterizing information contained in the useful signal to diagnose an object’s status. In this paper, therefore, we proposed a method to retrieve a noise-free signal that was associated with specific storage tank conditions. To do this, we first separated the observed signal into distinct signal sources using a blind source separation (BSS) technique. These sources were then analyzed to determine whether they were useful or interfering. In this study, we defined useful signal sources as ones that could differentiate between different conditions of the tank. Alternatively, the useless signals (i.e., noise) were sources that appeared every time, regardless of the condition of the tank. These may have been sources coming from the external environment or mechanical noise coming from the interior of the device itself. After removing useless sources, we restored the noise-free signal from the remaining useful components of the signal. Next, features were extracted from the noise-free signals; these were used to identify the different conditions of the tank.
Feature analysis -based fault diagnosis methods have long been used on industrial devices [15,16,17] and have been adopted to detect leakages, cracks, and corrosion in devices working under high pressure [18,19,20]. Typical fault diagnosis methods consist of two basic steps: fault feature calculation and fault classification. Feature calculation (or feature extraction) is an essential step in demonstrating signal statistical characteristics; it is often performed by analyzing the signal in the time domain, frequency domain, or time-frequency domain. In order to reveal underlying properties in the fault-characterizing signal, spectral kurtosis is one of the most cutting-edge methods and is extensively utilized. Its performance in identifying transients was manifested by Dwyer [21] in his early work. Based on the foundation of spectral kurtosis, Antoni and Randall developed its fast realization, the fast kurtogram [22]. The main idea of this work was to find the optimal demodulation band before the use of squared envelope analysis. Following this work, Lei [23,24] proposed the improved and the enhanced kurtograms based on the framework of wavelet packets to diagnose various bearing faults. Barszcz [25] introduced the idea of the protrugram, which calculated kurtosis from the envelope spectrum. From this study, Peter [26] introduced the sparsogram to faster determine the optimal demodulation band. Afterwards, Antoni [27] developed the fast kurtogram to a new method named the infogram, and Wang [28] extended the infograms to novel Bayesian inference. The purpose of the Bayesian inference is to determine optimal wavelet parameters, and thus able to conduct optimal wavelet filtering. Recently, Wang proposed the dynamic Bayesian wavelet transform [29]. The concept of this method was to utilize spectral kurtosis to initialize wavelet parameters and then use dynamic Bayesian inference to find the optimal wavelet transform. In addition to these methods, which try to find an optimal filtering band in order to extract short-time transient properties from the signal, there is another acclaimed approach which extracts features from wavelet coefficients of the wavelet packet transform (WPT). Such features were proved to be sensitive to fault occurrence and robust to different operating conditions. Numerous studies have experimentally demonstrated the effectiveness of these features in diagnosing industrial devices. Zarei and Poshtan, for instance, adopted the WPT to induce a motor’s stator current and calculated sub-frequency band energy to diagnose faults in bearings [30]. Boskoski and Juricic computed Renyi entropy values from coefficients of vibration signals as fault indexes to detect mechanical defects in rotational drives [31]. Li et al. developed a wavelet-based higher-order statistic method, in which Kurtosis values calculated from wavelet coefficients of both the WPT and discrete wavelet transform (DWT) were utilized to detect and classify bearing defects [32]. In this study, we utilized two normalized wavelet packets quantifiers: the relative energy in a wavelet packet node (REWPN) and the entropy in a wavelet packet node (EWPN) as input fault features. REWPN measured the normalized energy of the wavelet packets node, and EWPN defined the uncertainty of the normalized coefficients of the wavelet packets node. Unlike other methods that directly use the amplitude of coefficients, these new quantifiers were derived from probability distributions and were more robust in diagnostic applications [33].
Regarding fault classification, several methods have been introduced and achieved remarkable results, such as a neural network-based method [34,35], fuzzy logic [36,37], and other techniques [38,39]. However, their uses are limited in real-world applications. For instance, network-based classifiers require a sufficient training dataset to generalize and avoid overfitting to deal with complicated and nonlinear problems. On the other hand, although fuzzy logic-based techniques can provide diagnostic outcomes promptly without establishing an appropriate model, their learning abilities are restricted. To address these limitations, we employed one-against-all multiclass support vector machines (OAA MCSVMs), which offered greater classification accuracy with a limited training dataset compared to other classifiers (e.g., artificial neural networks) [40,41,42].
The remainder of this paper is organized as follows. Section 2 demonstrates the experimental test bed of the storage tank and the AE data acquisition system used in this study. Section 3 illustrates the elementary steps of the proposed method for diagnosing abnormal tank conditions. Section 4 exhaustively details the experiments carried out to validate the proposed method. Finally, our conclusions are provided in Section 5.

2. The Experimental Test Bed of the Storage Tank and Acoustic Emission Data Acquisition

Experimental data were collected from a test bed consisting of a spherical storage tank made of steel, which contained gasoline, as shown in Figure 1a. In addition, for more details, a schematic diagram is presented in Figure 2a. The tank had a volume of approximately 4.2 m3 and a diameter of 2 m. It was loaded under a pressure of 5 bar. In this experiment, we first record AE signals when there was no crack on the wall of the tank; this was done by utilizing four general-purpose wideband frequency AE sensors mounted on the surface of the tank at different locations, which were precisely indicated in the schematic diagram. After completing data acquisition for the crack-free tank condition, we artificially made a 3-mm crack on the surface of the tank bottom to simulate defective conditions in the tank. The shape of the crack is presented in Figure 1c and the distances between the sensors ( x 1 , x 2 , x 3 , x 4 ) and the crack were 825 mm, 750 mm, 1040 mm, and 430 mm, respectively. Repeating the above data acquisition procedure, we acquired data for the defective storage tank.
To validate the performance of the developed diagnosis model for the storage tank, we sampled the acquired AE signals at 1 MHz to capture very detailed information related to the storage tank condition. This is done with a peripheral component interconnect bus (PCI-2)-based system, as shown in Figure 1b. The connection diagram between sensors and the digitized system is described in Figure 2b, and the technical specifications of AE sensors and the PCI-2-based system are given in Table 1.

3. The Proposed Method for Diagnosing Abnormal Tank Conditions

The proposed method for detecting fault indications in a storage tank is illustrated in Figure 3, which consists of the following fundamental steps. First, noise-contaminated signals collected via observations were used as the input of a BSS-based denoising mode, where such original signals were split into individual signal sources. These individual sources were then analyzed to determine whether they were useful sources or interference sources. After removing the assumed noise sources, we reconstructed the noise-free signal from the remaining valuable components of the de-noised signal. In the proposed method, wavelet-based fault features that reflect symptoms of different conditions in the storage tank were then extracted from the reconstructed signal. The support vector machine (SVM) trained with such extracted wavelet-based features was then used to classify the “normal” and “crack” conditions of the tank.

3.1. Application of Blind Source Separation to Noise-Contaminated Acoustic Emission Signals

It is almost impossible to determine the number of individual sources that are mixed in a measured signal before signal analysis. Therefore, the specific signal separation model utilized in a particular case depends on the correlation between the number of observed signals (sensors) and the actual source (fault and noise) signals. These models can be categorized as determined BSS, over-determined BSS (OBSS), and under-determined BSS (UBSS). This categorization depends on whether the number of observations is equal to, greater than, or less than the number of signal sources, respectively. In practice, fault analysis-related problems usually fall into the case of under-determined BSS because the number of employed sensors is limited, and the noise sources are complex. Among popular approaches, sparse component analysis (SCA), which relies on the sparsity of sources in the time domain or other transformed domains, has been widely utilized to resolve under-determined BSS [43,44]. Several methods rely on the sparsity in the time–frequency domain to separate independent sources [45,46,47]. For instance, in Reference [45], value decomposition was performed on wavelet coefficients to determine frequencies and damping ratios, which makes this method faster than other edge detection approaches. In Reference [46], the principal component analysis was carried out on wavelet coefficients to create the partial mixing matrix of UBSS. In this study, we used the algorithm proposed by Qin [48], which uses a hybrid method combining References [49] and [50], to enhance the efficiency by estimating the mixing matrix and separating individual sources.

3.1.1. Algorithm Description

The most notable features of SCA are the cluster lines in the scattered space, which can be utilized to estimate the mixing matrix. Therefore, it is vital to examine these cluster-based features. Given a numerical example, three harmonic sources s 1 ( t ) = sin ( 2 π 0.8 t ) , s 2 ( t ) = sin ( 2 π 2.5 t ) , and s 3 ( t ) = sin ( 2 π 4.3 t ) are linearly unified into two observations: x 1 = s 1 + 0.3 s 2 + 0.8 s 3 and x 2 = 0.5 s 1 + 0.7 s 2 + 0.2 s 3 . Figure 4a shows the three time–domain observations in a scatter plot. It could be seen that no useful information about the three sources was obtained from this scatter. All the points in the plot were randomly scattered and did not form any distinct cluster that could be used to identify a specific source. Alternatively, when we converted the original signal into the time-frequency domain and plot the real values of the two observations, three distinct clusters were formed. These clusters can be seen as three lines in the scatter pot, where their directions correspond to the column vectors of the mixing matrix, as shown in Figure 4b. By estimating these directions, the mixing matrix can be determined effectively. The regular short-time Fourier transform (STFT) is given as:
G ( t , ω ) = + g ( u t ) s ( u ) e i ω u d u ,
where g ( u t ) is the step size used to move the window for STFT calculation and s ( u ) is the truncated signal. The modified STFT with an additional phase shift can be presented as:
G ( t , ω ) = + g ( u t ) s ( u ) e i ω ( u t ) d u .
According to Parseval’s theorem, the modified STFT can be rewritten as:
G ( t , ω ) = 1 2 π + g ^ ( ω ξ ) s ^ ( ξ ) e i ξ t d ξ ,
where g ^ ( ω ξ ) and s ^ ( ξ ) are the Fourier transforms (FTs) of the window function and the signal, respectively. With the purely harmonic signal s ( t ) , whose frequency is ω 0 , s ( t ) = A e i ω 0 t and s ^ ( ξ ) | ξ ω 0 = 0 , the STFT of the harmonic signal s ( t ) can be written as:
G ( t , ω ) = A g ^ ( ω ω 0 ) e i ω 0 t
For two harmonic signals with different frequencies, s 1 ( t ) = A 1 e i ω 1 t and s 2 ( t ) = A 2 e i ω 2 t , their STFT can be written as:
G s 1 ( t , ω ) = A 1 g ^ ( ω ω 1 ) e i ω 1 t G s 2 ( t , ω ) = A 2 g ^ ( ω ω 2 ) e i ω 2 t
Suppose x 1 ( t ) = a 11 s 1 ( t ) + a 12 s 2 ( t ) and x 2 ( t ) = a 21 s 1 ( t ) + a 22 s 2 ( t ) are two observations that are linearly mixed from the signal sources. According to the linearity property of STFT, the STFT of these two observations can be written as:
G x 1 ( t , ω ) = a 11 G s 1 ( t , ω ) + a 12 G s 2 ( t , ω ) = a 11 A 1 g ^ ( ω ω 1 ) e i ω 1 t + a 12 A 2 g ^ ( ω ω 2 ) e i ω 2 t . G x 2 ( t , ω ) = a 21 G s 1 ( t , ω ) + a 22 G s 2 ( t , ω ) = a 21 A 1 g ^ ( ω ω 1 ) e i ω 1 t + a 22 A 2 g ^ ( ω ω 2 ) e i ω 2 t
We assume | ω 1 ω 2 | > Δ , where Δ denotes the frequency support of the window. Therefore,
G s 1 ( t , ω ) 0 ,   G s 2 ( t , ω ) = 0 , ω ( ω 1 Δ / 2 ,   ω 1 + Δ / 2 ) G s 1 ( t , ω ) = 0 ,   G s 2 ( t , ω ) 0 , ω ( ω 2 Δ / 2 ,   ω 2 + Δ / 2 ) .
This means that there is no overlap between the two sources in the time-frequency domain. For ω ( ω 1 Δ / 2 , ω 1 + Δ / 2 ) , the ratio of the two signal sources in time frequency domain is given as:
G x 1 ( t , ω ) G x 2 ( t , ω ) = a 11 G s 1 ( t , ω ) a 12 G s 1 ( t , ω ) = a 11 a 12 ,
and, for ω ( ω 2 Δ / 2 , ω 2 + Δ / 2 ) , the ratio is:
G x 1 ( t , ω ) G x 2 ( t , ω ) = a 21 G s 1 ( t , ω ) a 22 G s 1 ( t , ω ) = a 21 a 22 .
Equations (8) and (9) indicate that, for the linear mixture, we can plot the STFT of two observations into the scatter, i.e., G ( t , ω ) x 1 versus G ( t , ω ) x 2 in each frequency bin. These lines have the same direction, which is determined by the amplitude of sources mixed in the observations, i.e., the column vector of the mixing matrix. This explains why some cluster lines appear in the scatter plot, which can be employed to obtain the mixing matrix.

3.1.2. Application of BSS to Noisy Acoustic Emission Signals

Classifiers only use characteristics related to the health status of a device to identify the condition of the device. Therefore, the idea in this paper was to separate the composite signals obtained from observations into individual signal sources. These individual sources could then be assessed to identify whether a source represents a useful signal or a noise source. Useful signals are ones that can highlight differences between the different conditions of the tank. In contrast, the useless signals, or noise, appear in all the conditions of the tank. Therefore, noise is eliminated, and only useful signals are retained. The obtained de-noised signals are then subjected to a feature extraction process to create a feature pool that can characterize different conditions of the storage tank.
In this study, AE signals were collected from four different sensors mounted on the wall of the storage tank. The time domain and frequency domain illustrations of these observations are presented in Figure 5 and Figure 6, respectively. From the time domain waveforms and frequency spectrograms, it was obvious that procuring useful information is difficult unless and until the signals have been preprocessed. To de-noise the acquired AE signals from the healthy and defective conditions of the tank, we used the BSS algorithm mentioned in Section 3.1.1. Through the BSS algorithm, we were able to decompose signals of faulty conditions into three individual signal sources. Likewise, signals of the normal condition could be split into two individual signal sources through the de-noising algorithm.
It can be seen in Figure 7 that the individual sources separated from both the faulty signal and normal signal were similar. This suggested that this source was not associated with either of the tank conditions; instead, it may have been caused by external interference. Therefore, it was safe to eliminate this signal source. In contrast, the remaining sources helped to improve the discriminative characteristics between the two tank conditions. Therefore, they were retained to reconstruct noise-free signals that could define the behavior of the different conditions of the tank. Figure 8 shows the waveforms of the recovered signals in the time domain. The recovered signals were also transformed into the frequency domain in order to better examine their characteristics, as presented in Figure 8. Later, wavelet transform features were extracted from the de-noised signals and provided to the classifier.

3.2. Feature Calculation

To collect AE signals for identifying non-stationary tank defects, AE sensors were attached to different locations of the tank wall. Due to the attenuation in the measured AE signals caused by the distance between crack sources and sensors, underlying information related to the storage tank failures mainly lies on the mid-frequency and high-frequency bands. The WPT, with its decomposition ability that can split a signal into low-frequency and high-frequency bands, has been widely utilized as a favorable time-frequency analysis tool for short-time transient AE signal processing. Therefore, in this study, WPT was used to analyze fault features by computing the REWPN and EWPN [51]. These fault features provided promising details regarding fault symptoms in fault diagnosis applications. To calculate these features, we decomposed a one-dimensional AE signal into eight wavelet packet nodes via three-level WPT with the help of Daubechies 20 (or db20). From Figure 8, it is noticeable that the inherited information regarding faults in the AE signals of the tank was mostly located in the mid-frequency and high-frequency bands. Thus, we used wavelet-based fault features (REWPN and EWPN) that were extracted from the last seven mid-frequency and high-frequency wavelet packet nodes, as shown in Figure 9. The REWPN of the nodes was calculated as follows:
R E W P N ( i ) = j = 1 K w i , j 2 n = 1 N t n o d e j = 1 K w n , j 2 ,
where N t n o d e was the total number of wavelet packet nodes analyzed in this study ( N t n o d e = 7 ), and w i , j was the j t h wavelet coefficient out of a total of K wavelet coefficients from the i t h wavelet packet node. The formula used to calculate EWPN was:
E W P N ( i ) = j = 1 K p i ( j ) log 2 p i ( j ) ,
where p i ( j ) = w i , j 2 / j = 1 K w i , j 2 . A total of 14 wavelet-based features, consisting of seven REWPNs and seven EWPNs, were utilized in this study to diagnose tank defects.

3.3. Fault Classification

An SVM is a discriminative classifier officially defined by a hyperplane that separates two classes with the largest margin in the feature space. To find the ideal hyperplane that separates two classes, we needed to solve the optimal minimization problem:
argmin w , ξ i { 1 2 w T w + C i = 1 n ξ i } , , subject   to y i ( w T ϕ ( x i ) + b ) 1 ξ i , ξ 0 , i = 1 , 2 ... , n
where w was the normal vector to the hyperplane, ϕ was a nonlinear function used to project the initial feature space into the high-dimensional nonlinear feature space, C was a penalty variable used to tune the generalization capability, and ξ i ’s were the slack variables used to regulate the training error. Using Lagrange multipliers, the optimal minimization problem can be solved as follows:
argmax α i { i = 1 n α i 1 2 i = 1 n j = 1 n α i α j y i y j K ( x i , x j ) } , , subject   to   i = 1 n α i y i = 0 , 0 α i C    i = 1 , 2 , ... , n
where α i s are Lagrange multipliers and x i and x j present samples in the training dataset. In this study, we used the Gaussian radial basis function (RBF) kernel function for the SVM due to its satisfactory performance. The RBF kernel is computed as follows:
k ( x i , x j ) = exp ( γ | | x i x j | | ) ,
where γ = 1 / 2 σ 2 and σ are adjustable parameters that can be cautiously tuned. If σ is small, the exponential is linear, and the higher dimensional mapping loses its nonlinear power. In contrast, if σ is large, the decision boundary is very sensitive to noise during training, which leads to a lack of regularization.
In this study, a one-against-all scheme was used to design multiclass support vector machines (OAA MCSVMs). This scheme was useful due to its simplicity and ability to solve multiclass classification problems. Each SVM structure in the OAA scheme separated one class from the others, and the SVM structure that offered the highest output value was selected to make the final decision. In addition, during the OAA MCSVMs training process, each SVM structure was individually evaluated to gain the highest classification accuracy for its own class. The SVM structures were analyzed together to make a final decision.

4. Experimental Result and Discussion

4.1. Configuration of Training and Testing Data

To validate the effectiveness of the proposed approach, we used two datasets corresponding to crack sizes of 3 mm and 6 mm on the storage tank wall (Table 2). Both datasets contained AE signals for a storage tank with a crack located on its wall and a defect-free tank, i.e., a total of two different tank conditions. For each dataset, the k-fold cross validation (k-cv) approach was employed to calculate the generalized classification accuracy. In k-cv, the dataset utilized for evaluating the diagnostic capability of the proposed method was first divided into k mutual folds, which were denoted as D 1 , D 2 , ... , D k . At the j-th iteration in k-cv, fold D j was reserved as the testing set, while the remaining folds were utilized to train OAA MCSVMs. The classification accuracy was successively estimated k times by training and testing OAA MCSVMs, and the final classification accuracy was the mean value of the accuracies obtained in each fold.
A k-cv scheme was used to evaluate the developed OAA MCSVMs, and the value of k was set to 3. Therefore, an initial sample set containing a total of 1080 samples from the two conditions of the tank (i.e., 540 samples for each condition) was split into three subsets (each fold included 360 samples for each condition). To determine the classification performance of the methodology, each of the three subsets was used as the testing set and the other two were used as the training set. The individual classification accuracy was determined by testing OAA MCSVMs in the given fold, and the final classification accuracy was the average value of the three individual accuracies calculated in each fold.

4.2. Efficacy of Removing Useless Elements

To demonstrate the effectiveness of the proposed scheme, we visualized the input signal in the frequency domain before and after removing noise, as shown in Figure 10. It was noticeable that the two tank conditions, i.e., a normal state and faulty state, were easily distinguishable after eliminating noise. To be specific, wavelet packet nodes from two types of signals showed a clear difference from each other. When a classifier was trained with such discriminating features, it yielded higher classification accuracy. Alternatively, original signals without preprocessing were contaminated with noise; therefore, these signals were undistinguishable.
In addition to visualization results, we also provided qualitative evaluation of the diagnosis system by classifying the performance when original signals without preprocessing were used with the model. In this study, we used the k-cv scheme to partition the dataset into k subsets, where k = 3 in this case, the individual classification accuracy was measured using two subsets for training and the remaining subset for testing OAA MCSVMs. The individual classification accuracy was calculated as follows:
C a c c u r a c y = L N T P N s a m p l e s × 100 ( % ) ,
where L was the number of classes, i.e., L = 2 ; N T P was the number of true positive (TP) instances, which was the total number of samples in class i of the testing set that were exactly classified as class i ; and N s a m p l e s was the total number of samples in the testing set, i.e., N s a m p l e s = 360 . The performance of the proposed tank fault diagnosis model was evaluated using the average classification accuracy (ACA), which was the mean value of the three individual accuracies calculated using the k-cv scheme. Table 3 presents the average classification accuracies of the fault diagnosis models. According to Table 3, it could be seen that the proposed fault diagnosis method using the signal de-noising scheme yielded better average classification accuracies than the one that did not use signal pre-processing steps. The noise-contamination was the main reason that the classifier faced difficulties in analyzing the fault information inherited in the original signals, which resulted in classification accuracy degradation.

4.3. Efficacy of the Wavelet-Based Features

In this study, the feature pool values created from wavelet-based attributes were used as inputs for the standard OAA MCSVMs. The reason behind this choice is that it can be used to catch underlying properties in short-time transient signals by splitting a signal into low-frequency and high-frequency bands. This decision was also made based on the apparent difference between frequency bands or wavelet packet nodes of two tank signal types, i.e., a normal tank signal and a defective tank signal. To validate our scheme, it was necessary to compare our model’s classification performance with a model that uses another fault feature set. The rival in this experiment was a fault feature vector with different statistical measures of the time and frequency domain [15]. Table 4 presents the diagnostic performance of the proposed model and the fault diagnosis model with a different feature pool (i.e., different time and frequency features). The diagnostic performance of all the methods was measured in terms of the ACA mentioned in Section 4.2. From the results given in Table 4, it could be concluded that the proposed model yielded greater diagnostic performance than the other method. This means that the feature set used in this study provided better discrimination between the signal types compared to the other feature set using different time and frequency features. The ACA of the proposed method was 98.48%, which showed a 4.76% improvement compared to the method which used the different types of feature vectors (i.e., 93.72%).
The effectiveness of the suggested input features is also evident in Figure 11, which shows the distribution of various features in the three-dimensional feature space. As shown in Figure 11a, the WPT-based features form distinct clusters helped the classifier to easily draw a hyperplane between the samples of different classes in order to classify them. In contrast, when we used features in the time and frequency domain, some samples that belong to different categories overlapped, as shown in Figure 11b, which deteriorated the diagnostic performance of the classifier.
Moreover, besides selecting the appropriate fault feature set, choosing good vales for the penalty and kernel parameters ( C , σ ) in OAA MCSVMs, as mentioned in Section 3.3, was also critical. We employed a grid search technique to decide the optimal combinations of these parameters in the SVM structure in terms of the classification performance. The grid search algorithm trained OAA MCSVMs with a pair ( C , σ ) in the cross product of the following two sets and evaluated their performances: C { 2 5 , 2 3 , 2 1 , 2 , 2 3 , 2 5 , 2 7 , 2 9 , 2 11 , 2 13 , 2 15 } and σ { 2 2 , 2 1 , 2 , 2 2 , 2 3 , 2 4 , 2 5 , 2 6 , 2 7 } . Finally, the pair ( C , σ ) , which helps OAA MCSVMs deliver the highest classification performance, was considered as the best combination of these parameters.

5. Conclusions

This paper proposed a new approach to effectively discriminate between two different types of signals taken from a spherical tank based on a BSS technique. This discrimination between the signals, therefore, improved the reliability of the fault diagnosis system developed for a storage tank. Moreover, this paper presented WPT-based fault features that had an ability to explore salient information in nonstationary AE signals. These features were then used to train OAA MCSVMs. Experimental results indicated that the proposed tank fault diagnosis method achieved a higher classification accuracy of 98.48% compared to other two-fault diagnosis models. In a fault diagnosis model without a signal preprocessing step and a model using a different feature set (i.e., features in the time and frequency domain), the yields were 90.38% and 93.72%, respectively.
In the future, we will explore several state-of-the-art techniques to compare and validate the effectiveness of the proposed method since some notable methods described in the introduction section can exploit hidden characteristics of such nonstationary signals.

Author Contributions

All the authors contributed equally to the conception of the idea, the design of experiments, the analysis and interpretation of results, and the writing and improvement of the manuscript.

Funding

This work was supported by the Korea Institute of Energy Technology Evaluation and Planning (KETEP) and the Ministry of Trade, Industry & Energy (MOTIE) of the Republic of Korea (No. 20162220100050).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Luo, T.; Wu, C.; Duan, L. Fishbone diagram and risk matrix analysis method and its application in safety assessment of natural gas spherical tank. J. Clean. Prod. 2018, 174, 296–304. [Google Scholar] [CrossRef]
  2. Morofuji, K.; Tsui, N.; Yamada, M.; Maie, A.; Yuyama, S.; Li, Z. Quantitative study of acoustic emission due to leaks from water tanks. System 2003, 5, 228–236. [Google Scholar]
  3. Wilson, S.; Zhang, H.; Burwell, K.; Samantapudi, A.; Dalemarre, L.; Jiang, C.; Rice, L.; Williams, E.; Naney, C. Leaking underground storage tanks and environmental injustice: Is there a hidden and unequal threat to public health in South Carolina? Environ. Justice 2013, 6, 175–182. [Google Scholar] [CrossRef] [PubMed]
  4. Lu, S.; Zhou, P.; Wang, X.; Liu, Y.; Liu, F.; Zhao, J. Condition monitoring and fault diagnosis of motor bearings using undersampled vibration signals from a wireless sensor network. J. Sound Vib. 2018, 414, 81–96. [Google Scholar] [CrossRef]
  5. Lu, S.; He, Q.; Zhao, J. Bearing fault diagnosis of a permanent magnet synchronous motor via a fast and online order analysis method in an embedded system. Mech. Syst. Signal Process. 2018, 113, 36–49. [Google Scholar] [CrossRef]
  6. Glowacz, A. Acoustic-based fault diagnosis of commutator motor. Electronics 2018, 7, 299. [Google Scholar] [CrossRef]
  7. Caesarendra, W.; Kosasih, B.; Tieu, A.K.; Zhu, H.; Moodie, C.A.; Zhu, Q. Acoustic emission-based condition monitoring methods: Review and application for low speed slew bearing. Mech. Syst. Signal Process. 2016, 72, 134–159. [Google Scholar] [CrossRef]
  8. Glowacz, A. Fault Detection of Electric Impact Drills and Coffee Grinders Using Acoustic Signals. Sensors 2019, 19, 269. [Google Scholar] [CrossRef]
  9. Martin, G.; Dimopoulos, J.; Cacic, J. Acoustic Emission for Tank Bottom Monitoring. In Proceedings of the Advanced Materials Research, Melbourne, Australia, 5–7 December 2012; pp. 499–509. [Google Scholar]
  10. Cole, P.; Watson, J. Acoustic emission for corrosion detection. In Proceedings of the Advanced Materials Research, Bahrain, Manama, 27–30 November 2005; pp. 231–236. [Google Scholar]
  11. Kwon, J.-R.; Lyu, G.-J.; Lee, T.-H.; Kim, J.-Y. Acoustic emission testing of repaired storage tank. Int. J. Press. Vessel. Pip. 2001, 78, 373–378. [Google Scholar] [CrossRef]
  12. Chen, S.-W.; Chen, Y.-H. Hardware design and implementation of a wavelet de-noising procedure for medical signal preprocessing. Sensors 2015, 15, 26396–26414. [Google Scholar] [CrossRef]
  13. Nguyen, P.; Kang, M.; Kim, J.-M.; Ahn, B.-H.; Ha, J.-M.; Choi, B.-K. Robust condition monitoring of rolling element bearings using de-noising and envelope analysis with signal decomposition techniques. Expert Syst. Appl. 2015, 42, 9024–9032. [Google Scholar] [CrossRef]
  14. Lei, Y.; Li, N.; Lin, J.; Wang, S. Fault diagnosis of rotating machinery based on an adaptive ensemble empirical mode decomposition. Sensors 2013, 13, 16950–16964. [Google Scholar] [CrossRef]
  15. Tra, V.; Kim, J.; Khan, S.A.; Kim, J.-M. Incipient fault diagnosis in bearings under variable speed conditions using multiresolution analysis and a weighted committee machine. J. Acoust. Soc. Am. 2017, 142, EL35–EL41. [Google Scholar] [CrossRef] [PubMed]
  16. Tra, V.; Kim, J.; Khan, S.A.; Kim, J.-M. Bearing fault diagnosis under variable speed using convolutional neural networks and the stochastic diagonal levenberg-marquardt algorithm. Sensors 2017, 17, 2834. [Google Scholar] [CrossRef] [PubMed]
  17. Tra, V.; Khan, S.A.; Kim, J.-M. Diagnosis of bearing defects under variable speed conditions using energy distribution maps of acoustic emission spectra and convolutional neural networks. J. Acoust. Soc. Am. 2018, 144, EL322–EL327. [Google Scholar] [CrossRef]
  18. Xiao, Q.; Li, J.; Bai, Z.; Sun, J.; Zhou, N.; Zeng, Z. A small leak detection method based on VMD adaptive de-noising and ambiguity correlation classification intended for natural gas pipelines. Sensors 2016, 16, 2116. [Google Scholar] [CrossRef]
  19. Sohaib, M.; Islam, M.; Kim, J.; Jeon, D.-C.; Kim, J.-M. Leakage Detection of a Spherical Water Storage Tank in a Chemical Industry Using Acoustic Emissions. Appl. Sci. 2019, 9, 196. [Google Scholar] [CrossRef]
  20. Islam, M.; Sohaib, M.; Kim, J.; Kim, J.-M. Crack Classification of a Pressure Vessel Using Feature Selection and Deep Learning Methods. Sensors 2018, 18, 4379. [Google Scholar] [CrossRef]
  21. Dwyer, R. Detection of non-Gaussian signals by frequency domain kurtosis estimation. In Proceedings of the ICASSP′83 IEEE International Conference on Acoustics, Speech, and Signal, Boston, MA, USA, 14–16 April 1983; pp. 607–610. [Google Scholar]
  22. Antoni, J. Fast computation of the kurtogram for the detection of transient faults. Mech. Syst. Signal Process. 2007, 21, 108–124. [Google Scholar] [CrossRef]
  23. Lei, Y.; Lin, J.; He, Z.; Zi, Y. Application of an improved kurtogram method for fault diagnosis of rolling element bearings. Mech. Syst. Signal Process. 2011, 25, 1738–1749. [Google Scholar] [CrossRef]
  24. Wang, D.; Peter, W.T.; Tsui, K.L. An enhanced Kurtogram method for fault diagnosis of rolling element bearings. Mech. Syst. Signal Process. 2013, 35, 176–199. [Google Scholar] [CrossRef]
  25. Barszcz, T.; JabŁoński, A. A novel method for the optimal band selection for vibration signal demodulation and comparison with the Kurtogram. Mech. Syst. Signal Process. 2011, 25, 431–451. [Google Scholar] [CrossRef]
  26. Peter, W.T.; Wang, D. The design of a new sparsogram for fast bearing fault diagnosis: Part 1 of the two related manuscripts that have a joint title as “Two automatic vibration-based fault diagnostic methods using the novel sparsity measurement—Parts 1 and 2”. Mech. Syst. Signal Process. 2013, 40, 499–519. [Google Scholar]
  27. Antoni, J. The infogram: Entropic evidence of the signature of repetitive transients. Mech. Syst. Signal Process. 2016, 74, 73–94. [Google Scholar] [CrossRef]
  28. Wang, D. An extension of the infograms to novel Bayesian inference for bearing fault feature identification. Mech. Syst. Signal Process. 2016, 80, 19–30. [Google Scholar] [CrossRef]
  29. Wang, D.; Tsui, K.-L. Dynamic Bayesian wavelet transform: New methodology for extraction of repetitive transients. Mech. Syst. Signal Process. 2017, 88, 137–144. [Google Scholar] [CrossRef]
  30. Zarei, J.; Poshtan, J. Bearing fault detection using wavelet packet transform of induction motor stator current. Tribol. Int. 2007, 40, 763–769. [Google Scholar] [CrossRef]
  31. Boskoski, P.; Juricic, D. Fault detection of mechanical drives under variable operating conditions based on wavelet packet Renyi entropy signatures. Mech. Syst. Signal Process. 2012, 31, 369–381. [Google Scholar] [CrossRef]
  32. Li, F.; Meng, G.; Ye, L.; Chen, P. Wavelet transform-based higher-order statistics for fault diagnosis in rolling element bearings. J. Vib. Control 2008, 14, 1691–1709. [Google Scholar]
  33. Feng, Y.; Schlindwein, F.S. Normalized wavelet packets quantifiers for condition monitoring. Mech. Syst. Signal Process. 2009, 23, 712–723. [Google Scholar] [CrossRef]
  34. Barbosa, F.R.; Almeida, O.M.; Braga, A.P.; Amora, M.A.; Cartaxo, S.J. Application of an artificial neural network in the use of physicochemical properties as a low cost proxy of power transformers DGA data. IEEE Trans. Dielectr. Electr. Insul. 2012, 19, 239–246. [Google Scholar] [CrossRef]
  35. Ghoneim, S.S.; Taha, I.B. Artificial neural networks for power transformers fault diagnosis based on IEC code using dissolved gas analysis. Int. J. Control Autom. Syst. 2015, 4, 18–21. [Google Scholar]
  36. Khan, S.A.; Equbal, M.D.; Islam, T. A comprehensive comparative study of DGA based transformer fault diagnosis using fuzzy logic and ANFIS models. IEEE Trans. Dielectr. Electr. Insul. 2015, 22, 590–596. [Google Scholar] [CrossRef]
  37. Wang, T.; Zhang, G.; Zhao, J.; He, Z.; Wang, J.; Pérez-Jiménez, M.J. Fault diagnosis of electric power systems based on fuzzy reasoning spiking neural P systems. IEEE Trans. Power Syst. 2015, 30, 1182–1194. [Google Scholar] [CrossRef]
  38. Peimankar, A.; Weddell, S.J.; Jalal, T.; Lapthorn, A.C. Evolutionary multi-objective fault diagnosis of power transformers. Swarm Evol. Comput. 2017, 36, 62–75. [Google Scholar] [CrossRef]
  39. Dai, J.; Song, H.; Sheng, G.; Jiang, X. Dissolved gas analysis of insulating oil for power transformer fault diagnosis with deep belief network. IEEE Trans. Dielectr. Electr. Insul. 2017, 24, 2828–2835. [Google Scholar] [CrossRef]
  40. Zhang, Y.; Wang, P.; Ni, T.; Cheng, P.; Lei, S. Wind power prediction based on LS-SVM model with error correction. Adv. Electr. Comput. Eng. 2017, 17, 3–9. [Google Scholar] [CrossRef]
  41. Zhang, Y.; Wang, P.; Zhang, C.; Lei, S. Wind energy prediction with LS-SVM based on Lorenz perturbation. J. Eng. 2017, 2017, 1724–1727. [Google Scholar] [CrossRef]
  42. Wilk-Kolodziejczyk, D.; Regulski, K.; Gumienny, G. Comparative analysis of the properties of the nodular cast iron with carbides and the austempered ductile iron with use of the machine learning and the support vector machine. Int. J. Adv. Manuf. Technol. 2016, 87, 1077–1093. [Google Scholar] [CrossRef]
  43. Bofill, P.; Zibulevsky, M. Underdetermined blind source separation using sparse representations. Signal Process. 2001, 81, 2353–2362. [Google Scholar] [CrossRef]
  44. Li, Y.; Amari, S.-I.; Cichocki, A.; Ho, D.W.; Xie, S. Underdetermined blind source separation based on sparse representation. IEEE Trans. Signal Process. 2006, 54, 423–437. [Google Scholar]
  45. Le, T.-P.; Paultre, P. Modal identification based on the time–frequency domain decomposition of unknown-input dynamic tests. Int. J. Mech. Sci. 2013, 71, 41–50. [Google Scholar] [CrossRef]
  46. Sadhu, A.; Hazra, B.; Narasimhan, S.; Pandey, M. Decentralized modal identification using sparse blind source separation. Smart Mater. Struct. 2011, 20, 125009. [Google Scholar] [CrossRef]
  47. Hazra, B.; Sadhu, A.; Roffel, A.J.; Narasimhan, S. Hybrid time-frequency blind source separation towards ambient system identification of structures. Comput.-Aided Civ. Infrastruct. Eng. 2012, 27, 314–332. [Google Scholar] [CrossRef]
  48. Qin, S.; Guo, J.; Zhu, C. Sparse component analysis using time-frequency representations for operational modal analysis. Sensors 2015, 15, 6497–6519. [Google Scholar] [CrossRef] [PubMed]
  49. Yang, Y.; Nagarajaiah, S. Output-only modal identification with limited sensors using sparse component analysis. J. Sound Vib. 2013, 332, 4741–4765. [Google Scholar] [CrossRef]
  50. Yu, K.; Yang, K.; Bai, Y. Estimation of modal parameters using the sparse component analysis based underdetermined blind source separation. Mech. Syst. Signal Process. 2014, 45, 302–316. [Google Scholar] [CrossRef]
  51. Yan, R.; Gao, R.X.; Chen, X. Wavelets for fault diagnosis of rotary machines: A review with applications. Signal Process. 2014, 96, 1–15. [Google Scholar] [CrossRef]
Figure 1. (a) The storage tank test bed, (b) the data acquisition system using the PCI-2 device, and (c) the shape of the crack introduced on the test bed.
Figure 1. (a) The storage tank test bed, (b) the data acquisition system using the PCI-2 device, and (c) the shape of the crack introduced on the test bed.
Entropy 21 00145 g001
Figure 2. (a) Pictorial interpretation of the experimental setup. (b) The data acquisition system where PC is a personal computer, PCI-2 represents two channels peripheral component interconnect, and WDI-AST is a wideband differential acoustic emission (AE) sensor.
Figure 2. (a) Pictorial interpretation of the experimental setup. (b) The data acquisition system where PC is a personal computer, PCI-2 represents two channels peripheral component interconnect, and WDI-AST is a wideband differential acoustic emission (AE) sensor.
Entropy 21 00145 g002
Figure 3. Flowchart of the proposed method for the diagnosis of a storage tank.
Figure 3. Flowchart of the proposed method for the diagnosis of a storage tank.
Entropy 21 00145 g003
Figure 4. Scatter space of two observations with three sources in (a) the time domain and (b) the time-frequency domain by STFT.
Figure 4. Scatter space of two observations with three sources in (a) the time domain and (b) the time-frequency domain by STFT.
Entropy 21 00145 g004
Figure 5. The time domain wave of observations: (a) crack condition and (b) normal condition.
Figure 5. The time domain wave of observations: (a) crack condition and (b) normal condition.
Entropy 21 00145 g005
Figure 6. The frequency spectrum of observations: (a) crack condition and (b) normal condition.
Figure 6. The frequency spectrum of observations: (a) crack condition and (b) normal condition.
Entropy 21 00145 g006
Figure 7. The time domain waves and spectra of recovered sources: (a) crack condition and (b) normal condition.
Figure 7. The time domain waves and spectra of recovered sources: (a) crack condition and (b) normal condition.
Entropy 21 00145 g007
Figure 8. The time domain waves and spectra of synthetic signals: (a) crack condition and (b) normal condition.
Figure 8. The time domain waves and spectra of synthetic signals: (a) crack condition and (b) normal condition.
Entropy 21 00145 g008
Figure 9. Three-level wavelet packet transform (WPT) on an AE signal.
Figure 9. Three-level wavelet packet transform (WPT) on an AE signal.
Entropy 21 00145 g009
Figure 10. The differences in input signals after being processed.
Figure 10. The differences in input signals after being processed.
Entropy 21 00145 g010
Figure 11. Feature space of (a) wavelet-based features and (b) time domain-based features.
Figure 11. Feature space of (a) wavelet-based features and (b) time domain-based features.
Entropy 21 00145 g011
Table 1. Detailed specifications of the data acquisition system.
Table 1. Detailed specifications of the data acquisition system.
AE sensor (PAC WSα)
Peak sensitivity (V/ μ bar): −62 dB
Operating frequency range: 100–900 kHz
Resonant frequency: 650 kHz
Directionality: ±1.5 dB
Two-channel AE PCI board
18-bit 40 MHz A/D conversion
10 M samples/s rate as one channel is used (5 M samples/s as two channels are used simultaneously)
Table 2. Details of the two datasets utilized to evaluate the proposed methodology.
Table 2. Details of the two datasets utilized to evaluate the proposed methodology.
fs = 1 MHzNo. Samples 1Crack Size
Length (mm)Width (mm)Depth (mm)
Dataset 1Training set108030.50.4
Testing set0.50.4
Dataset 2Training set108060.70.5
Testing set0.70.5
1 For each tank condition, i.e., a defect-free tank and defective tank, 540 feature vectors are used.
Table 3. The average classification accuracies (ACAs) of the proposed method and the fault diagnosis model without using signal pre-processing.
Table 3. The average classification accuracies (ACAs) of the proposed method and the fault diagnosis model without using signal pre-processing.
DatasetsMethodologiesAverage Accuracy (%)
Dataset 1[*]90.15
Proposed97.25
Dataset 2[*]90.38
Proposed98.47
[*] is the classification model that uses a noise-contaminated signal as the input.
Table 4. The average classification accuracies (ACAs) the proposed method and the counterpart.
Table 4. The average classification accuracies (ACAs) the proposed method and the counterpart.
DatasetsMethodologiesAccuracy (%)
Dataset 1[**]92.35
Proposed97.25
Dataset 2[**]93.72
Proposed98.48
[**] is the classification model that uses time and frequency domain-based fault features.

Share and Cite

MDPI and ACS Style

Tra, V.; Duong, B.-P.; Kim, J.-Y.; Sohaib, M.; Kim, J.-M. Improving the Performance of Storage Tank Fault Diagnosis by Removing Unwanted Components and Utilizing Wavelet-Based Features. Entropy 2019, 21, 145. https://0-doi-org.brum.beds.ac.uk/10.3390/e21020145

AMA Style

Tra V, Duong B-P, Kim J-Y, Sohaib M, Kim J-M. Improving the Performance of Storage Tank Fault Diagnosis by Removing Unwanted Components and Utilizing Wavelet-Based Features. Entropy. 2019; 21(2):145. https://0-doi-org.brum.beds.ac.uk/10.3390/e21020145

Chicago/Turabian Style

Tra, Viet, Bach-Phi Duong, Jae-Young Kim, Muhammad Sohaib, and Jong-Myon Kim. 2019. "Improving the Performance of Storage Tank Fault Diagnosis by Removing Unwanted Components and Utilizing Wavelet-Based Features" Entropy 21, no. 2: 145. https://0-doi-org.brum.beds.ac.uk/10.3390/e21020145

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop