Next Article in Journal
Entropic Dynamics
Next Article in Special Issue
Wavelet Entropy Automatically Detects Episodes of Atrial Fibrillation from Single-Lead Electrocardiograms
Previous Article in Journal
Distributing Secret Keys with Quantum Continuous Variables: Principle, Security and Implementations
Previous Article in Special Issue
Combined Power Quality Disturbances Recognition Using Wavelet Packet Entropies and S-Transform
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Optimal Base Wavelet Selection for ECG Noise Reduction Using a Comprehensive Entropy Criterion

Department of Electrical Engineering, College of Information, Mechanical and Electrical Engineering, Shanghai Normal University, Shanghai 200234, China
*
Author to whom correspondence should be addressed.
Entropy 2015, 17(9), 6093-6109; https://0-doi-org.brum.beds.ac.uk/10.3390/e17096093
Submission received: 29 May 2015 / Revised: 12 August 2015 / Accepted: 17 August 2015 / Published: 1 September 2015
(This article belongs to the Special Issue Wavelet Entropy: Computation and Applications)

Abstract

:
The selection of an appropriate wavelet is an essential issue that should be addressed in the wavelet-based filtering of electrocardiogram (ECG) signals. Since entropy can measure the features of uncertainty associated with the ECG signal, a novel comprehensive entropy criterion Ecom based on multiple criteria related to entropy and energy is proposed in this paper to search for an optimal base wavelet for a specific ECG signal. Taking account of the decomposition capability of wavelets and the similarity in information between the decomposed coefficients and the analyzed signal, the proposed Ecom criterion integrates eight criteria, i.e., energy, entropy, energy-to-entropy ratio, joint entropy, conditional entropy, mutual information, relative entropy, as well as comparison information entropy for optimal wavelet selection. The experimental validation is conducted on the basis of ECG signals of sixteen subjects selected from the MIT-BIH Arrhythmia Database. The Ecom is compared with each of these eight criteria through four filtering performance indexes, i.e., output signal to noise ratio (SNRo), root mean square error (RMSE), percent root mean-square difference (PRD) and correlation coefficients. The filtering results of ninety-six ECG signals contaminated by noise have verified that Ecom has outperformed the other eight criteria in the selection of best base wavelets for ECG signal filtering. The wavelet identified by the Ecom has achieved the best filtering performance than the other comparative criteria. A hypothesis test also validates that SNRo, RMSE, PRD and correlation coefficients of Ecom are significantly different from those of the shape-matched approach (α = 0.05, two-sided t- test).

1. Introduction

Cardiovascular disease is one of the most causes of death in the world. With the aging trend of the population, people are paying more and more attention to research into telemedicine systems for the immediate and accurate detection of cardiac diseases [1]. As a noninvasive test for recording the electric activity of the heart, electrocardiogram (ECG) plays a vital role in cardiac telemedicine systems. The assessment of alterations in the features of ECG signals provides useful information for the detection, diagnosis and treatment of cardiac diseases. However, during the ECG signal acquisition and transmission procedures, the sampled ECG signal is inevitably corrupted by various noises, such as baseline wander, electrode motion, power line interference, motion artifact and so on [2]. Usually, some specific measures such as median filter and band-stop filter can be implemented to suppress the influence of baseline wander and power line interference existing in ECG signals, respectively. However, electromagnetic disturbances such as thermal noise existing in measurement circuits have a significant influence on ECG signals. Thus, the noise reduction of ECG signals is a key requirement prior to pathological feature analysis [3].
Among a variety of filtering techniques, the wavelet transform has been proven as a useful tool for ECG signal denoising due to its powerful analysis ability in both the time and frequency domains. Moreover, the abundance of the base wavelets developed over the past decades is also another prominent advantage for the enhancement of various ECG signals. Nevertheless, since ECG signals have diverse wave shapes from individual to individual, how to choose an optimal base wavelet for analyzing a specific signal has become a crucial problem in the wavelet transform application domain of [4].
In recent years, the topic of base wavelet selection has been addressed by researchers from different points of view. Theoretically, a good base wavelet for noise reduction should satisfy the properties of orthogonality, symmetry, compact support and regularity so as to ensure the smoothness of the denoised signal without distortion. The orthogonal wavelets are not redundant and are suitable for signal or image denoising and compression [5], whilst the regularity is essential for getting nice features, e.g., the smoothness of the reconstructed signal or image, and function estimation in nonlinear regression analysis [6]. In addition, shape matching is also an important issue investigated in the stage of base wavelet selection. Bhatia et al. [7] used both orthogonality and the property of complexity to guide the base wavelet choosing procedure for ECG signal analysis. Singh et al. [6] introduced a cross correlation coefficient between the ECG signal and a selected wavelet filter for the selection of an appropriate base wavelet. In their method, in terms of root mean square error (RMSE), root means square bias (RMSB), and L1 norm, the Daubechies mother wavelet of order 8 was validated as the most appropriate wavelet basis function for denoising applications. However, the basis properties of a wavelet only qualitatively determine its suitability for a particular application. As far as shape matching is concerned, it is generally difficult to accurately match the shape of a signal to that of a base wavelet through a visual comparison [4]. As a result, quite a few of ECG denoising schemes implement the choice of the best base wavelet through the noise suppression evaluation criteria, such as percentage root mean square difference (PRD) [8], signal to noise ratio (SNR) [9,10] and mean square error (MSE) [11]. Furthermore, in order to maximize the filtering performance, a genetic algorithm [12] based on SNR and PRD is also applied to the wavelet transform for base selection.
However, these filtering performance measures can only be obtained by comparing the noisy ECG signals and denoised ones. A great deal of filtering tests has to be done before removing the noise from all the signals, which inevitably increases the computational complexity of noise reduction. It is well known that entropy is a measure that can be used to evaluate the uncertainty associated with a random variable, i.e., the expected value of information in a message. Besides, Shannon entropy is one of the most important metrics in information theory. Regarding a sampled ECG signal as a message sequence, the characteristics of the ECG signal can be defined by its entropy. Therefore, in order to automatically find the best base wavelet in a simple and easy way, a comprehensive entropy measure (Ecom) based on the integration of multiple entropy criteria including the energy to entropy ratio, relative entropy, joint entropy, conditional entropy, mutual information and comparison information entropy is proposed in this paper. In terms of the proposed index Ecom, the best base wavelet is chosen from 22 widely used wavelets for wavelet thresholding filtering (WTF) of ECG signals. Then the comprehensive analysis of denoising results will also be conducted for the evaluation of the proposed approach through the ECG data derived from the MIT database. Meanwhile, four filtering measures, i.e., signal to noise ratio (SNRo), Root Mean square Error (RMSE), Percent Root Difference (PRD), as well as Correlation coefficient (r) are utilized for comparison so as to evaluate the performance of the proposed approach.
This paper is organized as follows: the principle of wavelet thresholding filtering is briefly described in Section 2. Then, the proposed comprehensive entropy measure is illustrated in Section 3. Subsequently, the analysis and comparison of ECG signal filtering based on multiple criteria is presented in Section 4. Finally, concluding remarks are provided in Section 5.

2. Discrete Wavelet Thresholding Denoising of ECG Signals

Suppose S ˜ ( n ) = [ s ˜ 0 , s ˜ 1 , ... , s ˜ N ] is the observed data vector of a noisy ECG signal, and s ˜ i is given by: s ˜ i = s i + n i , i = 0, 1, 2, ... , N−1, where si is a signal function S to be recovered and n is the Gaussian white noise with independent and identical distribution N ( 0 , σ ) , which mimics the electromagnetic noise in measurement environment N(0,σ). The objective of wavelet filtering is to achieve enhanced signal by remaining useful components and removing the noise. Through discrete wavelet transform (DWT), a discrete signal S ˜ ( n ) can be expanded as follows:
S ˜ ( n ) = k Z a 2 j ( k ) ϕ 2 j ( n 2 j k ) + k Z d 2 j ( k ) ψ 2 j ( n 2 j k )
where n, k, j represent the sample number, the number of wavelet coefficients and the decomposition scale, respectively. The frequency spectrum of S ˜ ( n ) is divided into a high frequency sub-band and low frequency sub-band as the decomposition scale increases. The approximated component of S ˜ ( n ) is calculated by multiplying the scale function ϕ 2 j ( n 2 j k ) by scale factors a 2 j ( k ) . Similarly, the product of wavelet function ψ 2 j ( n 2 j k ) and translation factors d 2 j ( k ) represents the detailed component of S ˜ ( n ) .
After wavelet decomposition, the WTF is also involved in deleting the wavelet coefficients of the noisy signal whose modulus is below a threshold and reconstructing the denoised signal from the remaining coefficients. To summarize, the conventional wavelet thresholding filtering method includes the following three steps [3,13]:
(1)
Choose an optimal base wavelet according to the noisy signal. Then, decompose the noisy signal into sub-bands by wavelet transform to yield coarse level scaling coefficients and wavelet coefficients.
(2)
Estimate the noise threshold T and design the threshold shrinkage function according to the requirements of noise reduction. Then, filter the noise by applying the shrinkage function to every wavelet coefficient.
(3)
Reconstruct the signal with new coefficients to obtain the estimated signal S ^ .
At present, soft thresholding is a commonly-used wavelet shrinkage function [14,15,16] which has better signal filtering performance than hard thresholding. It can be described as follows:
y S ( c ) = { 0 | c | T sgn ( c ) ( | c | T ) | c | > T
where c is the wavelet coefficient of S ^ after wavelet decomposition. T is the Universal threshold proposed by Donoho and Johnstone [17]. It can be expressed as:
T = σ ^ 2 log ( N )
where N is the number of samples in the signal vector. σ ^ is the estimate of standard deviation of noise and σ ^ = M A D / 0.6745 , where the MAD denotes the median absolute deviation of wavelet coefficients. It should be noted that no matter what kind of shrinkage function is applied, the selection of an optimal wavelet function for a given ECG will directly affect the results of wavelet thresholding in the end. On account of the diversity of wavelets, how to choose a base wavelet that is best suited for analyzing a special signal is an important issue in the filtering procedure. Therefore, the base wavelet selection based on a comprehensive entropy criterion is proposed in this paper for removing noise from ECG signals.

3. Comprehensive Entropy Criterion for Optimal Wavelet Selection

From the viewpoint of signal filtering, the objective of noise cancellation is to estimate a function S with minimum mean square error (MSE), i.e., to minimize L2 risk for a given noisy function:
R ( S ^ , S ) = 1 N S ^ S 2 = 1 N i = 0 N 1 ( s ^ i s i ) 2
By considering representations of such functions in orthonormal bases such as wavelets, the Parseval relation establishes the equivalence between the L2 risk in the function space and the l2 risk of the wavelet coefficients of the functions, and consequently the related theorems can be proved in the sequence space of wavelet coefficients rather than the function space itself [18]. Moreover, when a signal to be analyzed is decomposed by the discrete wavelet transform, its characteristics are mainly depicted by its wavelet coefficients. Since an ECG signal is a periodic cardiac signal, the corresponding ECG wavelet coefficients generated by the best base wavelet should reveal a certain regularity in its distribution. Simultaneously, the wavelet coefficients of the decomposed ECG signal should be closely related to the sample sequence of the original signal so as to recover the main variation of an ECG signal. Because of the diverse characteristics and applicability of base wavelets, not all wavelets are suitable for denoising a specific ECG signal. Hence, an optimal base wavelet for a specific ECG signal can be determined by the evaluation of decomposing ability of wavelets and information comparison between the wavelet decomposed coefficients and the sample sequence of the original ECG signal.

3.1. Decomposing Capability Measure

The energy of a signal is one of the measures that characterize the feature of a signal. For a noisy ECG signal S ˜ ( n ) = [ s ˜ 0 , s ˜ 1 , ... , s ˜ N ] , suppose C ( n ) = [ c 0 , c 1 , ... , c N ] is its wavelet coefficient sequence at decomposing scale j after the DWT. The energy of C(n) can be defined by its wavelet coefficients as follows:
E e n e r g y ( c ) = i = 1 N | c ( j , i ) | 2
where c ( i , j ) is the ith coefficient of the jth level of decomposed S ˜ ( n ) . For the same amount of energy within a frequency sub-band, the specific features of the signal may be significantly different. The spectral distribution of the energy needs to be considered to ensure the effective feature extraction. In information theory, Shannon entropy is a measure of uncertainty associated with random variables. Thus, the energy distribution of wavelet coefficients can be described by Shannon entropy as follows:
E e n t r o p y ( c ) = i = 1 N p i log 2 p i
where pi is the energy probability distribution of wavelet coefficients, expressed as:
P i = | c ( i ) | 2 E e n e r g y ( c )
and i = 1 n p i = 1 . If p i = 0 , then p i log 2 p i = 0 . E e n t r o p y ( c ) is bounded by 0 E e n t r o p y ( c ) log 2 N .
Note that the ECG signal has prominent characteristics of small amplitude, usually 10 μV ~ 5 mV, and with frequencies of interest in the range 0.05–100 Hz. Hence, when the ECG signal is decomposed into scales, an appropriate base wavelet should yield large magnitude coefficients at low frequency and negligible magnitude coefficients at the others. Moreover, the higher the energy extracted from the corrupted ECG signal, the more effective the wavelet transform of the signal will be. The lower the entropy is, the higher the energy concentration will be. Therefore, a self-evaluation criterion for the wavelet decomposing coefficients can be defined by the energy-to-Shannon entropy ratio:
E r = E e n e r g y ( c ) E e n t r o p y ( c )
By combining the energy measure with the Shannon entropy, an appropriate base wavelet should extract the maximum amount of energy from the signal being analyzed, while minimizing the Shannon entropy of the corresponding wavelet coefficients. The bigger Er is, the more suitable a base wavelet will be for filtering.

3.2. Information Comparison Measure

The energy-to-Shannon entropy ratio only evaluates the contents of wavelet coefficients themselves. It is necessary to compare C(n) with S ˜ ( n ) in order to ensure the coefficients of the decomposed signal are inherently related to the signal. Thus, several information theoretic criteria are used to further evaluate the base wavelet [19], i.e.:
(1) Joint entropy
E j o int = H ( S ˜ , C ) = s ˜ S ˜ c C p ( s ˜ , c ) log p ( s ˜ , c )
where p ( s ˜ , c ) is the joint probability distribution of two data sequences. The joint entropy measures information associate with S ˜ and C as a whole.
(2) Conditional Entropy
E c o n = H ( C | S ˜ ) = s ˜ S ˜ p ( s ˜ ) H ( C | S ˜ = s ˜ ) = s ˜ S ˜ p ( s ˜ ) c C p ( c | s ˜ ) log p ( c | s ˜ ) = s ˜ S ˜ c C p ( s ˜ , c ) log p ( s ˜ , c ) p ( s ˜ ) = H ( S ˜ , C ) H ( S ˜ )
where p ( s ˜ ) is the probability distribution of the noisy ECG sequence S ˜ , p ( c | s ˜ ) denotes the conditional probability distribution of the wavelet coefficient sequence C when S ˜ is known. The conditional entropy indicates the information that is particular to each corresponding data sequence itself.
(3) Mutual Information
I m u = I ( S ˜ ; C ) = s ˜ S ˜ c C p ( s ˜ , c ) log p ( s ˜ , c ) p ( s ˜ ) p ( c ) = s ˜ S ˜ c C p ( s ˜ , c ) log p ( s ˜ , c ) s ˜ S ˜ c C p ( s ˜ , c ) log [ p ( s ˜ ) p ( c ) ] = H ( S ˜ , C ) + H ( S ˜ ) + H ( C )
Mutual information is a measure of the information contained in one process related to another process. The average mutual information between the two processes can be expressed as the sum of two self entropies minus the entropy of the pair [19]. In the wavelet thresholding filtering, the mutual information represents the amount of shared information contained in both S ˜ and C . Usually, the larger value of the mutual information of S ˜ and C indicates that the decomposed wavelet coefficients contain more information of the signal to be analyzed.
(4) Relative Entropy
E r e = D ( S ˜ | | C ) = s ˜ S ˜ p ( s ˜ ) log p ( s ˜ ) p ( c )
with p ( s ˜ ) log p ( s ˜ ) p ( c ) = 0 when p ( s ˜ ) = 0 , and p ( s ˜ ) log p ( s ˜ ) p ( c ) = when p ( c ) = 0 . The relative entropy measures the distance between probability distributions of data sequences S ˜ and C. It is zero if and only if both probability distributions are equivalent to each other.
(5) Comparison information entropy
To recover the main characteristics of the ECG signal, the decomposed coefficients obtained by the best base wavelet should have shared information with the ECG signal as much as possible and exclusive information as little as possible. Therefore, an appropriate wavelet should minimize the joint entropy, conditional entropy, and relative entropy of S ˜ and C while maximizing their mutual information. Taking four measures Ejoint, Econ, Imu as well as Ere into account, a comparison information criterion is obtained as follows:
I c = I ( S ˜ ; C ) H ( S ˜ , C ) × H ( C | S ˜ ) × D ( S ˜ | | C ) = I m u E j o int E c o n E r e
It can be seen that the bigger Ic is, the more suitable a base wavelet will be for filtering.

3.3. Comprehensive Entropy Criterion

According to the decomposition capability measures and information comparison measures, by integrating the advantages of the aforementioned entropy criteria, a novel comprehensive entropy criterion for the optimal base wavelet selection is proposed, which is defined as:
E c o m = ( E e n e r g y E e n t r o p y ) ( I m u E j o int E c o n E r e ) = E r I c
It is noticeable that the comprehensive entropy criterion not only measures the decomposition capability of a base wavelet, but also measures the similarity between a noisy signal and its wavelet decomposition coefficients. It comprehensively depicts the suitability of a base wavelet for a specific noisy signal. Generally, the base wavelet that produces the maximum of Ecom should be chosen as the most appropriate wavelet for the noise reduction of an ECG signal.

4. Experiments and Comparison

4.1. ECG Signals

In order to validate the proposed criterion, ECG signals in the MIT-BIH Arrhythmia Database are used for the analysis. The recordings in the MIT-BIH Arrhythmia Database are sampled at 360 Hz per channel with an 11-bit resolution over the 10 mV range with lead II configuration [20,21]. In the noise filtering procedure, the ECG signals in the MIT-BIH Arrhythmia database are regarded as the original clean ECG signals, but the ECG signals of some subjects in the MIT-BIH Arrhythmia Database are still noisy. They cannot be utilized as clean ECG signals and are not suitable for filtering comparison. Therefore, after careful investigation, the ECG signals of sixteen subjects (Subject No. 100, 103, 105, 106, 107, 113, 115, 116, 117, 123, 124, 217, 219, 220, 223, and 234) from the MIT-BIH Arrhythmia Database are selected for evaluating the performance of wavelet thresholding filtering. Moreover, for the same reason, only 10,000 consecutive samples of an analyzed ECG signal with less noise were chosen as the original clean ECG signal. Since electromagnetic noise such as thermal noise is one kind of noise difficult to filter from ECG signals, Gaussian white noise which can mimic the electromagnetic noise usually happening in measurements was added to contaminate the clean ECG signals with noise level SNR of 15, 20, 25, 30, 35 and 40 dB, respectively. Thus, ninety-six noisy ECG signals are generated for the comparison of various criteria for the best base wavelet selection.

4.2. Wavelet Set

Twenty-two widely-used wavelets are chosen as the base wavelet set for signal filtering, including Daubechies (db) series (db1, db2, db3,db4, db5, db6, db7, db8, db9, db10), sym series (sym2, sym3, sym4, sym5, sym6, sym7, sym8) and coif series (coif1,coif2, coif3, coif4,coif5). To compare the criteria described in Section 3, each noisy ECG signal was firstly decomposed up to eight levels by every wavelet in the set. Then, the best wavelet for a noisy ECG signal is chosen according to nine criteria: energy, entropy, energy-to-entropy ratio, joint entropy, conditional entropy, mutual information, relative entropy, comparison information entropy as well as the comprehensive entropy criterion, respectively. In wavelet thresholding filtering, the universal threshold expressed as Equation (3) is employed to estimate the noise in the noisy ECG signal. The soft thresholding function shown as Equation (2) is used to realize the transformation of wavelet decomposing coefficients. The filtering process is conducted according to three steps described in Section 2. All the analysis of ECG signals in this work are carried out through software Matlab7.10.0 (R2010a) (The MathWorks, Inc., Natick, MA, USA).

4.3. Denoising Performance Indexes

Suppose S ( n ) = [ s 0 , s 1 , ... , s N ] is the original clean ECG signal from the MIT database, while S ^ ( n ) = [ s ^ 0 , s ^ 1 , ... , s ^ N ] denotes the denoised ECG signal by using the selected optimal base wavelet and soft thresholding filtering. The filtering performance of different best wavelets are evaluated through comparing S and S ^ based on four filtering indexes defined in the following:
The output signal to noise ratio SNRo:
S N R o = 10 log 10 i = 1 N S 2 ( i ) i = 1 N | S ( i ) S ^ ( i ) | 2
root mean square error RMSE:
R M S E = 1 N i = 1 N [ S ( i ) S ^ ( i ) ] 2
percent root mean-square difference PRD:
P R D = 100 { n = 1 N [ S ( n ) S ^ ( n ) ] 2 / n = 1 N [ S ( n ) ] 2 } }
and correlation coefficient r [22]:
r = i = 1 N ( s i s ¯ ) ( s ^ i s ^ ¯ ) i = 1 N ( s i s ¯ ) 2 i = 1 N ( s ^ i s ^ ¯ ) 2
It is noticed that the SNRo is a measure of signal strength related to the background noise after wavelet thresholding denoising. Theoretically, the SNRo of a filtered ECG signal should be large in amplitude in order to recover the useful signal. On the other hand, the RMSE reflects the distortion of the filtering result. The smaller the RMSE is, the closer the denoised signal is to the original signal and less distortion of the denoised signal after filtering. Moreover, the PRD gives the information about the percentage of distortion of the filtered signal. A small PRD value indicates the efficiency of the denoising procedure. In addition, the correlation coefficient r is a statistical concept that measures how well the denoised signals follow the actual signal. The larger value of r is, the closer the denoised signal is to the original signal.

4.4. Results and Discussion

Through the discrete wavelet decomposition, each of ninety-six noisy ECG signals is transformed into a wavelet coefficient sequence. There are twenty-two test wavelets in total. Hence, every noisy ECG signal is converted into 22 coefficient sequences in terms of 22 base wavelets. Nine criteria for the best base selection are calculated for every coefficient sequence of the same noisy signal. The best base wavelet based on the same criterion is finally selected and recorded through the comparison of criterion values. Because of the diversity of ECG signals of individuals, the selected best wavelet of all subjects reveals a great variety even through the same criterion. Moreover, for the same noisy signal, the selection results of the best base wavelets may also be different depending on the different criteria. Figure 1 shows a typical example of the best base wavelet selection for the ECG signals of all 16 subjects with same noise level SNR 20 dB. Each line represents the best wavelets of one noisy ECG signal obtained respectively by nine criteria. In total, there are 16 wavelet lines in Figure 1. It can be seen that each line of best wavelets fluctuates dramatically. Besides, the best wavelets selected by nine criteria are not identical to each other for all the subjects. Therefore, it is a significant issue to find an appropriate criterion for the best wavelet selection in ECG signal filtering.
Figure 1. Best wavelets for ECG signals with input SNR = 20 dB of all subjects.
Figure 1. Best wavelets for ECG signals with input SNR = 20 dB of all subjects.
Entropy 17 06093 g001
It can also be seen from Figure 1 that two criteria Ejoint and Econ always choose the same wavelets for all noisy signals of 16 subjects. The optimal base wavelets for all subjects obtained by Eentropy, Ejoint, Econ, Imu and Ic are scattered in the wavelet family; while those selected by Eenergy, Er, Ere and Ecom are relatively concentrated. In fact, for the same noisy ECG signal, obvious differences exist in filtering performance based on these nine criteria. Take the ECG signal with noise level of SNR = 20 dB of subject No. 116 as an example. In terms of the nine criteria, the nine best base wavelets and the corresponding criterion values are recorded in Table 1. Then, the corresponding nine denoised ECG signals are shown in Figure 2 for comparison with the original ECG signal and the corrupted ECG signal. Since the soft thresholding inherently shrinks large wavelet coefficients with larger bias [23], the amplitudes of all the denoised signals are lower than those of the original ECG signal. The difference of denoised signals based on nine criteria of the best base wavelet selection can be seen in Figure 3, which is an enlarged version of the denoised one-cycle ECG signals. It is noticeable that the wave shapes of the filtered signals in Figure 2 and Figure 3 based on Eentropy, Ejoint, Econ and Ere are more or less distorted.
Table 1. Best base wavelets and corresponding criterion values of ECG signal with input SNR 20 dB of subject 116.
Table 1. Best base wavelets and corresponding criterion values of ECG signal with input SNR 20 dB of subject 116.
CriterionEenergyEentropyErEjointEconImuEreIcEcom
Waveletcoif5coif1coif5coif1coif1db9db1db8coif4
Criterion value8.3943 × 1034.22241.5282 × 10314.53074.21951.4138 × 10−25.54643.4654 × 10−52.6248 × 10−2
Figure 2. Filtering comparison of ECG signals with noise level SNR = 20 dB of subject No. 116.
Figure 2. Filtering comparison of ECG signals with noise level SNR = 20 dB of subject No. 116.
Entropy 17 06093 g002
Figure 3. Filtering comparison of one-cycle ECG signal with noise SNR = 20 dB of subject No. 116. Notes: The dotted line represents the original clean ECG signal, while the solid line indicates the denoised ECG signal.
Figure 3. Filtering comparison of one-cycle ECG signal with noise SNR = 20 dB of subject No. 116. Notes: The dotted line represents the original clean ECG signal, while the solid line indicates the denoised ECG signal.
Entropy 17 06093 g003
Figure 4. Frequency spectrum comparison of ECG signals with input SNR 20 dB of subject No. 116.
Figure 4. Frequency spectrum comparison of ECG signals with input SNR 20 dB of subject No. 116.
Entropy 17 06093 g004
The corresponding frequency spectra shown in Figure 4 also indicate that some undesirable noise still remains in the filtered signals based on these four criteria. Furthermore, it is a common phenomenon for the most of noisy ECG signals with different noise levels. Hence, though Eentropy, Ejoint, Econ and Ere reflect some features of an ECG signal, they cannot be used independently for the selection of best base wavelet. With the optimal wavelets, the filtering results of subject No. 116 are recorded in Table 2. The comprehensive entropy criterion chooses the wavelet Coif4 as the optimal base wavelet for the noisy ECG signal with noise level of SNR 20 dB of subject No. 116. By excluding undesirable criteria Eentropy, Ejoint, Econ and Ere, criterion Ecom obtains the best filtering indexes, i.e., the highest values of SNRo and r, as well as the lowest values of RMSE and PRD. The same conclusion can also be made according to the filtering results shown in Table 3, which indicates the mean values of four denoising indexes (i.e., S N R o ¯ , R M S E ¯ , P R D ¯ and r ¯ ) for the ECG signals with noise level SNR 20 dB of all subjects.
Furthermore, on the basis of nine best wavelet selection criteria, the mean filtering index values for all 96 ECG signals with six noise levels, i.e., S N R o ¯ ¯ , R M S E ¯ ¯ , P R D ¯ ¯ and r ¯ ¯ , are shown in Table 4. For the comprehensive entropy criterion, the mean values of SNRo, RMSE, PRD, and r of 96 ECG signals are 19.4922, 0.0763, 13.9420 and 0.9758, respectively. It is noted that, except P R D ¯ ¯ , indexes S N R o ¯ ¯ , R M S E ¯ ¯ and r ¯ ¯ of Ecom are better than those of the other eight criteria. Four filtering indexes of Ere are the worst. Besides, though the PRD of Eentropy is 13.9304 which is the lowest value and less than that of Ecom, too much noise remaining in the most of denoised signals greatly attenuates the efficacy of Eentropy.
In order to further validate the proposed approach, the shape-match (SM) approach based on cross correlation coefficient [6] is implemented since it is commonly used in the selection of optimal base wavelets. The procedure is described as follows: firstly, all the ECG signals are segmented into ECG cycle data. The cross correlation coefficients of cycle ECG segments and base wavelets are calculated. Then, the wavelet with highest correlation coefficient is chosen as the best base wavelet of an ECG signal. With the wavelet obtained by the SM, all the ECG signals are filtered by the same soft-thresholding method used in the former experiment. Moreover, the mean values of four filtering indexes of six denoised ECG signals of every subjects, i.e., SNRom, RMSEm, PRDm and rm, are respectively computed for the E c o m and the SM. The two-sided t-hypothesis test is conducted in the mean values of SNRom, RMSEm, PRDm and rm of two methods. The values of statistic variables of these four indexes for two methods are recorded in Table 5 and the corresponding box figures are illustrated in Figure 5. It can be clearly seen that the values of statistic variables of four indexes of Ecom are better than those of the SM. Besides, the t-test results show that all the statistic values of the test are larger than the value of two-sided t-test standard deviation u α 2 = 1.6973 as the significance level α = 0.05 . Therefore, it can be concluded that the SNRom, RMSEm, PRDm and rm of Ecom is significantly different from those of the shape-matched approach (n1 = n2 = 16, significance level α = 0.05 , two-sided t- test).
Finally, the mean value curves of SNRo, RMSE, PRD and r for all the noisy signals based on the comprehensive entropy criterion are depicted in Figure 6. It is noted that both SNRo and r varies proportionally to the increase of input SNR, while RMSE and PRD changes inversely as input SNR rises. Moreover, the relation between the output SNRo and the input SNR is close to linear. It indicates that the proposed criterion Ecom has the stable performance in the optimal base selection. The variation of RMSE, PRD and r tends to be slowly as the input SNR rises merely because there is less noise in the contaminated ECG signals to be reduced.
Table 2. Filtering performance comparison for ECG signal with input SNR 20 dB of subject No.116.
Table 2. Filtering performance comparison for ECG signal with input SNR 20 dB of subject No.116.
CriterionSNRo (dB)RMSE (mV)PRD (%)R
Eenergy17.78890.148112.89900.9796
Eentropy18.30610.139512.15330.9823
Er17.78890.148112.89900.9796
Ejoint18.30610.139512.15330.9823
Econ18.30610.139512.15330.9823
Imu17.70780.149413.01990.9823
Ere16.97740.162614.16220.9760
Ic17.36930.155413.53740.9803
Ecom17.88960.146312.75020.9829
Table 3. Filtering performance comparison for ECG signal with input SNR 20 dB of all subject.
Table 3. Filtering performance comparison for ECG signal with input SNR 20 dB of all subject.
Criterion S N R o ¯ (dB) R M S E ¯ (mV) P R D ¯ (%) r ¯
Eenergy14.69890.109319.91520.9662
Eentropy14.68350.109119.72160.9625
Er14.69150.109419.92440.9658
Ejoint14.65960.109519.77840.9623
Econ E c o n 14.65960.109519.77840.9623
Imu14.58860.110720.23600.9628
Ere13.95410.119721.71190.9578
Ic14.46730.112220.47410.9614
Ecom14.73140.109019.91280.9670
Table 4. Filtering performance comparison for all ECG signals of all subjects.
Table 4. Filtering performance comparison for all ECG signals of all subjects.
Criterion S N R ¯ ¯ (dB) R M S E ¯ ¯ (mV) P R D ¯ ¯ (%) r ¯ ¯
Eenergy19.46080.076613.98110.9757
Eentropy19.11590.077313.93040.9738
Er19.49160.076313.94320.9757
Ejoint19.10470.077413.97440.9741
Econ19.10470.077413.97440.9741
Imu19.45680.076714.04520.9745
Ere17.7827 *0.0882 *15.7884 *0.9712 *
Ic19.44520.077014.08240.9743
Ecom19.49220.076313.94200.9758
Note: * denotes the worst filtering result of nine criteria.
Table 5. Statistic variables of SNRom, RMSEm, PRDm and rm of Ecom and those of shape-matched approach.
Table 5. Statistic variables of SNRom, RMSEm, PRDm and rm of Ecom and those of shape-matched approach.
IndexSNRomRMSEmPRDmrm
Statistic VariableMeanStandard DeviationMeanStandard DeviationMeanStandard DeviationMeanStandard Deviation
E c o m 19.49223.64290.07630.010713.94205.79040.97580.0163
Shape-Matched (SM)17.33043.48440.08650.013117.58096.06170.96320.0137
Figure 5. Box plots of SNRom, RMSEm, PRDm, and rm of E c o m and shape-matched approach.
Figure 5. Box plots of SNRom, RMSEm, PRDm, and rm of E c o m and shape-matched approach.
Entropy 17 06093 g005
Figure 6. Variation of SNRo, RMSE, PRD, and r of all ECG signals with different noise levels based on the comprehensive entropy criterion.
Figure 6. Variation of SNRo, RMSE, PRD, and r of all ECG signals with different noise levels based on the comprehensive entropy criterion.
Entropy 17 06093 g006

5. Conclusions

Wavelet transforms have become efficient analyzing tools for time-varying and non-stationary ECG signals due to their powerful decomposition ability of both high and low resolution signals. However, the selection of the best wavelet is a crucial issue in the wavelet filtering of ECG signals. Inspired by information entropy, a novel comprehensive entropy criterion based on the combination of multiple entropy measures is proposed in this paper to find an appropriate wavelet for a specific ECG signal. It is devised through evaluating both the decomposing ability of wavelet and the information similarity between decomposed coefficients and the signal to be analyzed. The ECG signals of sixteen subjects selected from the MIT-BIH Arrhythmia Database were used to validate the proposed criterion. Ninety-six noisy ECG signals were generated by artificially adding the Gaussian white noise with noise levels of SNR with 15, 20, 25, 30, 35 and 40 dB to the original ECG signal, respectively. A detailed comparison for the optimal wavelet selection was conducted between the comprehensive criterion and the other eight common criteria, i.e., energy, entropy, energy-to-entropy ratio, joint entropy, conditional entropy, mutual information, relative entropy, and comparison information entropy. Four filtering indexes SNRo, RMSE, PRD and correlation coefficient were used for performance comparison of the nine criteria. The filtering results have verified that entropy, joint entropy, conditional entropy, and relative entropy cannot be used independently as the criterion for best base wavelet selection due to the prominent distortion and residual noise in most of the filtered signals. Nevertheless, the comprehensive entropy criterion can find the best base wavelet for a specific ECG signal and obtain better filtering performance than those of the compared eight criteria. The statistical analysis results also show that the four filtering indexes of denoised ECG signals of Ecom have significant differences from those of the shape-matched approach for base wavelet selection. By comparison to the shape-matched approach and other filtering-index-based methods, the comprehensive criterion chooses the optimal base wavelet only through decomposed wavelet coefficients without utilizing any other extra data or computations. Hence, the lesser complexity in computation of Ecom will definitely shorten the computational time for optimal base wavelet selection. In terms of its efficiency, the comprehensive criterion can be further applied to the wavelet noise reduction of other biomedical or mechanical signals. However, it should be noted that the comprehensive criterion is based on the entropy which reflects the regularity of an ECG signal. Because of the diversity of cardiac diseases, the comprehensive criterion will be less effective for noise filtering of irregular burst ECG signals, such as heart attack signals. Hence, our research in the future will focus on the improvement of comprehensive entropy criterion for different accidental pathological ECG signals.

Acknowledgments

This work is partially supported by National Natural Science Foundation of China (Grant Nos. 61203108, 61171088, 61263016, and 61571302), The Natural Science Foundation of Shanghai (Grant No. 14ZR1430300), the Research Projects of Science and Technology Commission of Shanghai (Grant No. 14140711200) and the Innovation Program of Shanghai Municipal Education Commission (Grant No. 13YZ056).

Author Contributions

Hong He devised the new criterion for the best base wavelet selection and the validation filtering experiment, supervised data collection, processed signals, analyzed results and wrote the paper. Yonghong Tan revised and edited the paper. Yuexia Wang collected the data and devised noisy data.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. SØensen, J.T.; Clemmensen, P.; Sejersten, M. Telecardiology: Past, Present and Future. Rev. Esp. Cardiol. 2013, 66, 212–218. [Google Scholar] [CrossRef]
  2. Luo, S.; Johnston, P. A Review of Electrocardiogram Filtering. J. Electrocardiol. 2010, 43, 486–496. [Google Scholar] [CrossRef] [PubMed]
  3. Gacek, A.; Pedrycz, W. ECG Signal. Processing, Classification and Interpretation: A Comprehensive Framework of Computational Intelligence; Springer: London, UK, 2012. [Google Scholar]
  4. Gao, R.X.; Yan, R. Wavelets: Theory and Applications for Manufacturing; Springer: Berlin, Germany, 2011. [Google Scholar]
  5. Castillo, E.; Morales, D.P.; García, A.; Martínez-Martí, F.; Parrilla, L.; Palma, A.J. Noise Suppression in ECG Signals through Efficient One-Step Wavelet Processing Techniques. J. Appl. Math. 2013, 2013. [Google Scholar] [CrossRef]
  6. Singh, B.N.; Tiwari, A.K. Optimal Selection of Wavelet Basis Function Applied to ECG Signal Denoising. Digit. Signal. Process. 2006, 16, 275–287. [Google Scholar] [CrossRef]
  7. Bhatia, P.; Boudy, J.; Andreao, R.V. Wavelet Transformation and Pre-Selection of Mother Wavelet for ECG Signal Processing. In Proceedings of the 24th IASTED International Multiconference: Biomedical Engineering, Innsbruck, Austria, 15–17 Februray 2006; pp. 390–395.
  8. Amit, K.; Mandeep, S. Optimal Selection of Wavelet Function and Decomposition Level for Removal of ECG Signal Artifacts. J. Med. Imaging Health Inf. 2015, 5, 138–146. [Google Scholar]
  9. Tan, H.G.R.; Tan, A.C.; Khong, P.Y.; Mok, V.H. Best Wavelet Function Identification System for ECG Signal Denoise Applications. In Proceedings of the IEEE International Conference on Intelligent and Advanced Systems, ICIAS 2007, Kuala Lumpur, Malaysia, 25–28 November 2007; pp. 631–634.
  10. Stantic, D.; Jo, J. Selection of Optimal Parameters for ECG Signal Smoothing and Baseline Drift Removal. Comput. Inf. Sci. 2014, 7, 99–110. [Google Scholar] [CrossRef]
  11. Ranjeet, K.; Farida, J. Retained Signal Energy Based Optimal Wavelet Selection for Denoising of ECG Signal Using Modifide Thresholding. In Proceedings of the IEEE 2011 International Conference on Multimedia, Signal Processing and Communication Technologies (IMPACT), Aligarh, India, 17–19 December 2011; pp. 196–199.
  12. El-Sayed, A.; Dahshan, E. Genetic Algorithm and Wavelet Hybrid Scheme for ECG Signal Denoising. Telecommun. Syst. 2011, 46, 209–215. [Google Scholar]
  13. Addison, P.S. Wavelet Transforms and the ECG: A Review. Physiol. Meas. 2005, 26, 155–199. [Google Scholar] [CrossRef] [PubMed]
  14. Mallat, S. A Wavelet Tour of Signal Processing, 3rd ed.; Academic Press: Waltham, MA, USA, 2009. [Google Scholar]
  15. Mateo, J.; Torres, A.M.; Soria, C.; Santos, J.L. A Method for Removing Noise from Continuous Brain Signal Recordings. Comput. Electr. Eng. 2013, 39, 1561–1570. [Google Scholar] [CrossRef]
  16. Donoho, D.L.; Johnstone, I.M. Ideal Spatial Adaptation via Wavelet Shrinkage. Biometrika 1994, 81, 425–455. [Google Scholar] [CrossRef]
  17. Donoho, D.L. Denoising by Soft-thresholding. IEEE Trans. Inform. Theory 1995, 41, 613–627. [Google Scholar] [CrossRef]
  18. Von Sachs, R.; MacGibbon, B. Nonparametric Curve Estimation by Wavelet Thresholding with Locally Stationary Errors. Scand. J. Stat. 2000, 27, 475–499. [Google Scholar] [CrossRef]
  19. Gray, R.M. Entropy and Information Theory, 2nd ed.; Springer: New York, NY, USA, 2011. [Google Scholar]
  20. Moody, G.B.; Mark, R.G. The Impact of the MIT-BIH Arrhythmia Database. IEEE Eng. Med. Biol. 2001, 20, 45–50. [Google Scholar] [CrossRef]
  21. Goldberger, A.L.; Amaral, L.A.N.; Glass, L.; Hausdorff, J.M.; Ivanov, P.C.; Mark, R.G.; Mietus, J.E.; Moody, G.B.; Peng, C.-K.; Stanley, H.E. Physiobank, Physiotoolkit, and Physionet: Components of a New Research. Circulation 2000, 101, e215–e220. [Google Scholar] [CrossRef] [PubMed]
  22. Üstundag, M.; Gökbulut, M.; Sengür, A.; Ata, F. Denoising of Weak ECG Signals by Using Wavelet Analysis and Fuzzy Thresholding. Netw. Model. Anal. Health Inform. Bioinfor. 2012, 1, 1–6. [Google Scholar] [CrossRef]
  23. Bruce, A.G.; Gao, H.Y. WaveShrink with Firm Shrinkage. Stat. Sin. 1997, 7, 855–874. [Google Scholar]

Share and Cite

MDPI and ACS Style

He, H.; Tan, Y.; Wang, Y. Optimal Base Wavelet Selection for ECG Noise Reduction Using a Comprehensive Entropy Criterion. Entropy 2015, 17, 6093-6109. https://0-doi-org.brum.beds.ac.uk/10.3390/e17096093

AMA Style

He H, Tan Y, Wang Y. Optimal Base Wavelet Selection for ECG Noise Reduction Using a Comprehensive Entropy Criterion. Entropy. 2015; 17(9):6093-6109. https://0-doi-org.brum.beds.ac.uk/10.3390/e17096093

Chicago/Turabian Style

He, Hong, Yonghong Tan, and Yuexia Wang. 2015. "Optimal Base Wavelet Selection for ECG Noise Reduction Using a Comprehensive Entropy Criterion" Entropy 17, no. 9: 6093-6109. https://0-doi-org.brum.beds.ac.uk/10.3390/e17096093

Article Metrics

Back to TopTop