Article

Electronic Tongue Recognition with Feature Specificity Enhancement

School of Microelectronics and Communication Engineering, Chongqing University, No. 174 Shazheng Street, Shapingba District, Chongqing 400044, China
* Author to whom correspondence should be addressed.
Submission received: 31 December 2019 / Revised: 24 January 2020 / Accepted: 29 January 2020 / Published: 31 January 2020
(This article belongs to the Special Issue Electronic Tongues, Electronic Noses, and Electronic Eyes)

Abstract

As a kind of intelligent instrument, an electronic tongue (E-tongue) realizes liquid analysis with an electrode-sensor array and machine-learning methods. Large amplitude pulse voltammetry (LAPV) is a common E-tongue type that collects a large amount of response data at a high sampling frequency within a short time, so a fast and effective feature extraction method is necessary for the subsequent machine-learning stage. Considering that massive common-mode components (highly correlated signals) in the sensor-array responses depress the recognition performance of machine-learning models, we propose a feature extraction method named feature specificity enhancement (FSE) for feature enhancement and dimension reduction. The proposed FSE method highlights the specificity signals by eliminating the common-mode signals on paired sensor responses, while a radial basis function projects the original features into a nonlinear space. Furthermore, we selected the kernel extreme learning machine (KELM) as the recognition part owing to its fast speed and excellent flexibility. Two datasets from LAPV E-tongues have been adopted for the evaluation of the machine-learning models: one was collected by a self-designed E-tongue for beverage identification and the other is a public benchmark. For performance comparison, we introduced several machine-learning models consisting of different combinations of feature extraction and recognition methods. The experimental results show that the proposed FSE coupled with KELM demonstrates obvious superiority to the other models in accuracy, time consumption and memory cost. Additionally, the low parameter sensitivity of the proposed model is demonstrated as well.

1. Introduction

Electronic tongues (E-tongues) consist of an electrode-sensor array and a machine-learning unit, and have made considerable contributions to liquid analysis since the 1990s [1,2,3]. The electrode sensors adopted for an E-tongue often have both low selectivity and high cross-sensitivity, while the machine-learning part realizes intelligent identification according to the responses of the electrode-sensor array [4]. E-tongue systems have been applied in a wide range of scenarios such as environmental monitoring [5,6], food identification [7,8,9] and beverage discrimination [10]. With regard to beverage discrimination, several kinds of beverages, including coffee [11], tea [12,13], wine [14,15] and dairy products [16], have been assessed by E-tongues to clarify their origins, grades or brands. Among these studies, tea and wine have been the main testing substances.
Current E-tongues can be broadly divided into three categories according to the employed sensors: potentiometric [17,18], optical [19,20] and voltammetric types [21,22]. The potentiometric E-tongue usually operates on the basis of high-impedance sensitive-membrane sensors. It relies on a working electrode and a reference electrode to constitute a two-electrode system [23]. The working-electrode potential varies according to the nature and strength of the solution, which reflects the characteristics of the samples. Different from potentiometric methods, optical E-tongues involve various optical sensors that consist of three main parts: a light source, an optrode and a detector [24]. The optical properties of the indicator contained in the optrode change when the sensor is exposed to a target analyte, and the detector monitors this change and converts the optical signal into an electrical one. In contrast, the voltammetric E-tongue is based on noble-metal bare electrodes and employs a three-electrode system for measurement, containing a working electrode, an auxiliary electrode and a reference electrode [25]. The potential of the reference electrode is assumed to be constant, while a signal loop is formed between the auxiliary and working electrodes to obtain the response signal. Among these three kinds of E-tongue, the voltammetric type has drawn more attention owing to its abundant original responses, high signal-to-noise ratio, simple operation and low cost. Winquist et al. reported a voltammetric E-tongue with an array of noble-metal electrodes which sensed the characteristics of liquid substances and compounds in a large amplitude pulse voltammetry (LAPV) manner [26]. The collected LAPV response is typically a digital time series containing a huge amount of data due to the high sampling rates of the devices. In other words, the raw responses of an LAPV E-tongue are high-dimensional vectors, which may include many redundant features for the subsequent recognition. Thus, a fast and effective feature extraction method is needed to obtain concise and meaningful features from the raw responses. Consequently, dimension-reduction methods have been utilized to extract features from original responses. Principal Component Analysis (PCA) has been widely used as a classic feature extraction method with satisfactory signal-compression ability for LAPV responses in recent studies [27,28,29]. Furthermore, the concealed information in the LAPV time series can be revealed by time-frequency analysis methods; Palit et al. employed the discrete wavelet transform (DWT) to extract features from E-tongue responses [30]. However, the above methods directly process the original electrode responses and may ignore the common-mode signals existing across different electrodes. Here, the "common-mode" signals mean the highly correlated parts in different working-sensor responses caused by the same LAPV excitation signal. The common-mode signal typically accounts for most of the amplitude of the raw LAPV responses, which may cover up the specificity of different sensor responses. As a result, the recognition performance of an LAPV E-tongue is unavoidably degraded by common-mode signals.
In this paper, the electrode sensor response is regarded as a superposition of specificity and common-mode components. We consider the specificity parts as a significant basis for E-tongue classifications, whereas the common-mode components in different sensor responses provide harmful information for recognition. According to the above assumption, we have proposed a feature enhancement method using a nonlinear specificity metric that alleviates the common-mode components in sensor responses. Meanwhile, we introduce a machine-learning model that integrates the proposed feature enhancement method and kernel extreme learning machine (KELM) for E-tongue recognition. The main work of this paper can be summarized as follows:
(1)
A specificity metric scheme is presented to enhance the useful features in electrode responses in Hilbert space by exploiting a kernel function.
(2)
We integrate the proposed feature extraction method with KELM into a scheme that achieves remarkable speed and accuracy compared with other methodologies.
The rest of this paper is organized as follows: Section 2 introduces the details of the machine-learning model for E-tongues, including feature extraction and recognition approaches. The experimental results and analysis are presented in Section 3. Finally, Section 4 concludes this paper.

2. Methods

2.1. Notations

In this paper, $X = [x_1, x_2, \ldots, x_m]^T \in \mathbb{R}^{m \times d}$ represents the raw responses of one sample, where $m$ is the number of sensors and $d$ is the time length of a sensor response. Here, $d$ equals the product of the sampling time and the sampling rate.

2.2. Machine Learning Model

In this subsection, our goal is to establish a proper machine-learning model for E-tongue recognition on raw sensor responses. Accordingly, the proposed model should have the following properties:
(1)
Recognition results should be quickly obtained from a large number of digital pulse-like signals.
(2)
Feature specificity should be enhanced by eliminating the common-mode components.
(3)
The nonlinear relationships exhibited in sensor responses should be well exploited.
To achieve these goals, we propose a machine-learning model for LAPV E-tongues that combines a feature specificity enhancement (FSE) scheme with a KELM classifier.

2.2.1. Feature Specificity Enhancement

The proposed FSE method extracts the specificity signals concealed in paired sensor responses by removing the common-mode components in Hilbert space. Thus, the feature extraction can be formulated as:

$$Z_{ij} = k(x_i, x_j), \quad i \neq j$$

where $x_i = \{x_i^n\}$ represents the $i$-th sensor response of the $n$-th sample, $Z_{ij} = \{Z_{ij}^n\}$ indicates the feature extracted from the $i$-th and $j$-th sensor responses, and $k(\cdot)$ denotes a kernel function projecting the original specificity component into a nonlinear space. We assume that an intact sensor response consists of both a specificity and a common-mode component. Then, $x_i$ and $x_j$ can be redefined as:

$$\begin{cases} x_i = x_c^{ij} + x_s^{ij} \\ x_j = x_c^{ij} - x_s^{ij} \end{cases}$$

where $x_c^{ij}$ and $x_s^{ij}$ denote the common-mode component and the specificity signal in the paired sensor responses, respectively. Hence, $x_c^{ij}$ and $x_s^{ij}$ can be written as:

$$x_c^{ij} = (x_i + x_j)/2, \qquad x_s^{ij} = (x_i - x_j)/2$$

It is obvious that $x_s^{ij}$ is the useful portion (specificity part) for further classification, while $x_c^{ij}$ is the common-mode component carrying little classification information between Sensors $i$ and $j$. Thus, we adopt $x_s^{ij}$ for the following process. Moreover, we introduce the kernel function to deal with the nonlinearity present in the specificity signals and to avoid the "curse of dimensionality" when projecting LAPV responses into a new space [31]. In general, differentiable functions including polynomial, Gaussian (radial basis function, RBF), exponential, sigmoid and delta functions can be selected as the kernel. Among these, the RBF incurs the lowest computational load owing to its minimal number of parameters and demonstrates favorable performance in data classification. Therefore, we chose the RBF as the primary kernel function in this paper, defined as:

$$k(x_i, x_j) = k(2x_s^{ij}) = \exp\left(-\frac{\|x_i - x_j\|_2^2}{2\sigma^2}\right)$$

where $\exp(\cdot)$ is the exponential function, $\sigma$ is the width of the kernel function and $\|\cdot\|_2$ denotes the $\ell_2$-norm. From this expression, the feature size of each sample is $m(m-1)$ after feature extraction with the RBF. In other words, the size of the feature subset is associated only with the number of electrodes in the sensor array rather than with the sensor-response length. In addition, the proposed FSE method obtains meaningful features in an unsupervised manner (without any prior labels). As a result, FSE is a simple and computationally light approach for feature extraction from a high-dimensional space. The details of FSE are summarized in Algorithm 1.
Algorithm 1. The FSE Method
Input:
  The sample data matrix $X \in \mathbb{R}^{m \times d}$
  The kernel function $k(\cdot)$ and kernel parameter $\sigma$
Output:
  The feature matrix $Z \in \mathbb{R}^{m \times (m-1)}$
Procedure:
  for $i = 1, \ldots, m$
    for $j = 1, \ldots, m$
      if $i \neq j$: compute $Z_{ij} = k(x_i, x_j)$
    end
  end
  output the feature matrix $Z \in \mathbb{R}^{m \times (m-1)}$
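A minimal NumPy sketch of Algorithm 1 is given below, assuming the RBF kernel defined above; the function name fse_features and the default value of sigma are illustrative choices, not part of the original implementation.

```python
import numpy as np

def fse_features(X, sigma=1.0):
    """FSE features for one sample.

    X     : (m, d) array, raw responses of the m working electrodes.
    sigma : RBF kernel width.
    Returns an array of m*(m-1) values, one RBF kernel value per ordered
    electrode pair (i, j) with i != j.
    """
    m = X.shape[0]
    feats = []
    for i in range(m):
        for j in range(m):
            if i == j:
                continue
            # The common-mode part (x_i + x_j)/2 cancels in the difference,
            # so only the specificity part enters the kernel: x_i - x_j = 2*x_s^{ij}.
            diff = X[i] - X[j]
            feats.append(np.exp(-np.dot(diff, diff) / (2.0 * sigma ** 2)))
    return np.asarray(feats)
```

For the six-electrode array used later in this paper, the output length is 6 × 5 = 30 features per sample, regardless of how many points each electrode response contains.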

2.2.2. Kernel Extreme Learning Machine

The extreme learning machine (ELM) is a fast learning algorithm based on a single-hidden-layer feedforward neural network [32]. A recent work reported the feasibility of this method in liquid recognition [33]. ELM randomly initializes the input weights $W = [w_1, w_2, \ldots, w_L]^T \in \mathbb{R}^{L \times D}$ and biases $b = [b_1, b_2, \ldots, b_L] \in \mathbb{R}^L$. The output weight matrix $\beta \in \mathbb{R}^{L \times C}$ is then calculated analytically from the output matrix of the hidden layer. The output matrix $H$ of the hidden layer with $L$ hidden neurons is computed as:

$$H = \begin{bmatrix} g(w_1^T x_1 + b_1) & \cdots & g(w_L^T x_1 + b_L) \\ \vdots & \ddots & \vdots \\ g(w_1^T x_N + b_1) & \cdots & g(w_L^T x_N + b_L) \end{bmatrix}_{N \times L}$$

where $g(\cdot)$ is the activation function and $N$ is the number of training samples. ELM learning reaches a tradeoff between the minimum training error and the smallest norm of the output weights as follows:

$$\min_{\beta} \; \frac{1}{2}\|\beta\|^2 + \frac{\mu}{2}\sum_{i=1}^{N}\|\xi_i\|^2 \qquad \text{s.t.} \quad h(x_i)\beta = t_i - \xi_i, \;\; i = 1, 2, \ldots, N$$

where $\mu$ is the regularization coefficient and $\xi_i$ denotes the prediction error on the training set. The approximate solution of the output weight matrix $\beta$ is calculated directly as:

$$\beta = \begin{cases} \left(H^T H + \dfrac{I_{L \times L}}{\mu}\right)^{-1} H^T T, & N \geq L \\[1ex] H^T \left(H H^T + \dfrac{I_{N \times N}}{\mu}\right)^{-1} T, & N < L \end{cases}$$

where $T = [t_1, t_2, \ldots, t_N]^T \in \mathbb{R}^{N \times C}$. Therefore, the final output can be computed as:

$$f(x) = h(x) H^T \left(H H^T + \dfrac{I_{N \times N}}{\mu}\right)^{-1} T$$

The KELM introduces a kernel function $K_{ij} = h(x_i) \cdot h(x_j) = k(x_i, x_j)$ when calculating the output of the network [34]; thus, the expression above can be rewritten as:

$$f(x) = \begin{bmatrix} k(x, x_1) & \cdots & k(x, x_N) \end{bmatrix} \left(K + \dfrac{I_{N \times N}}{\mu}\right)^{-1} T$$

When the kernel function is employed by KELM, not only is the computation time reduced, but the number of hidden neurons also no longer needs to be specified in advance.
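The following is a hedged NumPy sketch of the closed-form KELM training and prediction above, assuming an RBF kernel; mu and theta mirror the regularization coefficient and kernel width in the text, and the function names are ours.

```python
import numpy as np

def rbf_kernel(A, B, theta=10.0):
    # Pairwise RBF kernel between the rows of A and the rows of B.
    d2 = np.sum(A**2, axis=1)[:, None] + np.sum(B**2, axis=1)[None, :] - 2.0 * A @ B.T
    return np.exp(-d2 / (2.0 * theta**2))

def kelm_train(X, T, mu=1000.0, theta=10.0):
    # Closed-form output coefficients: alpha = (K + I/mu)^{-1} T,
    # where T is the N x C one-hot label matrix of the training set.
    K = rbf_kernel(X, X, theta)
    return np.linalg.solve(K + np.eye(X.shape[0]) / mu, T)

def kelm_predict(X_train, alpha, X_new, theta=10.0):
    # f(x) = [k(x, x_1) ... k(x, x_N)] (K + I/mu)^{-1} T
    scores = rbf_kernel(X_new, X_train, theta) @ alpha
    return np.argmax(scores, axis=1)   # predicted class indices
```

The default values of mu and theta correspond to the grid-search results reported later in Section 3.5.2; they are starting points rather than universal settings and should be re-tuned for a new dataset.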

3. Experiments and Discussions

3.1. Datasets

In this section, experiments on two LAPV E-tongue datasets were conducted to verify the effectiveness of the proposed machine-learning model. One dataset was collected with the E-tongue we designed, while the other is a public E-tongue benchmark.

3.1.1. Our Own Dataset

This dataset was acquired from a self-designed E-tongue system, as shown in Figure 1. The system is composed of an electrolytic cell, a sensor array, a signal transformation board, a control unit and an upper computer [35]. The sensor array comprises one reference electrode, one auxiliary electrode and six working electrodes, forming a three-electrode system based on multiple LAPV (MLAPV) [28]. An MLAPV-type E-tongue combines excitation pulses of different potential intensities, so that solutions with different properties show unique oxidation, charging and discharging characteristics at the different pulse-like excitation potentials. Meanwhile, the excitation pulse interval is used as a frequency parameter, which makes the corresponding working-electrode responses show special electrochemical characteristics in different frequency segments. As a result, the overall characteristics of the solution can be sampled at multiple frequency scales, enriching the feature information for further identification [36].
The working electrodes are gold, platinum, palladium, titanium, tungsten and silver. The auxiliary and reference electrodes are made of pillar platinum and Ag/AgCl, respectively. Before each measurement, the working and auxiliary electrodes were cleaned with a polishing cloth and polishing powder to obtain a mirror-polished surface, while the reference electrode was washed with distilled water. After each measurement, all the electrodes were washed with distilled water to prevent residue adhesion.
Seven beverages, including red wine, white spirit, beer, oolong tea, black tea, maofeng tea and pu'er tea, were selected for the experiments. For each type of tea, 2 g of solid tea leaves were soaked in 200 mL of distilled water for 5 min, and the original tea solution was then obtained by filtering off the leaves. The original solutions of red wine, white spirit and beer were bought directly from the manufacturers. Then, we prepared samples at different concentrations by diluting the original solutions with distilled water, keeping the temperature at around 25 °C. Accordingly, low, medium and high concentration samples were prepared for each beverage with original-solution ratios of 14%, 25% and 100%, respectively. Each prepared sample was tested three times within an hour with the designed E-tongue. In total, 63 samples (7 kinds × 3 concentrations × 3 repetitions) were collected for the seven kinds of liquid.
The excitation signal of MLAPV includes three frequency segments, 0.2 Hz (low frequency), 1 Hz (medium frequency) and 2 Hz (high frequency), in one time cycle to stimulate different transient pulse-like responses. As Figure 2a shows, the excitation signal on the auxiliary electrode lasts 60 s per cycle, and a blank segment is placed between two frequency segments as a cooling-off period for the electrodes. Each frequency segment contains five pulses with voltages of 3.3, 3.1, 2.9, 2.7 and 2.5 V relative to a reference voltage of 2.3 V. In our circuit design, the signal potential acting on the solution equals the difference between the pulse voltage and the reference voltage, so the practical pulse voltages in the solution are 1.0, 0.8, 0.6, 0.4 and 0.2 V, respectively. The response on a working electrode, as shown in Figure 2b, is a series of transient signals induced by the excitation signal. In total, 9000 response points were collected for each working electrode at a sampling rate of 150 Hz. As a result, a 6 × 9000 data matrix is obtained for each experiment, since there are six working electrodes.
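As a quick sanity check of these dimensions (and assuming the fse_features sketch from Section 2.2.1), one 60 s measurement sampled at 150 Hz on six working electrodes gives a 6 × 9000 matrix, which FSE reduces to 30 features:

```python
import numpy as np

fs, duration, n_electrodes = 150, 60, 6            # Hz, seconds, working electrodes
raw = np.random.rand(n_electrodes, fs * duration)  # placeholder for one 6 x 9000 measurement
print(raw.shape)                                   # (6, 9000)
print(fse_features(raw, sigma=1.0).shape)          # (30,) = m * (m - 1)
```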

3.1.2. Public Benchmark

In order to evaluate the effectiveness of the proposed method comprehensively, a benchmark dataset of 13 kinds of liquid collected with an LAPV E-tongue was also used [33]. The 13 kinds of liquid were beer, red wine, white spirit, black tea, Mao Feng tea, Pu'er tea, oolong tea, coffee, milk, cola, vinegar, medicine and salt. The dataset consists of 114 samples, and the matrix size of each sample is 6 × 2050, corresponding to six sensors and 2050 points collected per sensor. In this paper, we used this dataset with a four-fold cross-validation strategy.

3.2. Method Settings

In order to evaluate the proposed machine learning model comprehensively, we chose several popular feature extraction methods and classifiers to form different machine-learning models for comparisons.
For feature extraction, we adopted Raw Data (RD), PCA, DWT and the proposed FSE for the following evaluation. Here, RD means using the raw responses directly for recognition [37]. PCA effectively removes redundant components from the original data by an orthogonal transformation that keeps the components with the largest variance [38]. DWT can process MLAPV signals by employing a class-separability criterion to find a suitable level for wavelet decomposition [30]. For a fair comparison, we also defined another kernel-based method (KBM) that uses the same kernel function and parameters as FSE but retains the common-mode signals. The formulation of KBM is:

$$Z_i = \exp\left(-\frac{\|x_i\|_2^2}{2\sigma^2}\right), \quad i = 1, \ldots, m$$
In terms of the recognition part, Support Vector Machine (SVM) [31], Random Forest (RF) [39], Linear Discriminant Analysis (LDA) [40] and Naive Bayes (NB) [41] were considered in addition to KELM. We selected SVM as a representative classifier that trades off approximation against generalization, and RF as a typical classifier-ensemble method. SVM can employ different kernel functions to map low-dimensional samples into high-dimensional spaces for more accurate classification; we selected the RBF as the kernel of SVM to perform nonlinear projection. After grid optimization, the penalty coefficient C and kernel parameter of SVM were set to 1000 and 0.001, respectively. RF has few parameters to adjust and works well on both classification and regression problems; as a tradeoff between computational load and classification accuracy, the number of decision trees in RF was set to 50. The basic idea of LDA is to project high-dimensional samples into an optimal discriminant space where the ratio of the intra-class to the inter-class distance is minimized. Additionally, NB is a classification algorithm based on probability theory. A configuration sketch of these classifiers is given below.
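A possible scikit-learn configuration of the comparison classifiers, using the parameter values reported above; the use of scikit-learn and of the Gaussian variant of Naive Bayes are our assumptions, since the paper does not specify the implementation.

```python
from sklearn.svm import SVC
from sklearn.ensemble import RandomForestClassifier
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.naive_bayes import GaussianNB

classifiers = {
    # RBF-kernel SVM with the grid-optimized penalty coefficient and kernel parameter.
    "SVM": SVC(kernel="rbf", C=1000, gamma=0.001),
    # Random forest with 50 decision trees.
    "RF": RandomForestClassifier(n_estimators=50),
    "LDA": LinearDiscriminantAnalysis(),
    "NB": GaussianNB(),
}
```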

3.3. Model Performance Evaluation

As mentioned above, the machine-learning models to be evaluated were composed of different feature extraction methods and classifiers. The available choices of the feature extraction methods are RD, PCA, DWT, KBM and FSE. Meanwhile, the recognition part should be one of these classifiers—RF, LDA, NB, SVM and KELM. Each model was tested on both datasets for recognition rates, running time and memory cost.
In terms of data usage, we adopted leave-one-out (LOO) [42] and four-fold cross-validation for our own dataset and the public benchmark, respectively. For our own dataset, we take eight of the nine samples from each category as training samples, while the remaining samples are used for validation; that is, the sizes of the training and validation sets are 56 and 7, respectively. Since there are nine different ways to select the training samples for each category, we calculated the average accuracy and associated standard deviation (STD) of these nine recognitions as the final scores of a model. As for the public benchmark, we divided the 114 samples into four groups (28, 29, 28 and 29 samples, respectively) and ensured that each group covered all the categories. Three of the four groups were used as training data while the remaining one was used as validation data, and the average accuracy and associated STD of these four recognitions were taken as the score of a model.
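For the public benchmark, the four-fold protocol can be sketched as follows; StratifiedKFold is our way of approximating the requirement that each group cover all categories, so the fold sizes may differ slightly from the authors' manual grouping of 28/29/28/29 samples.

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold

def evaluate(model, features, labels, n_splits=4, seed=0):
    accs = []
    skf = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=seed)
    for train_idx, val_idx in skf.split(features, labels):
        model.fit(features[train_idx], labels[train_idx])
        accs.append(np.mean(model.predict(features[val_idx]) == labels[val_idx]))
    # Average accuracy and associated STD over the folds.
    return np.mean(accs), np.std(accs)
```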
Table 1 lists the average accuracies and associated STDs of the different models. On our own dataset, FSE-KELM obtains the highest accuracy (95.24%), and FSE-SVM takes second place. FSE-RF is the best among the RF-based models, with a recognition performance fluctuating in the range of 61% to 81%. When the RD features are input to LDA and NB, no outputs can be obtained because the large feature size causes memory overflow. Compared with the other three classifiers, both LDA and NB perform poorly: 79.37% and 68.25% are the highest accuracies of LDA and NB, respectively. Both SVM and KELM show a dramatic decline in accuracy when coupled with RD, which may be caused by the high feature dimension. We also noticed that the recognition rates of the SVM-based models exceed those of the KELM-based ones when RD, PCA and KBM are used as the feature extraction approach. On the public benchmark, FSE-KELM and FSE-SVM still hold first and second place, improving by nearly 38% and 70% over RD-KELM and RD-SVM, respectively. Furthermore, FSE-LDA follows with an accuracy of 79.80%, even though LDA cannot work when coupled with RD. It is apparent that the models with KBM perform poorly owing to the impact of the common-mode components. Regarding the STD of accuracy, FSE performs robustly, with its STD varying from 4.00% to 15.22%. On our own dataset, FSE also obtained the lowest STD among the feature extraction methods when coupled with LDA, SVM and KELM (7.10%, 7.10% and 6.73%, respectively), and its other STD values remain in a moderate range. In general, we can draw some general observations from the recognition rates in Table 1:
(1)
The proposed FSE method is an effective feature extraction method for E-tongue identification. With the same classifier, FSE-based models achieve the highest accuracy in all scenarios.
(2)
The recognition rates of the KBM-based models fall sharply on the public benchmark, which indicates the importance of common-mode signal reduction. We believe the FSE method enhances the feature specificity via common-mode signal counteraction.
(3)
As for classifiers, KELM-based machine-learning models reached the highest accuracy six times in 10 scenarios (2 datasets × 5 feature extraction methods). This means that KELM has sufficient approximation and generalization ability for E-tongue recognition.

3.4. Time and Memory Cost Comparisons

In terms of execution time, less time implies lower energy consumption and faster detection for E-tongue devices. Table 2 provides the execution time of the different machine-learning models. Almost the same trends appear on both our own dataset and the public benchmark. The time consumption of the models with KBM and FSE is obviously shorter (by two orders of magnitude in most cases) than that of the other models. Although KBM is slightly faster than FSE, the accuracies of the KBM-based models are unstable. When using SVM, FSE is nearly 40 times faster than RD on our own dataset and 50 times faster than DWT on the public benchmark. Hence, FSE is the most suitable feature extraction method considering both accuracy and time consumption. Moreover, KELM requires the least execution time in most cases (seven out of 10 scenarios) among all the classifiers, which shows that KELM requires little computation while providing excellent classification results.
In addition, the details of the memory cost are shown in Figure 3. Regarding memory consumption, RD and PCA are significantly higher than the other three methods. When SVM is combined with RD, the memory consumption is the highest; for the other feature extraction methods, RF has the highest memory cost among the five classifiers. Note that when the raw features are used, the LDA and NB classifiers cannot work because the feature dimension is too high, so no records are shown for these two classifiers.

3.5. Parameter Sensitivity Analysis

3.5.1. Analysis of FSE Parameters

The kernel function width $\sigma$ is an adjustable parameter in the proposed FSE method. In order to observe the parameter sensitivity of FSE, we tuned $\sigma$ over the set $\{10^{-3}, 10^{-2}, \ldots, 10^{3}\}$ and recorded the accuracy on both datasets. The results are drawn in Figure 4a,b. In particular, NB cannot work on our own dataset at $\sigma = 10^{-3}$ or on the benchmark at $\sigma = 10^{-2}$, because parts of the features are equal to 0. For our own dataset, as shown in Figure 4a, FSE-RF is insensitive to the alteration of the width. FSE coupled with the other four classifiers shows the same tendency: the accuracy rises rapidly over the range $\sigma \in [10^{-3}, 10^{-1}]$ and then remains steady over the remaining range. For the public benchmark, as shown in Figure 4b, all the FSE-based methods have an early rising stage followed by stable curves in the subsequent range $\sigma \in [10^{-1}, 10^{3}]$. In short, the FSE method is strongly robust to parameter variation: in practice, when the kernel function width $\sigma$ lies in $[10^{-1}, 10^{3}]$, the recognition results are not sensitive to the parameter value.
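The $\sigma$ sweep described above can be reproduced with a loop like the following, reusing the fse_features and evaluate sketches from earlier sections; raw_samples and labels are placeholders for either dataset.

```python
import numpy as np

sigmas = [10.0 ** p for p in range(-3, 4)]   # {1e-3, 1e-2, ..., 1e3}
for sigma in sigmas:
    # Recompute the FSE features of every sample with the current kernel width.
    feats = np.vstack([fse_features(x, sigma) for x in raw_samples])
    acc, std = evaluate(classifiers["SVM"], feats, labels)
    print(f"sigma = {sigma:g}: accuracy = {acc:.3f} +/- {std:.3f}")
```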

3.5.2. Analysis of KELM Parameters

The regularization coefficient $\gamma$ and the kernel function width $\theta$ are the two adjustable parameters in KELM. To better observe the parameter sensitivity of KELM, we tuned both parameters over the set $\{10^{-3}, 10^{-2}, \ldots, 10^{3}\}$. The results for the regularization coefficient $\gamma$ on our own dataset and the public benchmark are presented in Figure 5a,b, from which the optimal $\gamma$ is clearly observed; thus, $\gamma = 1000$ was set empirically. Figure 5c,d show the results for the kernel function width $\theta$ on both datasets, and favorable performance is obtained when $\theta = 10$.

4. Conclusions

In this study, we have proposed a machine-learning model for E-tongue recognition. In the proposed model, the FSE method is designed for nonlinear feature extraction while KELM is adopted to accelerate the calculation of the model. In the experimental comparisons, the proposed FSE coupled with KELM achieves both the best accuracy and the best computational efficiency on the two datasets. It can be concluded that the common-mode signal counteraction between E-tongue sensors and the nonlinear projection make a great contribution to E-tongue classification. Meanwhile, KELM shows strong capabilities in data approximation, model generalization and computation-time reduction. We find that FSE is effective in dealing with high-dimensional data, especially LAPV signals, and that KELM can greatly promote the overall performance in accuracy and speed for E-tongue-based recognition.

Author Contributions

T.L. and Y.C. conceived and designed the experiments; D.L., T.Y. and J.C. performed the experiments; Y.C. analyzed the data; T.L. contributed reagents/materials/analysis tools; T.L. and Y.C. wrote the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (NSFC, 61403049), the Fundamental Research Funds for the Central Universities (No. 2018CDXYTX0010), 2018, P.R. China.

Conflicts of Interest

The authors declare no conflict of interest. The funding sponsors had no role in the design of the study; in the collection, analyses or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Legin, A.; Rudnitskaya, A.; Lvova, L.; Vlasov, Y.; Di Natale, C.; D'Amico, A. Evaluation of Italian wine by the electronic tongue: Recognition, quantitative analysis and correlation with human sensory perception. Anal. Chim. Acta 2003, 484, 33–44.
  2. Ghosh, A.; Bag, A.K.; Sharma, P.; Tudu, B.; Sabhapondit, S.; Baruah, B.D.; Tamuly, P.; Bhattacharyya, N.; Bandyopadhyay, R. Monitoring the Fermentation Process and Detection of Optimum Fermentation Time of Black Tea Using an Electronic Tongue. IEEE Sens. J. 2015, 15, 6255–6262.
  3. Verrelli, G.; Lvova, L.; Paolesse, R.; Di Natale, C.; D'Amico, A. Metalloporphyrin-Based Electronic Tongue: An Application for the Analysis of Italian White Wines. Sensors 2007, 7, 2750–2762.
  4. Tahara, Y.; Toko, K. Electronic Tongues—A Review. IEEE Sens. J. 2013, 13, 3001–3011.
  5. Krantz-Rülcker, C.; Stenberg, M.; Winquist, F.; Lundström, I. Electronic tongues for environmental monitoring based on sensor arrays and pattern recognition: A review. Anal. Chim. Acta 2001, 426, 217–226.
  6. Kirsanov, D.; Legin, E.; Zagrebin, A.; Ignatieva, N.; Rybakin, V.; Legin, A. Mimicking Daphnia magna bioassay performance by an electronic tongue for urban water quality control. Anal. Chim. Acta 2014, 824, 64–70.
  7. Wei, Z.; Wang, J. Tracing floral and geographical origins of honeys by potentiometric and voltammetric electronic tongue. Comput. Electron. Agric. 2014, 108, 112–122.
  8. Wang, L.; Niu, Q.; Hui, Y.; Jin, H. Discrimination of Rice with Different Pretreatment Methods by Using a Voltammetric Electronic Tongue. Sensors 2015, 15, 17767–17785.
  9. Apetrei, I.; Apetrei, C. Application of voltammetric e-tongue for the detection of ammonia and putrescine in beef products. Sens. Actuators B Chem. 2016, 234, 371–379.
  10. Ciosek, P.; Brzozka, Z.; Wróblewski, W. Classification of beverages using a reduced sensor array. Sens. Actuators B Chem. 2004, 103, 76–83.
  11. Domínguez, R.B.; Moreno-Barón, L.; Muñoz, R.; Gutiérrez, J.M. Voltammetric Electronic Tongue and Support Vector Machines for Identification of Selected Features in Mexican Coffee. Sensors 2014, 14, 17770–17785.
  12. Rodrigues, D.R.; De Oliveira, D.S.M.; Pontes, M.J.C.; Lemos, S.G. Voltammetric e-Tongue Based on a Single Sensor and Variable Selection for the Classification of Teas. Food Anal. Methods 2018, 11, 1958–1968.
  13. Palit, M.; Tudu, B.; Bhattacharyya, N.; Dutta, A.; Dutta, P.K.; Jana, A.; Bandyopadhyay, R.; Chatterjee, A. Comparison of multivariate preprocessing techniques as applied to electronic tongue based pattern classification for black tea. Anal. Chim. Acta 2010, 675, 8–15.
  14. Cetó, X.; Apetrei, C.; Del Valle, M.; Rodríguez-Méndez, M.L. Evaluation of red wines antioxidant capacity by means of a voltammetric e-tongue with an optimized sensor array. Electrochim. Acta 2014, 120, 180–186.
  15. Gutiérrez, M.; Llobera, A.; Ipatov, A.; Vila-Planas, J.; Mínguez, S.; Demming, S.; Büttgenbach, S.; Capdevila, F.; Domingo, C.; Jiménez-Jorquera, C. Application of an E-Tongue to the analysis of monovarietal and blends of white wines. Sensors 2011, 11, 4840–4857.
  16. Dias, L.; Peres, A.; Veloso, A.; Reis, F.S.; Vilas-Boas, M.; Machado, A. An electronic tongue taste evaluation: Identification of goat milk adulteration with bovine milk. Sens. Actuators B Chem. 2009, 136, 209–217.
  17. Ciosek, P.; Maminska, R.; Dybko, A.; Wróblewski, W. Potentiometric electronic tongue based on integrated array of microelectrodes. Sens. Actuators B Chem. 2007, 127, 8–14.
  18. Legin, A.; Bychkov, E.; Seleznev, B.; Vlasov, Y. Development and analytical evaluation of a multisensor system for water quality monitoring. Sens. Actuators B Chem. 1995, 27, 377–379.
  19. Erenas, M.M.; Pegalajar, M.; Cuéllar, M.; De Orbe-Payá, I.; Capitán-Vallvey, L. Disposable optical tongue for alkaline ion analysis. Sens. Actuators B Chem. 2011, 156, 976–982.
  20. Ariza-Avidad, M.; Cuéllar, M.; Salinas-Castillo, A.; Pegalajar, M.; Vuković, J.; Capitán-Vallvey, L. Feasibility of the use of disposable optical tongue based on neural networks for heavy metal identification and determination. Anal. Chim. Acta 2013, 783, 56–64.
  21. Ivarsson, P.; Holmin, S.; Höjer, N.-E.; Krantz-Rülcker, C.; Winquist, F. Discrimination of tea by means of a voltammetric electronic tongue and different applied waveforms. Sens. Actuators B Chem. 2001, 76, 449–454.
  22. Gutés, A.; Ibanez, A.B.; Del Valle, M.; Cespedes, F. Automated SIA E-tongue Employing a Voltammetric Biosensor Array for the Simultaneous Determination of Glucose and Ascorbic Acid. Electroanalysis 2010, 18, 82–88.
  23. Lvova, L.; Martinelli, E.; Dini, F.; Bergamini, A.; Paolesse, R.; Di Natale, C.; D'Amico, A. Clinical analysis of human urine by means of potentiometric Electronic tongue. Talanta 2009, 77, 1097–1104.
  24. Erenas, M.M.; Piñeiro, O.; Pegalajar, M.; Cuellar, M.; De Orbe-Payá, I.; Capitán-Vallvey, L. A surface fit approach with a disposable optical tongue for alkaline ion analysis. Anal. Chim. Acta 2011, 694, 128–135.
  25. Ciosek, P.; Wróblewski, W. Sensor arrays for liquid sensing—Electronic tongue systems. Analyst 2007, 132, 963.
  26. Winquist, F.; Wide, P.; Lundström, I. An electronic tongue based on voltammetry. Anal. Chim. Acta 1997, 357, 21–31.
  27. Lu, L.; Hu, X.; Tian, S.; Deng, S.; Zhu, Z. Visualized attribute analysis approach for characterization and quantification of rice taste flavor using electronic tongue. Anal. Chim. Acta 2016, 919, 11–19.
  28. Tian, S.-Y.; Deng, S.-P.; Chen, Z.-X. Multifrequency large amplitude pulse voltammetry: A novel electrochemical method for electronic tongue. Sens. Actuators B Chem. 2007, 123, 1049–1056.
  29. Ghosh, A.; Sharma, P.; Tudu, B.; Sabhapondit, S.; Baruah, B.D.; Tamuly, P.; Bhattacharyya, N.; Bandyopadhyay, R. Detection of Optimum Fermentation Time of Black CTC Tea Using a Voltammetric Electronic Tongue. IEEE Trans. Instrum. Meas. 2015, 64, 2720–2729.
  30. Palit, M.; Tudu, B.; Dutta, P.K.; Dutta, A.; Jana, A.; Roy, J.K.; Bhattacharyya, N.; Bandyopadhyay, R.; Chatterjee, A. Classification of black tea taste and correlation with tea taster's mark using voltammetric electronic tongue. IEEE Trans. Instrum. Meas. 2010, 59, 2230–2239.
  31. Cortes, C.; Vapnik, V. Support-vector networks. Mach. Learn. 1995, 20, 273–297.
  32. Huang, G.-B.; Zhu, Q.-Y.; Siew, C.-K. Extreme learning machine: Theory and applications. Neurocomputing 2006, 70, 489–501.
  33. Zhang, L.; Wang, X.; Huang, G.-B.; Liu, T.; Tan, X. Taste Recognition in E-Tongue Using Local Discriminant Preservation Projection. IEEE Trans. Cybern. 2018, 49, 947–960.
  34. Huang, G.-B.; Zhou, H.; Ding, X.; Zhang, R. Extreme learning machine for regression and multiclass classification. IEEE Trans. Syst. Man Cybern. Part B Cybern. 2012, 42, 513–529.
  35. Liu, T.; Chen, Y.; Li, D.; Wu, M. An Active Feature Selection Strategy for DWT in Artificial Taste. J. Sens. 2018, 2018, 9709505.
  36. Tian, S.; Xiao, X.; Deng, S. Sinusoidal envelope voltammetry as a new readout technique for electronic tongues. Microchim. Acta 2012, 178, 315–321.
  37. Adhikari, B.; Mahato, M.; Sinha, T.; Halder, A.; Bhattacharya, N. Development of novel polymeric sensors for taste sensing: Electronic tongue. In Proceedings of the 2013 IEEE SENSORS, Baltimore, MD, USA, 3–6 November 2013; pp. 1–4.
  38. Wold, S.; Esbensen, K.; Geladi, P. Principal component analysis. Chemom. Intell. Lab. Syst. 1987, 2, 37–52.
  39. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32.
  40. Li, T.; Zhu, S.; Ogihara, M. Using discriminant analysis for multi-class classification: An experimental investigation. Knowl. Inf. Syst. 2006, 10, 453–472.
  41. Friedman, N.; Geiger, D.; Goldszmidt, M. Bayesian Network Classifiers. Mach. Learn. 1997, 29, 131–163.
  42. Kearns, M.; Ron, D. Algorithmic Stability and Sanity-Check Bounds for Leave-One-Out Cross-Validation. Neural Comput. 1999, 11, 1427–1453.
Figure 1. Structure of developed large amplitude pulse voltammetry (LAPV) electronic tongue (E-tongue).
Figure 2. (a) Excitation signal of multiple LAPV (MLAPV). (b) Typical response of a working electrode.
Figure 3. Memory cost: (a) Our own dataset. (b) Public benchmark.
Figure 4. Sensitivity analysis of feature specificity enhancement (FSE) (σ): (a) Our own dataset. (b) Public benchmark.
Figure 5. Sensitivity analysis of the kernel extreme learning machine (KELM): (a) γ on our own dataset. (b) γ on public benchmark. (c) θ on our own dataset. (d) θ on public benchmark.
Table 1. Accuracy and standard deviation of models (%). Values are Acc./STD; "-" indicates the model could not run.

Dataset             Classifier   RD             PCA            DWT            KBM            FSE
Our own dataset     RF           74.60/17.82    61.90/17.53    77.78/7.10     71.43/15.06    80.95/13.47
                    LDA          -              46.03/16.19    53.97/16.19    60.32/17.53    79.37/7.10
                    NB           -              46.03/17.53    55.56/14.20    68.25/16.19    63.49/15.22
                    SVM          34.92/11.88    80.95/15.06    66.67/16.50    71.43/17.82    93.65/7.10
                    KELM         22.22/9.78     69.84/21.76    93.65/9.78     68.25/22.11    95.24/6.73
Public benchmark    RF           71.59/6.85     70.32/7.91     75.38/5.28     18.27/6.07     79.63/6.07
                    LDA          -              61.63/10.26    63.89/5.61     24.53/3.89     79.80/11.90
                    NB           59.28/12.11    63.38/9.19     66.70/3.17     24.38/8.25     76.34/8.04
                    SVM          16.76/2.14     60.66/5.62     69.36/3.85     24.53/3.89     86.73/4.00
                    KELM         50.93/3.61     72.67/11.31    78.76/4.40     29.88/3.06     88.65/4.36
Table 2. Time consumption of models. "-" indicates the model could not run.

Dataset             Classifier   RD          PCA         DWT         KBM        FSE
Our own dataset     RF           164.74 s    37.80 s     52.01 s     4.32 s     4.18 s
                    LDA          -           39.61 s     53.71 s     1.03 s     1.65 s
                    NB           -           38.31 s     50.84 s     0.45 s     0.53 s
                    SVM          10.64 s     33.31 s     47.92 s     0.30 s     0.45 s
                    KELM         6.40 s      37.50 s     53.77 s     0.27 s     0.31 s
Public benchmark    RF           25.08 s     11.19 s     97.98 s     2.26 s     2.62 s
                    LDA          -           9.16 s      91.28 s     0.78 s     1.67 s
                    NB           3.34 s      8.77 s      90.62 s     0.25 s     0.33 s
                    SVM          2.56 s      8.33 s      89.45 s     0.11 s     0.17 s
                    KELM         0.95 s      7.89 s      90.32 s     0.09 s     0.16 s
