Next Article in Journal
Numerical and Non-Asymptotic Analysis of Elias’s and Peres’s Extractors with Finite Input Sequences
Next Article in Special Issue
Some Consequences of the Thermodynamic Cost of System Identification
Previous Article in Journal
Entropy, or Information, Unifies Ecology and Evolution and Beyond
Previous Article in Special Issue
Quantum Quantifiers for an Atom System Interacting with a Quantum Field Based on Pseudoharmonic Oscillator States
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Image Thresholding Segmentation on Quantum State Space

1
School of Information Technology, Luoyang Normal University, Luoyang 471934, China
2
School of Information Engineering, Henan University of Science and Technology, Luoyang 471023, China
*
Author to whom correspondence should be addressed.
Submission received: 5 August 2018 / Revised: 11 September 2018 / Accepted: 20 September 2018 / Published: 23 September 2018
(This article belongs to the Special Issue Entropy in Foundations of Quantum Physics)

Abstract

:
Aiming to implement image segmentation precisely and efficiently, we exploit new ways to encode images and achieve the optimal thresholding on quantum state space. Firstly, the state vector and density matrix are adopted for the representation of pixel intensities and their probability distribution, respectively. Then, the method based on global quantum entropy maximization (GQEM) is proposed, which has an equivalent object function to Otsu’s, but gives a more explicit physical interpretation of image thresholding in the language of quantum mechanics. To reduce the time consumption for searching for optimal thresholds, the method of quantum lossy-encoding-based entropy maximization (QLEEM) is presented, in which the eigenvalues of density matrices can give direct clues for thresholding, and then, the process of optimal searching can be avoided. Meanwhile, the QLEEM algorithm achieves two additional effects: (1) the upper bound of the thresholding level can be implicitly determined according to the eigenvalues; and (2) the proposed approaches ensure that the local information in images is retained as much as possible, and simultaneously, the inter-class separability is maximized in the segmented images. Both of them contribute to the structural characteristics of images, which the human visual system is highly adapted to extract. Experimental results show that the proposed methods are able to achieve a competitive quality of thresholding and the fastest computation speed compared with the state-of-the-art methods.

1. Introduction

Image segmentation is the task of dividing the image into different regions, each one of which ideally belongs to the same object or content. As a key step from image processing to computer vision, image segmentation is the target expression and has an important effect on the feature measurement, high-level image analysis and understanding [1,2]. Examples of image segmentation applications include medical imaging [3,4], document image analysis [5], object recognition [6,7] and quality inspection of materials [8,9]. In the last two decades, a wide variety of segmentation techniques have been developed, which conventionally fall into the following two categories [2]: layer-based and block-based segmentation methods [10,11]. Among all these techniques, the thresholding methods offer numerous advantages such as smaller storage space, fast processing and ease in manipulation.
In general, thresholding methods can be classified into parametric and nonparametric approaches [12]. Parametric approaches assume that the intensity distributions of images obey the Gaussian mixture (GM) model, which means the number and parameters of Gaussians in the mixture (the model selection) must be determined [13]. Although these problems have been traditionally solved by considering the expectation maximization (EM) algorithm [14] or gradient-based methods [15,16], the methods are time consuming. Nonparametric approaches find the thresholds that separate regions of an image in an optimal manner based on discriminating criteria such as the between-class variance [17], cluster distance [18], entropy [19,20,21,22], etc. Nonparametric methods have shown the advantage of dispensing with the modeling thresholding. However, they still suffer from the problem of high time consumption, although many techniques based on intelligent optimization algorithms (IOAs) [23,24,25] have been used to speed up the thresholding procedure.
Quantum computation and quantum information processing techniques have shown an immense potential and a revolutionary impact on the field of computer science, due to their remarkable resources: quantum parallelism, quantum interference and entanglement of quantum states. Information representing and processing in the framework of quantum theory is powerful for solving complex problems that are difficult or currently even impossible for conventional methods. The most significant works include Shor’s quantum integer factoring algorithm, which can find the secret key encryption of the RSA algorithm in polynomial time [26], and Grover’s quantum search algorithm for databases, which could achieve quadratic speedup [27]. In the recent years, quantum approaches have been introduced into the image processing field. Various quantum image representation models have been proposed, such as qubit lattice [28] and flexible representation of quantum images (FRQI) [29]. Meanwhile, several applications of quantum image processing have been researched including quantum image segmentation [30], quantum edge detection [31], quantum image recognition [32], quantum image watermarking [33] and quantum image reconstruction [34]. Though the research in quantum image processing still confronts fundamental aspects such as image representation on a quantum computer and the definition of basic processing operations, we still could be inspired to completely exploit new methods for some classical problems from a quantum information theoretical viewpoint.
In this paper, we address the thresholding problem on quantum state space. The proposed methods relate to the details of image representation by utilizing the density matrix, optimal threshold selection based on the criteria of the maximum von Neumann entropy, a novel image encoding scheme and the corresponding segmentation approaches, which can totally avoid the process of optimal solution searching. Specifically, the contributions of this paper mainly include the following aspects:
(1)
We present an image thresholding method based on the criteria of global quantum entropy maximization (GQEM), which has an equivalent object function to Otsu’s, but gives more explicit physical interpretation of image thresholding in the language of quantum mechanics.
(2)
The quantum lossy-encoding based entropy maximization (QLEEM) approach is proposed to deal with the time consumption problem of thresholding. The QLEEM algorithm directly takes the eigenvalues of density matrices of lossy-encoded images as segmenting clues and then avoids the time-consuming process of searching for optimal thresholds. It can achieve the highest execution speed compared with the state-of-the-art methods.
(3)
Due to the physical meaning of the lossy-encoding scheme and the unique procedure of optimal thresholding, a brand-new approach to determine the upper bound of the thresholding level automatically is offered in the proposed QLEEM algorithm. For most of the existing methods, this parameter is conventionally predetermined according to empirical knowledge.
(4)
The QLEEM method provides the maximum inter-class separability with lower loss of intra-class information; thus, segmented images could keep more structural information. This feature is highly consistent with the way the human visual system (HVS) works.
The paper is organized as follows: Section 2 gives a brief description of the image thresholding and introduces some state-of-the-art thresholding methods including Otsu’s between-class variance method [17], Kapur’s entropy-criterion method [19], the quantum version of Kapur’s method [35], and Tsallis entropy-based method [22]. Section 3 introduces the details of the proposed methods. Section 4 provides the experimental results and discussions about our method’s performance. The conclusions of this study are drawn in the last part of this paper.

2. Related Works

Thresholding is a process in which a group of thresholds is selected under some criteria, and then, pixels of an image are divided into a series of sets or classes according to the rule of:
l C i i f t h i 1 l < t h i ,
where l [ 0 , L 1 ] represents the intensity level of image pixels, { t h i i = 1 , 2 , , M 1 } is the set of thresholds and { C i i = 1 , 2 , , M } are classes labeling different groups of pixels.
Otsu’s between-class variance method [17] selects the optimal thresholds by maximizing the following object function:
σ 2 c = i , j ω i ω j ( μ i μ j ) 2 .
Here, i and j index the intensity classes, and ω i and μ i are the probability of occurrence and the mean of a class, respectively. Such values are obtained as:
ω i = j = t h i 1 + 1 t h i p j , μ i = j = t h i 1 + 1 t h i q j j .
where p j denotes the probability distribution of pixels and q j = p j / ω i . As we know, Otsu’s method can achieve the best segmenting results if no contextual or semantic information is considered, but it suffers from the drawback of time-consuming searching for optimal thresholds.
Kapur presented another discriminant criterion based on maximum entropy [19]:
arg max T H i = 0 M 1 H ( C i ) .
where H ( C i ) is the Shannon entropy corresponding to a specific class, which is defined as:
H ( C i ) = j = t h i 1 + 1 t h i q j log q j .
Similarly, the quantum version of Kapur’s method [35] determines the optimal thresholds by maximizing the von Neumann entropy:
arg max T H i = 0 M 1 S ( ρ i ) .
where:
ρ i = j = t h i 1 + 1 t h i q j θ j θ j
is the density matrix representation of the i-th class and:
S ( ρ i ) = t r ( ρ i log ρ i ) .
Recently, the Tsallis entropy-based bi-level thresholding method was proposed [22], in which the optimal threshold is given by:
t ( q ) = arg max t S T A ( t ) + S T B ( t ) + ( 1 q ) S T A ( t ) S T B ( t ) .
Here, S T A ( t ) and S T B ( t ) represent the Tsallis entropy for object A and the background B, respectively, and the entropic index q can be calculated through q-redundancy maximization.
The effectiveness of these entropy-based methods has been proven. However, similar to Otsu’s method, they also have the drawback of high computational complexity, which will affect the efficiency of the whole vision task.

3. Proposed Methods

In this section, we will start with a new method, which utilizes the criteria of global quantum entropy maximization to achieve optimal thresholding, and then propose a novel encoding scheme. Based on this scheme, the improved method for thresholding is derived, which can determine optimal thresholds with linear time complexity.

3.1. Thresholding Based on Global Quantum Entropy Maximization

For an image, we can represent its histogram with the following entangled state of a composite quantum system:
I = i = 0 L 1 p i θ i i .
where we encode the i-th intensity level to the vector θ i = c o s θ i 0 + s i n θ i 1 , which belongs to the state space of the first one-qubit subsystem (labeled as “A”), by establishing a bijective relationship between them, namely:
θ i = π 2 · i L 1 , i [ 0 , L 1 ] ,
and i is the computational basis state of the second subsystem (labeled as “B”), which denotes the indices of pixel intensities. Though I is a pure state, the subsystem A or B is in a mixed state. Therefore, we describe these quantum systems in the language of the density matrix. Assuming I is rewritten as ρ A B , then the reduced density matrix for the subsystem A can be defined by:
ρ = t r B ( ρ A B ) = i = 0 L 1 p i θ i θ i .
The density matrix ρ contains the information about the distance between any two intensities, as well as their probability distribution. This property will be very useful for thresholding.
If pixels of an image are divided into M classes by using M-1 thresholds, we represent the histogram of the segmented image with:
I = i = 0 M 1 ( ω i θ ˜ i j = t h i 1 + 1 t h i q j i ) ,
where θ ˜ i = π 2 · μ i L 1 , ω i and μ i are defined in Equation (3). Then, the density matrix of the subsystem A becomes:
ρ = i = 0 M 1 ω i θ ˜ i θ ˜ i ,
and the von Neumann entropy of ρ :
S ( ρ ) = t r ( ρ log ρ ) = λ 1 log λ 1 λ 2 log λ 2
can quantify how much information is retained in the segmented image; where λ 1 and λ 2 are the eigenvalues of ρ . As a result, we maximize it to determine the optimal thresholds:
T H o p = arg max T H S ( ρ ) .
According to Equations (14) and (15), the following equation is established through simple algebraic computations:
λ 1 λ 2 = 1 2 i = 0 , j = 0 M 1 ω i ω j s i n 2 ( θ ˜ i θ ˜ j ) ,
where λ 1 + λ 2 = 1 , as the restriction must be held.
It is worthwhile to note that Equation (17) can also be used to evaluate thresholding: when Equation (17) takes the maximum value, λ 1 and λ 2 will be most similar to each other, and then, S ( ρ ) also reaches its best value. Meanwhile, Equation (17) indicates that the distance between intensities s i n 2 ( θ ˜ i θ ˜ j ) , as well as the probability distribution ( ω i , ω j ) affect the thresholding results.
Different from Kapur’s entropy-based method and its quantum version, our method has more explicit physical meaning for thresholding in terms of the following features:
(1)
Encoding pixel intensities on the state space of a one-qubit system can be considered as a process in which independent intensities are squeezed into a two-dimensional space. The similarity between different state vectors, as well as its probability distribution, can be described with the density matrix. Both factors contribute to thresholding.
(2)
According to the fundamental principles of information theory, the image segmenting process will causes the decrease of the information contained in images. Shannon entropy cannot directly be used to measure the information losses because it quantifies the amount of information on spaces with different dimensionality for original and segmented images. On the contrary, our method encodes the histograms of original and segmented images on the same quantum state space, which indicates that their entropies are comparable. As a result, the trivial solutions for segmentation, for example the thresholds equally dividing intensities into clusters with the same probability, could never appear since the entropy of the original image acts as the upper bound of our object function for all possible solutions.
(3)
From Equation (17), we find that the object function of our method is very similar to Otsu’s, described in Equation (2). The following experimental results will prove that they both achieve the best thresholding.

3.2. Quantum Lossy-Encoding-Based Entropy Maximization Method

As we have seen in Section 3.1, the proposed thresholding method derived from the viewpoint of quantum principles can achieve the best segmenting results similar to Otsu’s. However, it still suffers from the efficiency problem of searching for optimal thresholds. In this subsection, we present another way for image thresholding on the quantum state space.

3.2.1. Quantum Lossy Encoding of Images

Different from the precedent method, we map the pixel intensities to quantum state vectors according to the following rules:
(1)
Multiple qubits should be required for encoding intensity levels in accordance with the prospective number of thresholds. In other words, the state vectors are supposed to belong to an M-dimensional space if we want the M-level segmentation.
(2)
The angle parameter of state vectors ranges from zero to M · π instead of π / 2 . Namely, θ i = M π i / L .
(3)
After encoding, the terms contributing to density matrices should follow a π -periodic cyclical pattern. Namely, θ θ = θ + π θ + π .
Rule (1) provides the foundation for dividing pixel intensities into M classes, being linearly independent of each other. Rules (2) and (3) indicate that all state vectors representing pixel intensities are equally divided into M classes, and the corresponding density matrix:
ρ ˜ = i = 0 N 1 ( j = 0 M 1 p N · j + i ) θ i θ i , θ i [ 0 , π ] , N = L / M ,
only measures the information related to the local or intra-class uncertainty contributed by those adjoining intensity levels, but removes the global or inter-class information provided by those intensities far apart from each other.
According to the above rules, an alternative encoding scheme is given in the recursive form of:
θ i 2 = c o s θ i 0 + s i n θ i 1 , θ i = 2 π i / L θ i 3 = c o s θ i c o s 2 θ i 0 + c o s θ i s i n 2 θ i 1 + s i n θ i 2 , θ i = 3 π i / L θ i M = c o s θ i 2 θ i M 1 + s i n θ i M 1 , θ i = M π i / L
where the superscript M is temporarily borrowed to label the dimensionality of state vectors and i [ 0 , L 1 ] denote pixel intensities. As an example, the traces of encoded state vectors in the 2D and 3D case are shown in Figure 1.
Differing from ordinary encoding practices, the proposed scheme records local information of images, but removes the global information. More precisely, the following evidence could be verified in the 2D case: we divide intensity levels into two classes equally and equivalently quantify the amount of information with the product of eigenvalues of ρ ˜ :
λ 1 λ 2 = 1 2 i = 0 , j = 0 L 1 p i p j s i n 2 ( θ i θ j ) = 1 2 ( i = 0 , j = 0 L / 2 1 p i p j s i n 2 ( θ i θ j ) + i = L / 2 , j = L / 2 L 1 p i p j s i n 2 ( θ i θ j ) ) + i = 0 L / 2 1 j = L / 2 L 1 p i p j s i n 2 ( θ i θ j ) .
We note that the first term on the right of Equation (20) measures the local information (intra-class uncertainty) contributed by intensities in the same class, and the second term counts the global information (inter-class uncertainty) provided by intensities in different classes. Meanwhile, it is easy to verify that the values of the two terms will increase and decrease respectively when θ covers [ 0 , 2 π ] instead of [ 0 , π / 2 ] .

3.2.2. The QLEEM Method

Intuitively, the intensities far apart from each other and their probability distribution provide the evidence of thresholding. Therefore, we rewrite the density matrix of the given histogram in a decomposed form:
ρ = υ 1 ρ 1 + υ 2 ρ 2 .
where ρ 1 and ρ 2 describe the probability distributions of local and remote intensity levels (that is, intra-class and inter-class uncertainty), respectively. Meanwhile, as there is no more knowledge about υ 1 and υ 2 except υ 1 + υ 2 = 1 , we assume υ 1 = υ 2 = 1 / 2 according to the foundational principle of the entropy theory.
Now, we substitute ρ 1 with ρ ˜ given by the proposed lossy encoding scheme, since it contains the information contributed by local uncertainty of intensity vectors, and maximize the von Neumann entropy of Equation (21) for determining optimal thresholds:
T H o p = arg max S ( 1 2 ρ ˜ + 1 2 ρ ^ ) .
Here, we adopt orthogonal state vectors in M-dimensional space representing M classes after thresholding, since we want these intensity classes to be as independent as possible. Let:
ρ ˜ = i = 0 M 1 λ 1 , i θ 1 , i θ 1 , i , ρ ^ = i = 0 M 1 λ 2 , i θ 2 , i θ 2 , i
be orthonormal decompositions for the states ρ ˜ and ρ ^ , then for any one eigenvector of ρ ˜ denoted with | θ 1 , j > , there must exist an eigenvector of ρ ^ named | θ 2 , i > satisfying the relationship of | θ 2 , i > = ± | θ 1 , j > when S ( ( ρ ˜ + ρ ^ ) / 2 ) takes the max value. Meanwhile, the eigenvalues of the state can be determined according to the following equation:
λ 2 , i = 2 M λ 1 , j , i f θ 2 , i = ± θ 1 , j .
For the sake of representation, here we give the evidence of the above conclusion for the 2D situation. Assuming λ 1 and λ 2 are eigenvalues of the state ( ρ ˜ + ρ ^ ) / 2 , its entropy will take the maximum value if we equivalently maximize:
λ 1 λ 2 = λ 1 , 0 λ 2 , 0 s i n 2 ( θ 1 , 0 θ 2 , 0 ) + λ 1 , 0 λ 2 , 1 s i n 2 ( θ 1 , 0 θ 2 , 1 ) + λ 1 , 1 λ 2 , 0 s i n 2 ( θ 1 , 1 θ 2 , 0 ) + λ 1 , 1 λ 2 , 1 s i n 2 ( θ 1 , 1 θ 2 , 1 ) + λ 1 , 0 λ 1 , 1 + λ 2 , 0 λ 2 , 1
Notice that θ 1 , 0 | θ 1 , 1 = 0 and θ 1 , 0 | θ 1 , 1 = 0 must hold. Then:
λ 1 λ 2 = ( λ 1 , 0 λ 2 , 0 + λ 1 , 1 λ 2 , 1 ) s i n 2 ( θ 1 , 0 θ 2 , 0 ) + ( λ 1 , 0 λ 2 , 1 + λ 1 , 1 λ 2 , 0 ) c o s 2 ( θ 1 , 0 θ 2 , 0 ) + λ 1 , 0 λ 1 , 1 + λ 2 , 0 λ 2 , 1
will take the extremum when θ 1 , 0 | θ 1 , 1 = 0 o r 1 . In other words,
θ 2 , 0 = ± θ 1 , 0 θ 2 , 1 = ± θ 1 , 1 o r : θ 2 , 0 = ± θ 1 , 1 θ 2 , 1 = ± θ 1 , 0
must hold. Without loss of generality, we adopt the first case of Equation (27) for the succeeding discussions. Then:
λ 1 λ 2 = λ 1 , 0 λ 2 , 1 + λ 1 , 1 λ 2 , 0 ) + λ 1 , 0 λ 1 , 1 + λ 2 , 0 λ 2 , 1 = λ 2 , 0 2 + ( 1 + λ 1 , 0 λ 1 , 1 ) λ 2 , 0 + λ 1 , 0 λ 1 , 1 + λ 1 , 1
will reach its maximum value when:
λ 2 , 0 = ( 1 + λ 1 , 1 λ 1 , 0 ) / 2 = 2 M λ 1 , 2 = λ 1 , 1 .
The above conclusions have the instructive function for thresholding, which can be seen in two aspects:
(1)
Based on the proposed lossy-encoding scheme, we can directly calculate the eigenvalues of ρ ^ according to Equation (24), which represent the probability distribution of intensity classes after thresholding, and then determine the optimal threshold values.
(2)
As the probability with which any one intensity class occurs must be greater than zero, according to Equation (24), all eigenvalues of the density matrix ρ ˜ would satisfy the condition of λ 1 , i < 2 / M . Otherwise, λ 1 , i 2 / M indicates that there exist meaningless and unnecessary classes for segmentation. In summary, the upper bound of the thresholding level can be determined using our method. This feature implies that our method is more feasible than the most of the other existing ones, since the thresholding level, as a hyperparameter, is often predetermined empirically.
Finally, the optimal thresholds T H = t h 1 , t h 2 , , t h M 1 can be determined according to the following relationships:
i = 0 t h 1 p i λ 0 < i = 0 t h 1 + 1 p i i = t h M 2 + 1 t h M 1 p i λ M 1 < i = t h M 2 + 1 t h M 1 + 1 p i
where λ 0 , λ 1 , , λ M 1 is the sequence taken from the eigenvalue set of ρ ^ , and the corresponding sequence θ 0 , θ 1 , , θ M 1 belongs to the circular permutation of all eigenvectors, which satisfy the following rules:
θ i = arg max j | θ j | 0 | θ ( i + 1 ) m o d M = arg max j | θ j | 1 | θ ( i + M 1 ) m o d M = arg max j | θ j | M 1 | .
According to the methods mentioned above, the framework of the QLEEM algorithm is given in Algorithm 1.
Algorithm 1 The framework of the QLEEM algorithm
Input: The original image I, the thresholding level M
Output: The optimal thresholds
Init: Compute the histogram of the input image;
Step 1: Obtain density matrix ρ ˜ by using the lossy-encoding scheme;
Step 2: Calculate the eigenvalues and eigenvectors of ρ ˜ and then ρ ^
Step 3: Enumerate all possible M circular sequences of the eigenvalues of ρ ^ , and then get M groups of thresholds;
Step 4: loop over the M groups of thresholds, and select the optimal one based on which the entropy denoted in Equation (15) takes the maximum value.

3.2.3. Time Complexity of the QLEEM Algorithm

For the problem of M-level thresholding segmentation of images containing L-level intensities, the time of calculating the density matrix ρ ˜ is O ( L ) ; computing eigenvalues and eigenvectors of ρ ˜ needs O ( M 3 ) ; the time for performing Step 3 is O ( M ! + L ) ; and the loop in Step 4 consumes O ( M 2 3 ) time. Since M < < L is satisfied in general cases, the optimal performance time of the QLEEM algorithm is achieved by T = O ( L ) , which notably outperforms Otsu’s T = O ( A L 1 M 1 / 2 M 2 ) .

4. Experiments and Comparisons

4.1. Datasets and Settings

To evaluate the performance of the proposed methods, a set of standard test images was obtained from the Berkeley segmentation dataset [36]. All of the test images are 8-bit in depth, with a size of 481 × 321 pixels. The algorithms used for comparison are Otsu’s between-class variance method [17], Kapur’s entropy criterion method [19], the quantum version of Kapur’s [35] and our GQEM and QLEEM methods. These algorithms are implemented with MathWorks MATLAB 2014a on a Thinkpad notebook with an Intel Core-i5 2.2-GHz processor, 16 GB RAM and Ubuntu 14.04.
Threshold levels, quality of segmented images and time complexity are the most important indicators for evaluating the performance of image thresholding algorithms. Here, we evaluate the quality of segmented images by using the peak signal-to-noise ratio (PSNR) and structural similarity (SSIM). In addition, four measures: the Dice similarity coefficient (DICE) [37], the probabilistic rand index (PRI) [38], the global consistency error (GCE) [36] and the variation of information (VI) [39], are used to assess segmentations against ground truth data. Time complexity is measured by the execution time required in these methods. In particular, except for the proposed QLEEM, all the other exhaustive-search-based methods used in our experiments are sped up with the harmony search multithresholding algorithm (HSMA) [25].

4.2. Experimental Results and Comparisons

We applied these algorithms to all 300 pictures contained in the standard test dataset for assessing their performance. For the sake of representation, only five images, which are presented in Figure 2, have been used to show the bi-level segmented results. In Figure 3, the thresholding quality of the outcomes is analyzed considering the complete set, where the PSNR and SSIM scores are calculated under different thresholding levels, and we take the average values on the whole dataset.
Meanwhile, we recorded the CPU time consumed by these algorithms, and the average values for all the test images under different thresholding levels are depicted in Figure 4. As an example, the experimental results in terms of thresholding level, thresholds and CPU time are tabulated in Table 1 for a randomly selected image.
From Figure 2, we find that the segmentations obtained by using GQEM, QLEEM and Otsu are visually indistinguishable, which means these three methods have a similar performance. This conclusion can be further confirmed in Figure 3: the GQEM method obtains almost the same PSNR score as Otsu’s in spite of very little computational error; meanwhile, both GQEM and QLEEM outperform the others in terms of SSIM. The experimental results can be explained with the criteria of maximizing quantum entropy and the lossy-encoding scheme proposed in our methods, because they emphasize the weight of between-class variance and retain the local information, respectively. This feature is highly consistent with the SSIM method, which assesses the perceived quality of images based on structural similarity indicators, such as contrast and local inter-dependencies of pixels.
Examining Figure 4 and Table 1, we can see that the proposed QLEEM algorithm achieves the fastest execution speed (at least 100-times faster than Otsu in the case of bi-level thresholding and up to 350-times when the number of thresholds increases to five). In addition, the time consumption of QLEEM was insensitive to increments of the threshold level, since the complexity of our algorithm was mainly correlated with the total intensity level, instead of the amount of thresholds.
On the other hand, the upper bounds of the thresholding level recommended by the proposed QLEEM algorithm were tested. We found that the maximum possible amount of thresholds was lower than 10 for about 40 images in the test set. Our algorithm would terminate when we try to apply more thresholds to them. Figure 5 lists two groups of images and corresponding histograms, for which the proposed algorithm gave one and two thresholds, respectively. According to the visual observation, it is reasonable to believe that the suggested amounts of thresholds are feasible, as there are no more than three distinct peaks in their histograms.
Finally, we evaluate segmentations against the ground truth data. The first experiment is performed on a synthetic image corrupted by Gaussian noise (the mean value is zero, and the variance is 0.03), which is utilized for testing the efficiency and robustness of the proposed methods. Figure 6 shows the noisy image and segmentation results obtained by different algorithms. In addition, the performance indexes: the DICE ratio, PRI, GCE and VI scores, are used to assess the robustness of these algorithms. The corresponding scores are listed in Table 2.
The visual comparison in Figure 6 shows that the proposed GQEM and QLEEM algorithms produce clearer and more accurate segmentation results. From Table 2, we can confirm this conclusion: our GQEM clearly outperformed the others on the DICE, PRI, GCE and VI values. The robustness of the proposed GQEM for noisy images can be explained by comparing the object function of GQEM and Otsu. Considering the last term in Equations (2) and (17), both of them measure the distance between pixel intensities, but our GQEM method scaled the range [ 0 , L 1 ] of this parameter down to [ 0 , 1 ] . This feature is helpful for suppressing the high contrast caused by noise, and then, our GQEM algorithm partly played the role of a low-pass filter in segmentation tasks.
In the second experiment, we performed thresholding segmentation on BSDS300 dataset and compared the results with the ground truth segmentations in terms of the DICE, PRI, GCE and VI indexes. The average scores of these indicators obtained by different algorithms are presented in Table 3.
From Table 3, we can see that all the listed algorithms obtained lower scores compared with those that have been well trained with the manually-labeled dataset. In general, thresholding segmentation is a form of unsupervised segmentation, which cannot use any a priori knowledge involving the ground truth of a training set of images. Furthermore, the proposed GQEM and QLEEM along with the others used for comparison are all histogram-based algorithms. They achieve optimal segmentation by merely utilizing the probability distribution of colors, instead of the spatial and texture information.

5. Conclusions

In this paper, we address the image thresholding problem on quantum state space. The proposed GQEM and QLEEM methods follow a different way to represent images and determine the optimal thresholds in the language of quantum mechanics. In summary, the contributions of this paper mainly include the following aspects: (1) To our knowledge, this is the first application of the global quantum entropy criteria to the thresholding problem. The von Neumann entropy is more powerful for image segmentations than the Shannon entropy, because it measures the distance between pixel intensities, as well as the probability distribution. (2) Compared with other state-of-the-art approaches, our QLEEM algorithm tends to retain more structural information after segmentations. It is highly consistent with the way in which the HVS works. (3) The proposed QLEEM algorithm has the lowest consumption of execution times known to us, even compared with others that are sped up with some intelligent optimization techniques.

Author Contributions

Conceptualization, X.W. Formal analysis, X.W. Methodology, X.W. and C.Y. Project administration, C.Y. Validation, G.-S.X. Writing, original draft, X.W. Writing, review and editing, C.Y. and Z.L.

Funding

This work was supported in part by the National Natural Science Foundation of China (Nos. 61702163, U1504610).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Bhat, M. Digital image processing. Int. J. Sci. Technol. 2014, 3, 272–276. [Google Scholar]
  2. Zaitoun, N.M.; Aqel, M.J. Survey on image segmentation techniques. Procedia Comput. Sci. 2015, 65, 797–806. [Google Scholar] [CrossRef]
  3. Saha, S.; Bandyopadhyay, B. Automatic MR brain image segmentation using a multiseed based multiobjective clustering approach. Appl. Intell. 2011, 35, 411–427. [Google Scholar] [CrossRef]
  4. Iakovidis, D.; Savelonas, M.; Karkanis, S. A genetically optimized level set approach to segmentation of thyroid ultrasound images. Appl. Intell. 2007, 27, 193–203. [Google Scholar] [CrossRef] [Green Version]
  5. Kamel, M.; Zhao, A. Extraction of binary character/graphics images from grayscale document images. Graph. Models Image Process. 1993, 55, 203–217. [Google Scholar] [CrossRef]
  6. Vijayalakshmi, S.; Durairaj, D.C. Use of multiple thresholding techniques for moving object detection and tracking. Int. J. Comput. Appl. 2013, 80, 1–7. [Google Scholar] [CrossRef]
  7. Valova, I.; Milano, G.; Bowen, K.; Gueorguieva, N. Bridging the fuzzy, neural and evolutionary paradigms for automatic target recognition. Appl. Intell. 2011, 35, 211–225. [Google Scholar] [CrossRef]
  8. Yang, M.-D.; Su, T.-C.; Pan, N.-F.; Yang, Y.-F. Systematic image quality assessment for sewer inspection. Expert Syst. Appl. 2011, 38, 1766–1776. [Google Scholar] [CrossRef]
  9. Goumas, S.K.; Dimou, L.N.; Zervakis, M.E. Combination of multiple classifiers for post-placement quality inspection of components: A comparative study. Inf. Fusion 2010, 11, 149–162. [Google Scholar] [CrossRef]
  10. Sasirekha, D.; Chandra, D.E. Enhanced techniques for PDF image segmentation and text extraction. Int. J. Comput. Sci. Inf. Secur. 2012, 10, 1–5. [Google Scholar]
  11. Yang, Y.; Hallman, S.; Ramanan, D.; Fowlkes, C.C. Layered object models for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2012, 34, 1731–1743. [Google Scholar] [CrossRef] [PubMed]
  12. Ma, Z.; Tavares, J.; Jorge, R. A review of algorithms for medical image segmentation and their applications to the female pelvic cavity. Comput. Methods Biomech. Biomed. Eng. 2010, 13, 235–246. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Cuevas, E.; Zaldivar, D.; Sossa, H. A multi-threshold segmentation approach based on Artificial Bee Colony optimization. Appl. Intell. 2012, 37, 321–336. [Google Scholar] [CrossRef] [Green Version]
  14. Zhang, Z.; Chen, C.; Sun, J. EM algorithms for Gaussian mixtures with split-and-merge operation. Pattern Recognit. 2003, 36, 1973–1983. [Google Scholar] [CrossRef] [Green Version]
  15. Lu, Z. Entropy regularized likelihood learning on Gaussian mixture: two gradient implementations for automatic model selection. Neural Process. Lett. 2007, 25, 17–30. [Google Scholar] [CrossRef]
  16. Ma, J.; Wang, T.; Xu, L. A gradient BYY harmony learning rule on Gaussian mixture with automated model selection. Neurocomputing 2004, 56, 481–487. [Google Scholar] [CrossRef] [Green Version]
  17. Otsu, N. A threshold selection method from gray level histograms. IEEE Trans. Syst. Man Cybern. 1979, 9, 62–66. [Google Scholar] [CrossRef]
  18. Farshi, T.P.; Demirci, R.; Feizi-Derakhshi, M.R. Image clustering with optimization algorithms and color space. Entropy 2018, 20, 296. [Google Scholar] [CrossRef]
  19. Kapur, J.; Sahoo, P.; Wong, A. A new method for gray-level picture thresholding using the entropy of the histogram. Comput. Vis. Gr. Image Process. 1985, 29, 273–285. [Google Scholar] [CrossRef]
  20. Pare, S.; Kumar, A.; Bajaj, V. An efficient method for multilevel color image thresholding using cuckoo search algorithm based on minimum crossentropy. Appl. Soft Comput. 2017, 61, 570–592. [Google Scholar] [CrossRef]
  21. Liang, Y.C.; Cuevas, J.R. An automatic multilevel image thresholding using relative entropy and meta-heuristic algorithms. Entropy 2013, 15, 2181–2209. [Google Scholar] [CrossRef]
  22. Ramirez-Reyes, A.; Hernandez-Montoya, A.R.; Herrera-Corral, G.; Dominguez-Jimenez, I. Determining the entropic index q of Tsallis entropy in images through redundancy. Entropy 2016, 18, 299. [Google Scholar] [CrossRef]
  23. Ye, Z.; Yin, H.; Ye, Y. Comparative analysis of two leading evolutionary intelligence approaches for multilevel thresholding. Int. J. Signal Imaging Syst. Eng. 2018, 11, 20–30. [Google Scholar] [CrossRef]
  24. Dehshibi, M.M.; Sourizaei, M. A hybrid bio-inspired learning algorithm for image segmentation using multilevel thresholding. Multimedia Tools Appl. 2017, 76, 15951–15986. [Google Scholar] [CrossRef]
  25. Oliva, D.; Cuevas, E.; Pajares, G. Multilevel thresholding segmentation based on harmony search optimization. J. Appl. Math. 2013, 2013, 1–24. [Google Scholar] [CrossRef]
  26. Shor, W.P. Algorithms for quantum computation: Discrete logarithms and factoring. In Proceedings of the 35th Annual Symposium Foundations of Computer Science, Santa Fe, NM, USA, 20–22 November 1994; pp. 124–134. [Google Scholar]
  27. Grover, L. A fast quantum mechanical algorithm for database search. In Proceedings of the 28th Annual ACM Symposium on the Theory of Computing, Philadelphia, PA, USA, 22–24 May 1996; pp. 212–219. [Google Scholar]
  28. Venegas-Andraca, S.E.; Bose, S. Storing, processing and retrieving an image using quantum mechanics. In Proceedings of the SPIE Conference Quantum Information and Computation, Orlando, FL, USA, 21–25 April 2003; pp. 137–147. [Google Scholar]
  29. Le, P.Q.; Dong, F.; Hirota, K. A flexible representation of quantum images for polynomial preparation, image compression, and processing operations. Quantum Inform. Process. 2011, 10, 63–84. [Google Scholar] [CrossRef]
  30. Caraiman, S.; Manta, V.I. Image segmentation on a quantum computer. Quantum Inf. Process. 2015, 14, 1693–1715. [Google Scholar] [CrossRef]
  31. Yangguang, S. Quantum statistical edge detection using path integral monte carlo simulation. Bio-Inspired Computing-Theories and Applications. Commun. Comput. Inf. Sci. 2014, 472, 430–434. [Google Scholar]
  32. Yan, F.; Iliyasu, A.M.; Fatichah, C.; Tangel, M.L. Quantum image searching based on probability distributions. J. Quantum Inf. Sci. 2012, 2, 55–60. [Google Scholar] [CrossRef]
  33. Ning, W.; Song, L. A watermarking strategy for quantum image based on least significant bit. Chin. J. Quantum Electron. 2015, 32, 263–269. [Google Scholar]
  34. Feng, S.; Xiang, L.; Huabao, L. Sampling number of reconstruction arithmetic based on quantum correlated imaging. Chin. J. Quantum Electron. 2015, 32, 144–149. [Google Scholar]
  35. Du, S.; Wu, G.; Ma, L. Maximum quantum entropy based optimal threshold selecting criterion for thresholding image segmentation. J. Comput. Inf. Syst. 2014, 10, 3359–3366. [Google Scholar]
  36. Martin, D.; Fowlkes, C.; Tal, D.; Malik, J. A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. Proc. Int. Conf. Comput. Vis. 2001, 2, 416–423. [Google Scholar]
  37. Dice, L.R. Measures of the amount of ecologic association between species. Ecology 1945, 26, 297–302. [Google Scholar] [CrossRef]
  38. Unnikrishnan, R.; Pantofaru, C.; Hebert, M. Toward objective evaluation of image segmentation algorithms. IEEE Trans. Pattern Anal. Mach. Intell. 2007, 29, 929–944. [Google Scholar] [CrossRef] [PubMed]
  39. Arbelaez, P.; Maire, M.; Fowlkes, C. Contour detection and hierarchical image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2011, 33, 898–916. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Traces of encoded state vectors on 2D and 3D space.
Figure 1. Traces of encoded state vectors on 2D and 3D space.
Entropy 20 00728 g001
Figure 2. Visual comparison of (a) original images and bi-level segmented ones by using the (b) Otsu, (c) Kapur, (d) quantum version of Kapur’s method (QKapur), (e) global quantum entropy maximization (GQEM) and (f) quantum lossy-encoding-based entropy maximization (QLEEM) methods, respectively.
Figure 2. Visual comparison of (a) original images and bi-level segmented ones by using the (b) Otsu, (c) Kapur, (d) quantum version of Kapur’s method (QKapur), (e) global quantum entropy maximization (GQEM) and (f) quantum lossy-encoding-based entropy maximization (QLEEM) methods, respectively.
Entropy 20 00728 g002
Figure 3. Quality assessment of the segmented images in terms of (a) peak signal-to-noise ratio (PSNR) and (b) structural similarity (SSIM).
Figure 3. Quality assessment of the segmented images in terms of (a) peak signal-to-noise ratio (PSNR) and (b) structural similarity (SSIM).
Entropy 20 00728 g003
Figure 4. The comparison of the time consumption of different methods under different thresholding levels.
Figure 4. The comparison of the time consumption of different methods under different thresholding levels.
Entropy 20 00728 g004
Figure 5. Two groups of images in the test dataset, to which the QLEEM algorithm suggests applying (a) bi-level and (b) tri-level thresholding, respectively.
Figure 5. Two groups of images in the test dataset, to which the QLEEM algorithm suggests applying (a) bi-level and (b) tri-level thresholding, respectively.
Entropy 20 00728 g005
Figure 6. Comparison of segmentation results on a synthetic image. (a) noisy image (Gaussian noise with zero mean and 3% variance); (b) Otsu result; (c) Kapur result; (d) QKapur result; (e) GQEM result; (f) QLEEM result.
Figure 6. Comparison of segmentation results on a synthetic image. (a) noisy image (Gaussian noise with zero mean and 3% variance); (b) Otsu result; (c) Kapur result; (d) QKapur result; (e) GQEM result; (f) QLEEM result.
Entropy 20 00728 g006
Table 1. Performance comparison in terms of thresholding level (M), thresholds and computation time.
Table 1. Performance comparison in terms of thresholding level (M), thresholds and computation time.
MethodMThresholdsCPU Time (s)
Otsu21160.233523
385-1570.313405
469-120-1780.348805
560-101-138-1870.666153
652-85-117-150-1931.293793
Kapur21550.256437
391-1700.395228
475-130-1830.473902
566-113-160-2031.222006
656-93-132-170-2091.181965
QKapur21472.007029
310-1472.12888
410-17-1473.924715
510-17-147-2523.114482
610-17-147-251-2524.59602
GQEM21140.338808
384-1470.410247
470-117-1680.666514
562-99-133-1760.682051
654-86-114-143-1820.985941
QLEEM21070.001661
386-1350.002079
462-106-1530.002549
553-90-121-1600.003043
649-83-106-133-1660.003673
Table 2. Performance of different algorithms on a noisy image (the best values are highlighted). DICE, Dice similarity coefficient; PRI, probabilistic Rand index; GCE, global consistency error; VI, variation of information.
Table 2. Performance of different algorithms on a noisy image (the best values are highlighted). DICE, Dice similarity coefficient; PRI, probabilistic Rand index; GCE, global consistency error; VI, variation of information.
AlgorithmDICEPRIGCEVI
Otsu0.8897870.9347840.098070.54778
Kapur0.9085920.9461410.0932750.532568
QKapur0.4723660.4263670.0844471.570079
GQEM0.9215090.9554910.0786460.45552
QLEEM0.9082810.9485010.0975110.580201
Table 3. Average performance of different algorithms on BSDS300 dataset (the best values are highlighted).
Table 3. Average performance of different algorithms on BSDS300 dataset (the best values are highlighted).
AlgorithmDICEPRIGCEVI
Otsu0.4119340.6130440.3859382.825647
Kapur0.4000790.643130.3663482.49384
QKapur0.3639790.5424630.17041.802242
GQEM0.4123960.6113790.3848272.892085
QLEEM0.4058240.6140350.3867812.931183

Share and Cite

MDPI and ACS Style

Wang, X.; Yang, C.; Xie, G.-S.; Liu, Z. Image Thresholding Segmentation on Quantum State Space. Entropy 2018, 20, 728. https://0-doi-org.brum.beds.ac.uk/10.3390/e20100728

AMA Style

Wang X, Yang C, Xie G-S, Liu Z. Image Thresholding Segmentation on Quantum State Space. Entropy. 2018; 20(10):728. https://0-doi-org.brum.beds.ac.uk/10.3390/e20100728

Chicago/Turabian Style

Wang, Xiangluo, Chunlei Yang, Guo-Sen Xie, and Zhonghua Liu. 2018. "Image Thresholding Segmentation on Quantum State Space" Entropy 20, no. 10: 728. https://0-doi-org.brum.beds.ac.uk/10.3390/e20100728

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop