Article

3D Transparent Object Detection and Reconstruction Based on Passive Mode Single-Pixel Imaging

1 School of Engineering, Monash University Malaysia, Jalan Lagoon Selatan, Bandar Sunway, Selangor 47500, Malaysia
2 State Key Laboratory of Modern Optical Instrumentation, College of Optical Science and Engineering, Zhejiang University, 38 Zheda Road, Hangzhou 310027, China
* Author to whom correspondence should be addressed.
Submission received: 30 May 2020 / Revised: 20 July 2020 / Accepted: 24 July 2020 / Published: 29 July 2020
(This article belongs to the Special Issue Sensing and Processing for 3D Computer Vision)

Abstract:
Transparent object detection and reconstruction are significant because of their practical applications. The appearance of these objects, and the way light behaves inside them, cause reconstruction methods tailored for Lambertian surfaces to fail badly. In this paper, we introduce a fixed multi-viewpoint approach to ascertain the shape of transparent objects, which avoids rotating or moving the object during imaging. In addition, a simple and cost-effective experimental setup is presented, which employs two single-pixel detectors and a digital micromirror device to image transparent objects by projecting binary patterns. In the setup, a dark framework is placed around the object to create shading at its boundaries. By triangulating the light path from the object, the surface shape is recovered without modeling the reflections or the number of refractions; the method can therefore handle transparent objects of relatively complex shape with an unknown refractive index. The implementation of compressive sensing further simplifies the acquisition process by reducing the number of measurements. The experimental results show that the 2D images obtained from the single-pixel detectors, at a resolution of 32 × 32, are of good quality, and the obtained disparity and error maps indicate the feasibility and accuracy of the proposed method. This work provides new insight into 3D transparent object detection and reconstruction based on single-pixel imaging, at an affordable cost and with only a few detectors.

1. Introduction

Many practical activities in industry, such as automatic inspection, oceanology, fluid mechanics, and computer graphics, often require imaging the three-dimensional shape of transparent objects. Though various techniques have been developed for deducing 3D images of transparent objects, this class of objects poses difficulties for several reasons. Firstly, these objects are colorless and take their apparent form from neighboring background objects. Secondly, the complexity of light interactions within transparent objects makes their detection very difficult. Finally, knowledge of the refractive index of the material is usually needed for reconstruction.
The existing techniques for transparent object inspection require known or unknown background (checkerboard/striped) patterns calibrated with a camera, which is computationally expensive and costly. The first implementation of an unknown background pattern, placed at the bottom of a water tank, was used to approximate a distorted water surface. Thereafter, structured light patterns were introduced to extract the surface properties of glass objects [1]. Subsequently, direct rays from refractive objects were collected by placing multiple cameras at several positions with respect to the object to approximate the depth [2]. Later, the shape of transparent objects was reconstructed from their known motion [3]. Dusting transparent objects, or submerging them in fluorescent liquids, deteriorates the structure of the object, which has limited the use of these methods in real-time applications [4,5]. More recently, transparent object recovery has been achieved by combining polarization analysis with light-path triangulation [6]. Almost all of these depth acquisition techniques rely on external factors, such as system calibration, background patterns, low-intensity environments, and object motion, which introduce considerable error into the inspection system. Hence, their accuracy is relatively low.
In addition, a range of sensors, such as color cameras, light detection and ranging (LIDAR) systems, time-of-flight (TOF) cameras, IR cameras, and Kinect sensors, have been applied to image transparent objects, yet no general solution has been found for their detection [7,8,9]. With color cameras, recognition is unreliable because a transparent object takes on almost exactly the color of its background. When LIDAR/TOF sensors are used, the light reflected from the object records only an approximate shape: the projected light is reflected multiple times before it hits the transparent object, so little object information reaches the camera. Hence, the shape cannot be recognized and the edges of the object are missed, which makes reconstruction impossible. The Kinect sensor is an alternative for transparent object inspection, in which the sensor is moved around the scene to acquire multiple views of the object [7]; however, that work is limited to non-planar transparent objects with a smooth surface. With an IR camera, Pelletier and Maldague proposed a technique called "shape from heating", in which the transparent object surface was heated and a time sequence of thermograms (thermal images) was recorded to estimate the shape [10]. Later, Eren et al. [11] developed "scanning from heating" to detect 3D transparent objects on the basis of the "shape from heating" technique. In this work, the object was heated with a laser source to a temperature at which it turns opaque, and the emitted radiation was then recorded by a thermal camera for shape estimation. The main limitations of these studies were non-uniform surface heating of the object and the use of an infrared laser source, which restricted the performance of both. The limitations identified in these sensors can be overcome with an alternative sensor, the "single-pixel detector".
Single-pixel imaging (SPI) is an advanced imaging approach suitable for acquiring spatial information under low-light, high-absorption, and backscattering conditions [12]. SPI has been widely used in myriad applications, such as infrared imaging [13], gas imaging [14], photoacoustic imaging [15], three-dimensional imaging [16,17,18], terahertz imaging [19,20], X-ray diffraction tomography [21], remote sensing [22], encrypted imaging [23], lensless imaging [24], shadowless imaging [25], hyperspectral imaging [26], microscopy [27], and scattering imaging [28]. In this imaging modality, a single-pixel detector that has no spatial resolution detects the object by means of modulated structured light [29]. Though the technique is affected by noise, its ability to work in challenging environments with high resolution and precision makes single-pixel detection more popular than conventional imaging systems [30]. Moreover, its sensitivity over a wide operating spectrum extends its operation beyond the visible range [31]. All these characteristics of single-pixel imaging have been utilized to recover images from challenging environments. The smart control of light propagation, with prior knowledge of the modulated patterns, moderates the depletion of ballistic photons. Tajahuerce et al. proposed an optical system using a single-pixel camera that successfully reconstructs 2D objects even under multiple scattering conditions in turbid media [28]. That work also compared the quality of the 2D image reconstruction with the result of a CCD camera: in the presence of a scattering medium in front of the object, a CCD camera captures only a speckle pattern (no information about the object), whereas a single-pixel detector records an excellent result. Based on the modulation involved, the technique is classified into active and passive single-pixel imaging; both modes have been implemented in many imaging modalities for the acquisition of 2D and 3D objects.
Magalhães et al. presented an active-illumination single-pixel camera, in which a photodetector approximated a replica of the object by averaging the inner products between the patterns and the object sample [32]. The problem of spatial and temporal aberrations in imaging transparent objects was resolved in [12]. Later, Bertolotti et al. [33] proposed a non-invasive imaging technique that uses an iterative algorithm to retrieve the image of a fluorescent object hidden in an opaque medium. There are several published works on non-line-of-sight imaging, in which the object is sandwiched between two layers of chicken tissue with thicknesses of 2.84 mm and 2.92 mm [34,35]; the imaging was performed with a single-pixel detector, and the image of the sandwiched object was estimated. Winters et al. recommended a method to improve the speed of reconstruction in scattering media with the help of x and y modulators, which can operate at extremely high speed to control the illumination pattern before sampling the object [36]. All of the above-mentioned works give insight into the reconstruction of 2D objects.
Three-dimensional image reconstruction approaches, such as time-of-flight (TOF), binocular vision, photometric stereo, and shape-from-X techniques, can estimate the depth information of opaque objects. Sun et al. [37] proposed a model based on the "shape from shading" method, in which multiple cameras were placed at different positions around the reflective object. The two-dimensional images from each camera hold the shadows of the object, from which surface gradients are derived and 3D images are reconstructed via photometric stereo. Yu et al. [38] developed a 3D reconstruction system using a binocular stereo-vision algorithm and a single-pixel detector, placing the object on a rotating platform. The limitations of using spatially separated multiple detectors and moving the object were eliminated in [17]. The TOF technique is used to obtain depth information, and its accuracy depends on a high-speed photodiode and precise measurements. Similar demonstrations of scanning a scene and obtaining depth and reflectivity information via TOF have also been reported [39,40,41,42]. Zhang et al. proposed a method to capture images of opaque objects, in which four photodetectors were placed at different locations around the object to capture the reflected light; the variations in shading information across the images were studied, and a photometric stereo algorithm was used for 3D image reconstruction [43]. Soldevila et al. [44] implemented a basic active single-pixel imaging system to image opaque objects, also using the reflected light from the object and processing it with an adaptive compressive algorithm for image reconstruction. All these methods are suited to recovering 3D images of objects with reflective surfaces. Perfect assembly and high-precision tuning of all instruments are needed to achieve the target reconstruction: only when the relevant parameters of each instrument are properly set and matched with a high-precision assembly can these techniques assure the overall reconstruction quality of the system.
In this paper, we present a fixed multi-viewpoint 3D transparent object inspection system based on passive mode single-pixel imaging. Two single-pixel detectors are applied in the setup to eliminate the need to move the object during imaging. The results show that it is possible to obtain the disparity map of the object by using high-speed detectors to record the sampled refracted light, together with our image reconstruction algorithm. The rest of the paper is organized as follows. Section 2 describes our experimental setup for 3D transparent object detection. Section 3 examines how 3D depth information is extracted from the 2D images. Section 4 concludes the paper.

2. Experimental Setup

The schematic diagram of the proposed 3D transparent object detection system is shown in Figure 1. The system consists of a red laser to illuminate the transparent object; a dark framework to cause streak effects at the object boundary; an imaging lens; a digital micromirror device (DMD) to modulate the laser light with a computer-generated measurement matrix; collecting lenses; two single-pixel detectors to collect the transmitted light from the object; a data acquisition (DAQ) system; and a computer to perform 2D and 3D image reconstruction.
The experimental setup is shown in Figure 2. A fiber-output red laser (650 nm, 1 W) illuminates the target object. The refracted light from the target is collected by an imaging lens (LA1740, f = 85 mm) and directed onto the active area of the DMD micromirror (DMD6500 & 9000). To provide spatial information to the captured image, the pre-programmed patterns stored in the DMD are combined with the transmitted light. The modulated light from the DMD is then projected into the environment, where two focusing lenses collect it and focus it onto the active areas of the spatially unresolved single-pixel detectors. The setup operates in passive modulation mode, with two PDA36A2 photodetectors used as single-pixel detectors to record the total light intensity from the object. The DAQ (USB-6001) digitizes the recorded light from the left and right single-pixel detectors and sends it to a computer for 2D image reconstruction. The 2D image quality depends on the patterns used, and any distortion in them deteriorates the image quality. In the passive mode, the intensity-transformed object information is modulated with the patterns at the DMD, which shortens the distance the modulated light beam travels, thereby reducing distortion from ambient light. The pattern structure is also better preserved in the passive method.
For a single-pixel imaging system, the orientation of the optical components and the specifications of the lenses play a crucial role in ensuring high-quality images. Proper selection of the lens and its focal length is essential to concentrate the transmitted light beam from the object onto the very small active area of the single-pixel detector. Furthermore, combining lenses, such as planar and aspheric lenses, ensures sharp focus with fewer aberrations, resulting in better-quality 2D images. As disparity accuracy is closely related to the quality of the 2D reconstruction, the lenses must be chosen cautiously. The proposed experimental setup is implemented in a lab environment with a black framework around the sides of the object to provide a streak effect at its edges. The disparity calculation depends on the quality of the 2D images and their edge sharpness. In our work, the transmitted light (the majority of the light) is collected for image reconstruction, which provides good-quality 2D images compared to conventional methods. Moreover, the features of a single-pixel detector, such as increased detection efficiency, lower noise, and higher time resolution, provide additional advantages. The apparent directional illumination from the DMD and the shadow effect at the edges of the object also make the system superior in producing good-quality 2D images. After obtaining the left and right single-pixel detector images, the 3D reconstruction algorithm first looks for the preserved edges, and then finds the disparity between pixels for depth calculation.
Additionally, calibration of the left and right single-pixel detector images is required to maximize the accuracy of the disparity map calculation, because the 2D images are taken from different angles. In the calibration process, multiple images of the object are captured from different perspectives, and self-calibration is performed to obtain the intrinsic and extrinsic parameters for depth calculation [45,46]. To ensure accurate measurement, a trigger signal initiates the DMD to modulate the incoming light with the pre-programmed patterns. The exposure time and dark time of the DMD are set according to the number of samples to be recorded per period. The calibration process thus reduces the probability of error and makes the measurement process systematic and consistent.
For the experiment, the two single-pixel detectors are synchronized such that both capture measurements at the same time as the DMD projects patterns. The number of samples recorded per second for each displayed pattern is set to 100. The DAQ averages these 100 samples to obtain a single measurement corresponding to each pattern and sends it to a high-performance computer for further processing; this continues until the DMD stops projecting patterns. In addition, the detectors are placed 7 cm apart to obtain a full view of the object. The distance from the camera to the object is set to 65 cm, and the focal length of the single-pixel detector is set to 8.5 cm, based on the focal length of the lenses used to focus the light onto the detector.
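As a rough illustration of this per-pattern averaging step, the following NumPy sketch simulates the DAQ stream; the signal model, noise level, and all variable names are assumptions made for demonstration, not the actual DAQ driver code:

```python
import numpy as np

rng = np.random.default_rng(0)

n_patterns = 200           # patterns projected by the DMD
samples_per_pattern = 100  # DAQ samples recorded per displayed pattern

# Simulated raw DAQ stream: one noisy photodetector reading per sample;
# true_intensity stands in for the light level while a given pattern is shown.
true_intensity = rng.uniform(0.2, 1.0, size=n_patterns)
raw = true_intensity[:, None] + 0.05 * rng.standard_normal((n_patterns, samples_per_pattern))

# One measurement per pattern: the mean of its 100 samples, mirroring
# how the DAQ reduces noise before 2D reconstruction.
measurements = raw.mean(axis=1)
print(measurements.shape)  # (200,)
```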

3. 3D Transparent Object Reconstruction

3.1. 2D Image Reconstruction

The advent of the DMD and single-pixel imaging enables fast image reconstruction from a few measurements. The transparent object detection and image acquisition process is shown in Figure 3. The object to be imaged is fixed at a position, where it is scanned and sampled with a sequence of sparse matrix patterns (up to M patterns). The resolution of the reconstructed image is determined by the resolution and number of the projected patterns. Owing to the adoption of compressive sensing (CS) in acquiring the samples, the number of measurements required for reconstruction is fixed at m = O(K log N), where N is the total number of pixels in the object. The detectors in the imaging system collect object samples until the DMD stops pattern projection; the number of patterns projected depends on the sparsity of the measurement matrix. Finally, the total variation (TV) minimization algorithm estimates the original signal X from the measurement vector Y with prior knowledge of the sparse matrix. The result obtained from the system is displayed in Figure 4.
Recovering an image X of N × N pixels from a full set of measurements is straightforward with matrix inversion techniques. With such techniques, however, single-pixel imaging has limitations: the requirement of N² measurements for reconstruction, long data acquisition times, and large data storage. These problems can be addressed by combining single-pixel imaging with compressed sensing, which enables the single-pixel detector to reduce the measurements required for reconstruction to a vector Y of size M × 1, thereby reducing data storage and transfer requirements. This approach solves a linear inverse problem in the case where X has a sparse representation.
The SPI technique gathers the light that interacts with the object, using a spatially unresolved single-pixel detector. Spatial information is encoded in the collected light by the pre-programmed, spatially resolved patterns. The single-pixel detector sequentially measures the inner products between the N × N pixelated scene and a set of M binary patterns of N pixels each. The principle behind CS imaging is summarized in Equation (1) [47]:
$$Y = \Phi X \quad (1)$$
where Y is an M × 1 column vector of measurements; Φ is the M × N measurement matrix, whose M rows are the patterns and whose N columns correspond to the pixels; and X is the original image, represented as an N × 1 vector. When the number of measurements M in Y is less than the total number of pixels N in X, Equation (1) becomes an ill-conditioned problem with infinite solutions. To solve such problems, the original image should obey the property called sparsity, in which only the most significant coefficients (K-sparse) of the image are considered for processing and all less significant coefficients are discarded. In CS, the K-sparse information is acquired and stored in the column vector Y. If the image can be represented in some basis, it can be recovered via l1 minimization with knowledge of Y and Φ [47].
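The measurement model of Equation (1) can be simulated in a few lines. The following NumPy sketch is illustrative only: the random binary patterns and the synthetic sparse scene are assumptions for demonstration, not the actual patterns or objects used in the experiment:

```python
import numpy as np

rng = np.random.default_rng(1)

n = 32      # image is 32 x 32, so N = n*n = 1024 pixels
N = n * n
M = 200     # about 20% of N, as used in the experiment

# X: the vectorized scene; here a random K-sparse stand-in for a real object.
X = np.zeros(N)
X[rng.choice(N, size=40, replace=False)] = rng.uniform(0.5, 1.0, 40)

# Phi: M binary DMD patterns, one per row (random 0/1 masks assumed here).
Phi = rng.integers(0, 2, size=(M, N)).astype(float)

# Each single-pixel measurement is the inner product of one pattern with the scene.
Y = Phi @ X   # the M x 1 measurement vector in the paper's notation
print(Y.shape)  # (200,)
```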
Consider a K-sparse signal X that is sparse in an orthogonal basis $\Psi = [\Psi_1, \Psi_2, \ldots, \Psi_N]$; then
$$X = \Psi S \quad (2)$$
where S is K-sparse, i.e., only K of its coefficients are non-zero. According to CS theory, when the original signal contains such K-sparse coefficients, the signal X can be recovered from m = O(K log N) incoherent linear measurements. Equation (1) then becomes:
$$Y = \Phi X = \Phi \Psi S \quad (3)$$
where Φ is a pre-programmed pattern matrix of size M × N, which is uncorrelated with the sparsity basis Ψ, and Y is the M × 1 measurement vector [48]. From the measurement vector Y, image recovery is achieved by a TV-based minimization model. The directional change (gradient) of the object image X at a pixel location x_ij is [49]:
$$G_{ij}(X) = \begin{pmatrix} G_{h;ij}(X) \\ G_{v;ij}(X) \end{pmatrix}, \qquad G_{h;ij}(X) = x_{i+1,j} - x_{i,j}, \qquad G_{v;ij}(X) = x_{i,j+1} - x_{i,j} \quad (4)$$
The TV minimization algorithm calculates the total variation and removes undesirable information while preserving the edges at each pixel location of the image X:
$$TV(X) = \sum_{i,j} \sqrt{G_{h;ij}(X)^2 + G_{v;ij}(X)^2} \quad (5)$$
TV minimization has been adopted in most image processing fields due to its ability to preserve visual quality better than l1 optimization [49]. To acquire a 2D image of size 32 × 32, a conventional imaging system would take 1024 measurements. In SPI, 200 measurements, around 20% of the total number of pixels, are sufficient for good-quality image reconstruction. Three objects were tested, and the resulting images reconstructed from the left and right single-pixel detectors are shown in Figure 4.
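For reference, a minimal NumPy sketch of Equations (4) and (5) is given below. It only evaluates the TV objective on a 2D image; the actual reconstruction solves the full minimization of TV(X) subject to Y = ΦX with a dedicated solver, which is beyond this sketch:

```python
import numpy as np

def total_variation(X: np.ndarray) -> float:
    """Isotropic TV of a 2D image, following Equations (4)-(5):
    horizontal/vertical forward differences, then the sum of gradient magnitudes."""
    Gh = np.zeros_like(X)
    Gv = np.zeros_like(X)
    Gh[:-1, :] = X[1:, :] - X[:-1, :]   # G_h;ij = x_{i+1,j} - x_{i,j}
    Gv[:, :-1] = X[:, 1:] - X[:, :-1]   # G_v;ij = x_{i,j+1} - x_{i,j}
    return float(np.sum(np.sqrt(Gh**2 + Gv**2)))

# A flat image has zero TV; a single sharp edge contributes once per edge pixel.
flat = np.ones((32, 32))
edge = np.ones((32, 32)); edge[:, 16:] = 0.0
print(total_variation(flat), total_variation(edge))  # 0.0, 32.0
```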
Together with the CS algorithm, our experimental setup contributes to the formation of good-quality 2D images of transparent objects. The 2D reconstruction quality obtained from the SPI system is better than that of conventional imaging systems, such as LIDAR or TOF cameras [7,8,9], owing to the single-pixel sensor's detection efficiency, lower noise, and higher time resolution. The apparent directional illumination from the DMD and the shadow effect at the edges of the object also make the system superior to traditional imaging methods in obtaining good-quality reconstructions.

3.2. Disparity Map Using Normalized Cross-Correlation (NCC)

The object in the experimental setup is observed by two single-pixel detectors, which is equivalent to imaging the object from two angles without changing its position. Binocular stereovision determines the position of a point in space by finding the intersection of two lines, each passing through a center of projection and the projection of the point in the corresponding image. The images from the two viewpoints differ in intensity distribution, and the depth information is lost in each individual image; however, depth can be inferred with a binocular vision algorithm, which works much like human eyes. Stereovision algorithms are classified into feature-based and window/area-based techniques. Feature-based algorithms are complex, as they must find matching features for all edges or corners across the two single-pixel images to build a disparity map. Thus, the area-based method is adopted for depth evaluation, in which the algorithm matches blocks of pixels to find correspondences between the images. In this study, the NCC method is used to determine the correspondence between two windows around a pixel of interest. NCC is defined as:
$$NCC(i,j,d) = \frac{\sum_{(i,j) \in w} X_l(i,j) \cdot X_r(i-d,j)}{\sqrt{\sum_{(i,j) \in w} X_l^2(i,j) \cdot \sum_{(i,j) \in w} X_r^2(i-d,j)}} \quad (6)$$
where w is the window, X_l is the left detector image, X_r is the right detector image, and d is the disparity; (i, j) and (i − d, j) index the blocks of pixels to be matched in the left and right detector images, respectively. The window size affects the quality of the disparity map; in this work we have chosen a window size of 6 × 6 pixels.
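A minimal sketch of the window-based NCC search of Equation (6) is given below (NumPy, illustrative only). The shift direction and border handling are assumptions, a practical implementation would vectorize the loops, and the later scaling by the offset adjust (Equation (10)) is omitted here:

```python
import numpy as np

def ncc_disparity(Xl: np.ndarray, Xr: np.ndarray, win: int = 6, d_max: int = 30) -> np.ndarray:
    """For each pixel, pick the shift d that maximizes the NCC of Equation (6)
    between win x win blocks of the left and right images."""
    h, w = Xl.shape
    half = win // 2
    disp = np.zeros((h, w), dtype=int)
    for y in range(half, h - half):
        for x in range(half, w - half):
            block_l = Xl[y - half:y + half, x - half:x + half]
            best_d, best_ncc = 0, -np.inf
            for d in range(0, min(d_max, x - half) + 1):
                block_r = Xr[y - half:y + half, x - d - half:x - d + half]
                denom = np.sqrt(np.sum(block_l**2) * np.sum(block_r**2))
                if denom == 0:
                    continue  # skip empty (all-zero) blocks
                ncc = np.sum(block_l * block_r) / denom
                if ncc > best_ncc:
                    best_ncc, best_d = ncc, d
            disp[y, x] = best_d
    return disp
```

Here the left image is held fixed and the right image block is shifted by d, matching the X_r(i − d, j) term; the sign convention depends on the camera geometry.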
Three-dimensional reconstruction quality depends on the quality of the 2D images and their edge sharpness. Complete edge features aid in separating the boundaries of the object from the background. The 2D images obtained from SPI are noisy and their edges are not uniform; hence, the background noise is removed first, and then the Canny operator is applied for edge detection. After that, the image is processed with morphological operators to make the edges smooth and complete. The major difficulty in transparent object detection is the featureless surface over which to compute disparity. This issue is resolved to some extent in this work, owing to the traced edges and the significant role of the NCC algorithm in depth computation. As the NCC algorithm is less sensitive to changes in the intensity value of each pixel, the depth computation becomes more precise. Depth information from a pair of images is calculated by first computing the distance between a block of pixels at a location in the left image and its corresponding location in the right image; the search for the best match is performed over a window. This produces a disparity map, as shown in Figure 5. Before the disparity calculation, the left and right images are converted to grayscale. The NCC algorithm then takes the intensity range in the images, normally between 0 and 255, and divides it into multiple offsets (from 0 to 30), each covering a range of pixel intensities. At the same time, the offset adjust is calculated using the following formula:
$$\text{offset adjust} = \frac{255}{30} \quad (7)$$
where the offset adjust is used in the final step of the algorithm to calculate the pixel range. For the NCC calculation, the left image is fixed and the right image is shifted across the window; the intensities of the left and right images are multiplied and then divided by the product of their intensity norms (the standard deviation of the intensities across the window). The disparity is then calculated using the following equations:
$$M = \sum_{(i,j) \in w} X_l(i,j) \cdot X_r(i-d,j), \qquad X_r^2 = \sum_{(i,j) \in w} X_r(i-d,j) \cdot X_r(i-d,j), \qquad X_l^2 = \sum_{(i,j) \in w} X_l(i,j) \cdot X_l(i,j) \quad (8)$$
$$NCC = \frac{M}{\sqrt{X_r^2 \cdot X_l^2}} \quad (9)$$
where M represents the dot product between the left and right images. The similarity of the pixels from the left and right images is aggregated over the window, as shown in Equation (6). Variations in window size affect the quality of the reconstructed images: an increased window size makes the disparity map smoother, but introduces inaccuracies in object detail at the boundaries. Hence, a smaller window size is chosen, providing maximum depth detail at the cost of more noise. The disparity obtained from the above equations is multiplied by the offset adjust to get the pixel range, which is given by the equation:
$$\text{disparity map} = \text{disparity} \times \text{offset adjust} \quad (10)$$
The depth z of the object from the photodetector is calculated using the following formula:
$$z = \frac{b \times f}{d} \quad (11)$$
where b represents the baseline distance, i.e., the distance from the optical center of one detector to the other; f symbolizes the focal length of the photodetector; and d indicates the disparity between the pixels in one image and the other.
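As a quick plausibility check with the parameters from Section 2 (b = 7 cm, f = 8.5 cm), the sketch below evaluates Equation (11); the disparity value used is purely illustrative, and the pixel-to-length conversion of d is assumed rather than taken from the paper:

```python
def depth_cm(b_cm: float = 7.0, f_cm: float = 8.5, d_cm: float = 0.9) -> float:
    """z = b * f / d (Equation (11)); b and f follow Section 2, while the
    disparity d must already be expressed in the same length unit as f,
    a conversion step assumed here."""
    return b_cm * f_cm / d_cm

print(depth_cm())  # ~66.1 cm, close to the 65 cm object distance of Section 2
```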
The disparity maps for the various objects are plotted in Figure 5. The depth bar, in cm, indicates the distance at which the object is placed from the camera: a pixel value within the minimum offset range indicates the farthest information (background) in the plot, while a pixel value within the maximum offset range indicates the information nearest to the detector. The widely accepted bad matched pixel (BMP) measure is used to quantitatively evaluate the disparity maps for error estimation; it is calculated using the following formula:
$$BMP = \frac{1}{N} \sum_{(x,y)} \varepsilon(x,y), \qquad \varepsilon(x,y) = \begin{cases} 1 & \text{if } |D_{\text{true}}(x,y) - D_{\text{reconstructed}}(x,y)| > \delta \\ 0 & \text{otherwise} \end{cases} \quad (12)$$
where D_true represents the ground truth data and D_reconstructed represents the disparity map data. The error tolerance δ is commonly taken as 1. The BMP values computed for the three disparity maps in Figure 5a–c are 0.221, 0.218, and 0.202, respectively. These values measure the quantity of errors occurring in the disparity maps.
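Equation (12) reduces to a one-line computation; the sketch below, with synthetic disparity maps, is for illustration only and does not reproduce the paper's data:

```python
import numpy as np

def bad_matched_pixels(d_true: np.ndarray, d_rec: np.ndarray, delta: float = 1.0) -> float:
    """Equation (12): fraction of pixels whose disparity error exceeds delta."""
    return float(np.mean(np.abs(d_true - d_rec) > delta))

# Illustrative check with synthetic maps (not the paper's data):
rng = np.random.default_rng(2)
d_true = rng.uniform(0, 30, size=(32, 32))
d_rec = d_true + rng.normal(0.0, 1.0, size=(32, 32))
print(bad_matched_pixels(d_true, d_rec))  # roughly 0.3 for unit-sigma noise
```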
To verify the effectiveness of our method, the reconstructed images are aligned with the original ground truth images to compute the error map shown in Figure 6. The percentage error (%) is calculated from the difference between the ground truth image and the reconstructed image. The percentage depth error is given by the formula:
$$\text{Absolute depth error (\%)} = \frac{|\text{ground truth image} - \text{reconstructed image}|}{\text{ground truth image}} \times 100\% \quad (13)$$
The Otsu method is implemented, with the threshold value set to one, when comparing the ground truth image with the reconstructed image.
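A per-pixel form of Equation (13) can be sketched as follows; the epsilon guard against zero-valued ground-truth (background) pixels is our assumption, not part of the paper's formulation:

```python
import numpy as np

def depth_error_percent(ground_truth: np.ndarray, reconstructed: np.ndarray) -> np.ndarray:
    """Per-pixel absolute depth error (%) following Equation (13).
    eps avoids division by zero where the ground-truth depth is zero."""
    eps = 1e-8
    return np.abs(ground_truth - reconstructed) / (np.abs(ground_truth) + eps) * 100.0
```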
The results show that the proposed transparent object inspection system performs well in capturing images and finding the disparity. Compared with existing techniques, the proposed system is superior in reconstructing shapes under visible light with cost-effective single-pixel detectors. In addition, the proposed setup does not require moving the object or the camera in the scene to acquire multiple views. Moreover, images are reconstructed from a smaller number of measurements thanks to CS, reducing storage requirements and time-consuming computations. Some parts of the objects are not detected in the reconstructed results because less light is transmitted through them; consequently, the quality of the 3D reconstruction is not as good as expected, due to missing parts in the reconstructed 2D images. Post-processing of the 2D images is therefore necessary before feeding them to the disparity calculation program. Moreover, decreasing the window size to obtain finer details of the reconstruction adds more noise to the image, which causes the 3D image to become blurrier and noisier.

4. Conclusions

In conclusion, we have experimentally demonstrated a 3D transparent object inspection system with two single-pixel detectors, collecting the light transmitted through the object. The use of two single-pixel detectors overcomes the limitation of moving the object during imaging. Two-dimensional images are reconstructed using convex optimization algorithms based on CS, and the NCC algorithm successfully deduces the depth map from the 2D images. The resulting 3D image, obtained using the proposed passive single-pixel imaging setup and the NCC algorithm, is of better quality than that of conventional imaging methods. The system developed for transparent object inspection can detect objects with flat, homogeneous surfaces of limited thickness. In the future, more experiments will be conducted on complex objects, and the 3D image reconstruction algorithm will be further improved.

Author Contributions

This research work was mainly conducted by A.M. She composed the manuscript. X.W. contributed with a discussion about the system design, algorithm development, and results analysis, as well as writing the manuscript. N.G. and D.L. contributed to manuscript revising. All authors have read and agreed to the published version of the manuscript.

Funding

This material is based upon work supported by the Air Force Office of Scientific Research under award number FA2386-16-1-4115.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zhang, Z.; Zhang, M.; Chang, Y.; Aziz, E.-S.; Esche, S.K.; Chassapis, C. Real-Time 3D Model Reconstruction and Interaction Using Kinect for a Game-Based Virtual Laboratory. In ASME 2013 International Mechanical Engineering Congress and Exposition; American Society of Mechanical Engineers Digital Collection: New York, NY, USA, 2013. [Google Scholar]
  2. Kutulakos, K.N.; Steger, E. A theory of refractive and specular 3D shape by light-path triangulation. Int. J. Comput. Vis. 2008, 76, 13–29. [Google Scholar] [CrossRef] [Green Version]
  3. Zheng, J.Y.; Murata, A. Acquiring 3D object models from specular motion using circular lights illumination. In Proceedings of the Sixth International Conference on Computer Vision (IEEE Cat. No. 98CH36271), Bombay, India, 7 January 1998; IEEE: Piscataway Township, NJ, USA, 2002. [Google Scholar]
  4. Hullin, M.B.; Fuchs, M.; Ihrke, I.; Seidel, H.P.; Lensch, H.P. Fluorescent immersion range scanning. ACM Trans. Graph. 2008, 27, 87. [Google Scholar] [CrossRef]
  5. Rantoson, R.; Stolz, C.; Fofi, D.; Mériaudeau, F. 3D reconstruction of transparent objects exploiting surface fluorescence caused by UV irradiation. In Proceedings of the 2010 IEEE International Conference on Image Processing, Hong Kong, China, 26–29 September 2010; IEEE: Piscataway Township, NJ, USA, 2010. [Google Scholar]
  6. Xu, X.; Qiao, Y.; Qiu, B. Reconstructing the surface of transparent objects by polarized light measurements. Opt. Express 2017, 25, 26296. [Google Scholar] [CrossRef] [PubMed]
  7. Alt, N.; Rives, P.; Steinbach, E. Reconstruction of transparent objects in unstructured scenes with a depth camera. In Proceedings of the 2013 IEEE International Conference on Image Processing, Melbourne, Australia, 15–18 September 2013; IEEE: Piscataway Township, NJ, USA, 2013; pp. 4131–4135. [Google Scholar]
  8. Klank, U.; Carton, D.; Beetz, M. Transparent object detection and reconstruction on a mobile platform. In Proceedings of the IEEE International Conference on Robotics and Automation, Shanghai, China, 9–13 May 2011; pp. 5971–5978. [Google Scholar]
  9. Zhong, L.; Ohno, K.; Takeuchi, E.; Tadokoro, S. Transparent object detection using color image and laser reflectance image for mobile manipulator. In Proceedings of the 2011 IEEE International Conference on Robotics and Biomimetics, Phuket, Thailand, 7–11 December 2011; IEEE: Piscataway Township, NJ, USA, 2011; pp. 1–7. [Google Scholar]
  10. Pelletier, J.-F.; Maldague, X. Shape from heating: A two-dimensional approach for shape extraction in infrared images. Opt. Eng. 1997, 36, 370–375. [Google Scholar] [CrossRef]
  11. Eren, G.; Aubreton, O.; Meriaudeau, F.; Secades, L.A.S.; Fofi, D.; Naskali, A.T.; Ercil, A. Scanning from heating: 3D shape estimation of transparent objects from local surface heating. Opt. Express 2009, 17, 11457–11468. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Katz, O.; Small, E.; Bromberg, Y.; Silberberg, Y. Focusing and compression of ultrashort pulses through scattering media. Nat. Photonics 2011, 5, 372. [Google Scholar] [CrossRef]
  13. Edgar, M.P.; Gibson, G.M.; Bowman, R.W.; Sun, B.; Radwell, N.; Mitchell, K.J.; Padgett, M.J. Simultaneous real-time visible and infrared video with single-pixel detectors. Sci. Rep. 2015, 5, 10669. [Google Scholar] [CrossRef]
  14. Gibson, G.M.; Sun, B.; Edgar, M.P.; Phillips, D.B.; Hempler, N.; Maker, G.T.; Padgett, M.J. Real-time imaging of methane gas leaks using a single-pixel camera. Opt. Express 2017, 25, 2998–3005. [Google Scholar] [CrossRef]
  15. Huynh, N.; Huynh, N.; Zhang, E.; Betcke, M.; Arridge, S.; Beard, P.; Cox, B. Single-pixel optical camera for video rate ultrasonic imaging. Optica 2016, 3, 26–29. [Google Scholar] [CrossRef]
  16. Hansen, M.F.; Atkinson, G.A.; Smith, L.N.; Smith, M.L. 3D face reconstructions from photometric stereo using near infrared and visible light. Comput. Vis. Image Underst. 2010, 114, 942–951. [Google Scholar] [CrossRef]
  17. Sun, M.-J.; Edgar, M.P.; Gibson, G.M.; Sun, B.; Radwell, N.; Lamb, R.; Padgett, M.J. Single-pixel three-dimensional imaging with time-based depth resolution. Nat. Commun. 2016, 7, 1–6. [Google Scholar] [CrossRef] [PubMed]
  18. Sun, M.-J.; Zhang, J.-M. Single-pixel imaging and its application in three-dimensional reconstruction: A brief review. Sensors 2019, 19, 732. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  19. Shrekenhamer, D.; Watts, C.M.; Padilla, W.J. Terahertz single pixel imaging with an optically controlled dynamic spatial light modulator. Opt. Express 2013, 21, 12507–12518. [Google Scholar] [CrossRef]
  20. Guerboukha, H.; Nallappan, K.; Skorobogatiy, M. Toward real-time terahertz imaging. Adv. Opt. Photonics 2018, 10, 843–938. [Google Scholar] [CrossRef]
  21. Greenberg, J.; Krishnamurthy, K.; Brady, D.J. Compressive single-pixel snapshot x-ray diffraction imaging. Opt. Lett. 2014, 39, 111–114. [Google Scholar] [CrossRef] [PubMed]
  22. Erkmen, B.I. Computational ghost imaging for remote sensing. J. Opt. Soc. Am. A 2012, 29, 782–789. [Google Scholar] [CrossRef] [PubMed]
  23. Wu, J.; Xie, Z.; Liu, Z.; Liu, W.; Zhang, Y.; Liu, S. Multiple-image encryption based on computational ghost imaging. Opt. Commun. 2016, 359, 38–43. [Google Scholar] [CrossRef]
  24. Huang, G.; Jiang, H.; Matthews, K.; Wilford, P. Lensless imaging by compressive sensing. In Proceedings of the 2013 IEEE International Conference on Image Processing, Melbourne, VIC, Australia, 15–18 September 2013; IEEE: Piscataway Township, NJ, USA, 2014. [Google Scholar]
  25. Li, S.; Zhang, Z.; Ma, X.; Zhong, J. Shadow-free single-pixel imaging. Opt. Commun. 2017, 403, 257–261. [Google Scholar] [CrossRef]
  26. Bian, L.; Suo, J.; Situ, G.; Li, Z.; Fan, J.; Chen, F.; Dai, Q. Multispectral imaging using a single bucket detector. Sci. Rep. 2016, 6, 24752. [Google Scholar] [CrossRef] [Green Version]
  27. Aspden, R.S.; Gemmell, N.R.; Morris, P.A.; Tasca, D.S.; Mertens, L.; Tanner, M.G.; Kirkwood, R.A.; Ruggeri, A.; Tosi, A.; Boyd, R.W.; et al. Photon-sparse microscopy: Visible light imaging using infrared illumination. Optica 2015, 2, 1049–1052. [Google Scholar] [CrossRef] [Green Version]
  28. Tajahuerce, E.; Durán, V.; Clemente, P.; Irles, E.; Soldevila, F.; Andrés, P.; Lancis, J. Image transmission through dynamic scattering media by single-pixel photodetection. Opt. Express 2014, 22, 16945–16955. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  29. Duarte, M.; Davenport, M.A.; Takbar, D.; Laska, J.N.; Sun, T.; Kelly, K.F.; Baraniuk, R.G. Single-pixel imaging via compressive sampling. IEEE Signal Process. Mag. 2008, 25, 83–91. [Google Scholar] [CrossRef] [Green Version]
  30. Sun, M.-J.; Xu, Z.-H.; Wu, L.-A. Collective noise model for focal plane modulated single-pixel imaging. Opt. Lasers Eng. 2018, 100, 18–22. [Google Scholar] [CrossRef]
  31. Zhang, Z.; Liu, S.; Peng, J.; Yao, M.; Zheng, G.; Zhong, J. Simultaneous spatial, spectral, and 3D compressive imaging via efficient Fourier single-pixel measurements. Optica 2018, 5, 315–319. [Google Scholar] [CrossRef]
  32. Magalhães, F.; Araújo, F.M.; Correia, M.; Abolbashari, M.; Farahi, F. Active illumination single-pixel camera based on compressive sensing. Appl. Opt. 2011, 50, 405–414. [Google Scholar] [CrossRef] [Green Version]
  33. Bertolotti, J.; Van Putten, E.G.; Blum, C.; Lagendijk, A.; Vos, W.L.; Mosk, A.P. Non-invasive imaging through opaque scattering layers. Nature 2012, 491, 232–234. [Google Scholar] [CrossRef]
  34. Durán, V.; Soldevila, F.; Irles, E.; Clemente, P.; Tajahuerce, E.; Andrés, P.; Lancis, J. Compressive imaging in scattering media. Opt. Express 2015, 23, 14424–14433. [Google Scholar] [CrossRef]
  35. Berrocal, E.; Pettersson, S.-G.; Kristensson, E. High-contrast imaging through scattering media using structured illumination and Fourier filtering. Opt. Lett. 2016, 41, 5612–5615. [Google Scholar] [CrossRef]
  36. Winters, D.G.; Bartels, R.A. Two-dimensional single-pixel imaging by cascaded orthogonal line spatial modulation. Opt. Lett. 2015, 40, 2774–2777. [Google Scholar] [CrossRef]
  37. Sun, B.; Edgar, M.P.; Bowman, R.W.; Vittert, L.E.; Welsh, S.; Bowman, A.; Padgett, M.J. 3D Computational Imaging with Single-Pixel Detectors. Science 2013, 340, 844–847. [Google Scholar] [CrossRef] [Green Version]
  38. Yu, W.-K.; Yao, X.R.; Liu, X.F.; Li, L.Z.; Zhai, G.J. Three-dimensional single-pixel compressive reflectivity imaging based on complementary modulation. Appl. Opt. 2015, 54, 363–367. [Google Scholar] [CrossRef]
  39. Kirmani, A.; Venkatraman, D.; Shin, D.; Colaço, A.; Wong, F.N.C.; Shapiro, J.H.; Goyal, V.K. First-Photon Imaging. Science 2013, 343, 58–61. [Google Scholar] [CrossRef] [PubMed]
  40. Howland, G.A.; Dixon, P.B.; Howell, J.C. Photon-counting compressive sensing laser radar for 3D imaging. Appl. Opt. 2011, 50, 5917–5920. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  41. McCarthy, A.; Krichel, N.J.; Gemmell, N.R.; Ren, X.; Tanner, M.G.; Dorenbos, S.N.; Zwiller, V.; Hadfield, R.H.; Buller, G. Kilometer-range, high resolution depth imaging via 1560 nm wavelength single-photon detection. Opt. Express 2013, 21, 8904–8915. [Google Scholar] [CrossRef] [Green Version]
  42. Howland, G.A.; Lum, D.J.; Ware, M.R.; Howell, J.C. Photon counting compressive depth mapping. Opt. Express 2013, 21, 23822–23837. [Google Scholar] [CrossRef] [Green Version]
  43. Zhang, Y.; Edgar, M.P.; Sun, B.; Radwell, N.; Gibson, G.M.; Padgett, M.J. 3D single-pixel video. J. Opt. 2016, 18, 35203. [Google Scholar] [CrossRef]
  44. Soldevila, F.; Salvador-Balaguer, E.; Clemente, P.; Tajahuerce, E.; Lancis, J. High-resolution adaptive imaging with a single photodiode. Sci. Rep. 2015, 5, 1–9. [Google Scholar] [CrossRef]
  45. Gribben, J.; Boate, A.R.; Boukerche, A. Emerging Digital Micromirror Device Based Systems and Applications IX, Calibration for 3D Imaging with a Single-Pixel Camera; International Society for Optics and Photonics: Bellingham, WA, USA, 2017. [Google Scholar]
  46. Zhang, Z. Flexible Camera Calibration by Viewing a Plane from Unknown Orientations. In Proceedings of the Seventh IEEE International Conference on Computer Vision, Kerkyra, Greece, 20–27 September 1999; IEEE: Piscataway Township, NJ, USA, 2002. [Google Scholar]
  47. Edgar, M.P.; Gibson, G.M.; Padgett, M.J. Principles and prospects for single-pixel imaging. Nat. Photon 2018, 13, 13–20. [Google Scholar] [CrossRef]
  48. Sun, M.-J.; Edgar, M.P.; Phillips, D.B.; Gibson, G.M.; Padgett, M.J. Improving the signal-to-noise ratio of single-pixel imaging using digital microscanning. Opt. Express 2016, 24, 10476–10485. [Google Scholar] [CrossRef] [Green Version]
  49. Candès, E.J.; Romberg, J.K.; Tao, T. Stable signal recovery from incomplete and inaccurate measurements. Commun. Pure Appl. Math. 2006, 59, 1207–1223. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Schematic of 3D transparent object detection and disparity map acquisition system. The red dashed line indicates the laser light beam from the source to detectors.
Figure 2. Experimental setup implemented in our lab environment.
Figure 3. Flowchart of transparent object detection process.
Figure 4. Two-dimensional image reconstruction results for the passive single-pixel imaging method. (a) The original object for reconstruction: “G” has a thickness of 10 mm, “bulb” has a size of 20 × 26 mm and “Transparent-circle” has a thickness of 5 mm. (b) The reconstructed 2D image from left and (c) right detectors.
Figure 5. Disparity map acquisition of 3D transparent objects based on single-pixel imaging. (a) The depth map of the object “G” is displayed with a disparity range. (b) The depth map for the complex object is illustrated with a disparity range. (c) The depth map for the object “Transparent-circle”.
Figure 6. The error map computation result for the object (a) “G” (b) “bulb” and (c) “Transparent-circle”.

