Next Article in Journal
Design of Low-Noise CMOS Image Sensor Using a Hybrid-Correlated Multiple Sampling Technique
Next Article in Special Issue
Optimal Configuration of Multi-Task Learning for Autonomous Driving
Previous Article in Journal
A New Method to Evaluate Pressure Distribution Using a 3D-Printed C2-C3 Cervical Spine Model with an Embedded Sensor Array
Previous Article in Special Issue
A Modulated Approach for Improving MFSK RADARS to Resolve Mutual Interference on Autonomous Vehicles (AVs)
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Improved Frequency Sweep Keying CDMA Using Faster R-CNN for Extended Ultrasonic Crosstalk Reduction

1
School of Electric and Electronic Engineering, Pusan National University, Busan 46241, Republic of Korea
2
School of Electronic Engineering, Pusan National University, Busan 46241, Republic of Korea
*
Author to whom correspondence should be addressed.
Submission received: 1 November 2023 / Revised: 28 November 2023 / Accepted: 30 November 2023 / Published: 1 December 2023

Abstract

:
Ultrasonic sensors are inexpensive and provide highly accurate measurements, even with simple hardware configurations, facilitating their use in various fields. When multiple ultrasonic sensors exist in the measurement space, crosstalk occurs due to other nodes, which leads to incorrect measurements. Crosstalk includes not only receiving homogeneous signals from other nodes, but also overlapping by other signals and interference by heterogeneous signals. This paper proposes using frequency sweep keying modulation to provide robustness against overlap and a faster region-based convolutional neural network (R-CNN) demodulator to reduce the interference caused by heterogeneous signals. The demodulator works by training Faster R-CNN with the spectrograms of various received signals and classifying the received signals using Faster R-CNN. Experiments implementing an ultrasonic crosstalk environment showed that, compared to on–off keying (OOK), phase-shift keying (PSK), and frequency-shift keying (FSK), the proposed method can implement CDMA even with shorter codes and is robust against overlap. Compared to correlation-based frequency sweep keying, the time-of-flight error was reduced by approximately 75%. While the existing demodulators did not consider heterogeneous signals, the proposed method ignored approximately 99% of the OOK and PSK signals and approximately 79% of the FSK signals. The proposed method performed better than the existing methods and is expected to be used in various applications.

1. Introduction

Ultrasonic sensors contain transducers with various frequencies, from 40 kHz transducers commonly used for distance measurement in air to tens of MHz transducers used for non-destructive testing. The measurement resolution ranges from a few centimeters to tens of nanometers, depending on the ultrasonic wavelength. Ultrasonic sensors have been used in various applications, such as distance measurement [1,2,3], positioning [4,5,6], non-destructive testing [7,8], and human physical inspection [9,10,11], because of their diverse measurement resolution. In particular, ultrasonic sensors see widespread use to obtain distance measurements in air because they are less expensive than other sensors and can measure with high accuracy, even with simple hardware configurations. For example, they can be attached to a car and used to implement intelligent safety systems, such as autonomous driving or front and rear collision prevention [12,13,14]. Recently, crosstalk between ultrasonic signals has become a problem due to the increasing number of cars equipped with intelligent safety systems implemented via ultrasound [15,16]. When obtaining measurements in an environment in which multiple ultrasonic sensors exist, ultrasonic signals may not be received, or ultrasonic signals from other sensors may be received due to crosstalk among the sensors. In this case, the measurement error of the ultrasonic sensor may become very large, or the system may fail to operate normally [17].
Several multiple-access methods have been proposed to ensure orthogonality between each signal in an environment with crosstalk among ultrasonic signals. One such method, which is known as time division multiple access (TDMA), ensures orthogonality by having each node measure the time of flight (TOF) at different times [18,19,20]. When using this method, even after the TOF measurement is completed, the ultrasound remains in the medium, so the subsequent TOF measurement may only be conducted once the ultrasound in the medium has disappeared entirely. This significantly increases the time taken to obtain the measurement. The second method, frequency division multiple access (FDMA), ensures orthogonality by having each node perform TOF measurements with different frequencies [21,22]. FDMA may be implemented either by using a wideband ultrasound transducer or by using several narrowband ultrasound transducers. Wideband ultrasound transducers are more difficult to obtain and more expensive than general narrowband ultrasound transducers. When using multiple narrowband ultrasonic transducers, the number of ultrasonic transducers must increase with the number of modes. The third method, code division multiple access (CDMA), ensures orthogonality by assigning different codes to each node [23,24,25,26,27]. It is a commonly used method in ultrasound and is also called coded ultrasound or multi-code ultrasound. CDMA does not increase hardware complexity and has the advantage of reducing the TOF error due to the spread spectrum of pulses during decoding.
In CDMA, modulation/demodulation methods include on–off keying (OOK) [28], phase shift keying (PSK) [29], frequency shift keying (FSK) [30], and frequency sweep keying [31]. Several papers have combined these methods with ultrasound systems to solve the multiple access problem. The modulation method and demodulation algorithm affect the performance of the system because the piezoelectric element of the ultrasonic transducer has resonance characteristics. The authors of [5] proposed an indoor ultrasonic positioning system for multiple robots. The keying method used was OOK. Envelope detection and binarization considering ultrasonic attenuation were performed to improve the demodulator’s performance. The authors of [32] used gold code modulation and proposed using a demodulator with a circular convolution structure. These methods offer robust decoding against environmental noise, but lead to the deterioration of the orthogonality of the code due to demodulation errors occurring when signals from different nodes overlap. The authors of [33] used BPSK to measure indoor distance. A cross-correlation-type demodulator was applied to improve the decoding performance. Although it performed better than ASK in multiple access situations, it increased the number of demodulation errors due to the near–far effect if the difference in the signal strength between nodes was large. In [34], the FSK method was used for indoor localization. The frequency hopping spread spectrum demodulation method was used and incorporated a picoscope, FIR filter, and synchronization process. If the bandwidth of the ultrasonic transducer is sufficient, it has the advantage of small demodulation error and good orthogonality. However, when using a general narrow-band transducer, the transmission and reception signal strength are weak due to the band limiting, making application difficult. The modulation method described in [31], which was previously studied by our research team, sweeps the frequency in the direction of the increase or decrease around the center frequency. Demodulation is performed by correlating the received signal and the reference bit mask in the spectrogram. Frequency sweep keying is robust to signal overlap compared to OOK, PSK, and FSK and can be implemented with a relatively short code length. However, this method has some limitations. First, when a heterogeneous signal is received, it is impossible to distinguish the heterogeneous signal using the existing demodulation method, and a demodulation error can occur. Second, failure to properly set the threshold coefficient may lead to performance degradation. Third, because CDMA is performed on the spectrogram, the time domain resolution is degraded, resulting in low TOF precision.
Ultrasonic signal crosstalk can be caused not only by the homogeneous signals transmitted, but also by heterogeneous signals from unidentified nodes. The previous studies were designed without considering heterogeneous signal input. The same applies to the previous frequency sweep keying method. This paper aimed to ensure that CDMA operates normally by being robust to crosstalk, including overlap, for homogeneous signals and ignoring heterogeneous signals. Therefore, in this paper, we propose a method that uses frequency sweep keying modulation, which is more robust to signal overlap than OOK, PSK, and FSK and uses a faster region-based convolutional-neural-network (R-CNN) [35] demodulation. Frequency sweep keying modulation was proposed in the previous paper and used in this paper, briefly described in relative works, and an improved demodulator is proposed in this study. Therefore, this paper describes a novel demodulator based on Faster R-CNN, and the process of demodulation is described in Section 3. The advantage of the proposed method is that it can ignore heterogeneous signals by classifying the keying type and code. In addition, to solve the problem of low TOF measurement resolution, which is a disadvantage of the demodulation method using a time–frequency spectrogram, the TOF precision is improved by performing signal segmentation in the time domain. In this work, a performance evaluation was conducted by implementing an environment in which ultrasonic crosstalk occurs. Factors such as the SNR and degree of overlap were adjusted, and comparisons were made with related works.
The rest of the paper is organized as follows: Section 2 explains the existing keying methods used in ultrasonic CDMA, including the frequency sweep keying used in this paper. Section 3 describes the demodulation method based on Faster R-CNN and the associated decoding process. Section 4 lists the experimental procedures with ultrasonic sensors and the results of the experiments, while the conclusions and future research directions can be found in Section 5.

2. Related Works

2.1. Existing Keying Code Division Multiple Access

CDMA is a method that provides orthogonality between signals by assigning different identity (ID) codes to each node to solve the signal interference problem. Even in single-node ultrasound systems, the coded ultrasonic method has been used to improve the SNR of the received signal, improving the measurement distance range and TOF measurement accuracy [36,37]. It has the same advantages in multiple access and is easy to implement in ultrasonic systems. OOK [5,32], PSK [33,38], and FSK [39,40] are used to modulate and demodulate digital codes into analog signals.
OOK is the simplest way to implement CDMA. When the code is ‘1’, the transducer transmits N p ultrasonic pulses; when the code is ‘0’, it does not transmit ultrasound for the same amount of time, t p = N p / f c . Here, N p is the number of pulses transmitted per bit, f c is the ultrasonic center frequency, and t p is the transmission time per bit. While this method is easy to implement, the first bit of the ID code must be fixed to code ‘1’ for the TOF measurement. That is, one more bit is always needed in the code length. The demodulator sets a threshold voltage to separate the signal from noise and estimates code ‘1’ when the received signal is greater than the threshold value and code ‘0’ when it is less than the threshold value. Therefore, when the signal overlaps, it is assumed to be code ‘1’, which excludes the overlap of codes ‘0’ and ‘0’. This reduces the orthogonality of the two codes. Hence, long ID codes must be used for normal decoding operation.
PSK is a method of transmitting N p ultrasonic transmission pulses with different phases depending on the ID code. PSK includes binary phase shift keying, which divides a 1 bit code into 180° phase intervals, and quadrature phase shift keying, which divides a 2 bit code into 90° phase intervals. Meanwhile, differential phase shift keying modulates using the phase difference between pulses. PSK is widely used in narrowband ultrasonic transducers because of its good energy efficiency. However, when signals overlap, the phase of the overlapped signal changes depending on the amplitude and phase of the individual signal, which causes demodulation errors to affect the code. Incorrectly estimated codes cannot be predicted, and their correlation with the ID code is low. Hence, decoding is not performed properly, even if the ID code length is long.
FSK is a method that transmits N p ultrasonic transmission pulses at different frequencies based on the center frequency f c according to the ID code. FSK has excellent energy efficiency and modulation and demodulation performance when using wideband ultrasonic transducers. Furthermore, the interference between codes ‘1’ and ‘0’ does not occur when signals overlap. On the other hand, when using a general narrowband ultrasonic transducer, FSK transmits via frequencies that deviate from the center frequency f c , resulting in poor energy efficiency, a lower SNR, and a reduced sensing range.

2.2. Frequency Sweep Keying Code Division Multiple Access

Frequency sweep keying [31] modulates by sweeping the frequency in an increasing or decreasing direction around the center frequency using a general high-Q ultrasonic transducer. Figure 1 shows codes ‘1’ and ‘0’ on a spectrogram modulated using frequency sweep keying. The signal s n , k t corresponding to the n -th bit of the transmission signal is as follows.
s n , k t = sin 2 π f k t t n t u t n t u t n + 1 t
where k is the bit code information with a value of + 1 or 1 . k is + 1 when the bit code is ‘1’ and 1 when the bit code is ‘0’. Δ t is the time taken to perform the frequency sweep. f k t represents the sweep frequency according to time and is written as follows:
f k t = f t k t   m o d   t t 2 + f c
where f c is the sweep center frequency and is set equal to the center frequency of the ultrasonic transducer. Frequency sweep is performed in the range of the upper limit f c + f 2 and lower limit f c f 2 . The transmitted ultrasonic signal s t x ( t ) using an ideal transducer is expressed as the sum of signals for each bit.
s t x t = n = 1 L c s n , k t
The received signal is converted to a spectrogram and demodulated by calculating the cross-correlation using the bit mask and thresholding. Demodulated signals are estimated to have code ‘1’ or code ‘0’, respectively. Each bit mask converts the expected ultrasonic signal corresponding to one bit when the code is ‘1’ or ‘0’ into a time–frequency spectrogram. The threshold value is based on the cross-correlation value between the received signal and the X mask because the result of the correlation operation is proportional to the power of the signal received. The threshold value is given as the product of the cross-correlation value with the X mask and the threshold coefficient set experimentally.
Signals with code ‘0’ or code ‘1’ are estimated through demodulation. Finally, decoding is performed by calculating the correlation between the demodulated signal and the signal that up-sampled the corresponding ID code. This method is more robust to signal overlap than OOK, PSK, and FSK and can be implemented with a relatively short code length.

3. Frequency Sweep Keying CDMA Using Faster Region-Based CNN

Figure 2 shows the proposed Faster-R-CNN frequency sweep keying demodulation and CDMA decoding process. First, the received signal is converted to a spectrogram. The converted signal is segmented to improve the classification accuracy of Faster R-CNN, and the segmentation area is selected by detecting the signal in the time domain. Segmented signals for each bit are classified according to the code and keying type using Faster R-CNN. Finally, CDMA decoding is performed for the purposes of ID recognition and TOF calculation.

3.1. Spectrogram and Segmentation

The proposed demodulator classifies the signal code using Faster R-CNN, merges the results, and outputs the demodulated signal. The segmentation process is implemented to improve the classification performance. After the segmentation range in the time domain has been selected, the signal on the spectrogram is segmented. First, the received ultrasonic signal s r x [ n ] is converted to a time–frequency spectrogram [41] signal to consider both the time and frequency domains in the classification process. The short-time-Fourier-transform-based spectrogram signal S r x n , f is as follows:
S r x n , f = m = s r x m w m n e j 2 π f m 2
where w n is the window function, and the following Kaiser window is used.
w n = L · w 0 L N n N 2 = I 0 β 1 2 n N 1 2 I 0 β ,   0 n N
where I 0 is the zeroth-order modified Bessel function, L is the window duration, N is the window length, and β is a parameter that represents the characteristics of the Kaiser window.
Next, the start and end points of the received signal s r x [ n ] are found for segmentation. Points greater than 6 σ N of noise are considered to be the locations at which the signal exists, and the minimum and maximum values of the index of the signal location are found for the start and end points of the signal. Here, σ N is the standard deviation of background noise. The start index n s and end index n e of the signal are as follows:
n s = min ( n | s r x [ n ] 6 σ N )
n e = max ( n | s r x [ n ] 6 σ N )
Two signals, front signal B k f and rear signal B k r , are segmented by bit from the start and end points, respectively.
B k f = S r x n , f | n s + k 1 w c < n < n s + k w c
B k r = S r x n , f | n e L c k + 1 w c < n < n e L c k w c
where k is the code bit index and the segmentation interval w c is a value that considers the analog–digital converter (ADC) sampling rate f s and Δ t , as well as the time taken to perform the frequency sweep. It is calculated as w c = t f s . Figure 3 shows the segmentation results in the time domain and spectrogram.

3.2. Code Classification Using Faster Region-Based CNN

Faster R-CNN [33,42] consists of a convolutional neural network (CNN) backbone to obtain the features of segmented spectrogram signals, a region proposal network (RPN) to propose a time–frequency domain region, and a classification network to classify the code and heterogeneous signal. The CNN backbone receives the segmented spectrogram of the received ultrasonic signal as the input and extracts the spatial features through convolution. When ultrasonic waves propagate through air, the power of the received signal is attenuated by distance and radiation. Therefore, the signal attenuation is corrected using a time gain amplifier, and the data are normalized to achieve a zero mean before input. The horizontal and vertical axes of the image are the time and frequency domains, respectively. The input image is selected with a size of 64 × 64, covering the time range t and frequency range f of a one-bit signal. The size of the input image is appropriately adjusted according to the sampling rate of the ADC, the window size N , and the overlap interval when converting to a spectrogram. The input volume was set to 64 × 64 × 3 because the signal power of the spectrogram is expressed in RGB colors. A pre-trained ResNet-50 is used as the CNN backbone. A residual neural network (ResNet) [43] is a CNN that solves the vanishing gradient problem using residual blocks. The ResNet architecture comes in various depths; ResNet-50 consists of 50 layers.
The RPN receives the features of the CNN backbone and proposes the time–frequency region of the received ultrasonic signal and the objectness score of the signal in that region. The RPN reduces the dimensionality of the input features and proposes regions through two layers for classification and bounding box regression, respectively. A two-dimensional window is stridden, and bounding boxes with three scales and three aspect ratios (1:1, 1:2, and 2:1) are used for every sliding position to find the bounding box in the features. For each bounding box, the regression layer outputs the position of the bounding box as the center x and y coordinates’ width and height, while the classification layer outputs the objectness score of the bounding box. The RPN performs region proposals for some bounding boxes with high objectness scores.
The classification network receives the features of the CNN backbone and the region proposal of the RPN and classifies the signal within the bounding box. Each bounding box is resized to a fixed-size feature vector using the region of interest (ROI) pooling layer, as the size of the features must be the same to perform the classification. For each ROI, the regression layer outputs better-fit bounding box information, and the classification layer outputs the code of the received signal and its probability or its probability of being a heterogeneous signal.

3.3. Training Dataset

The training dataset was acquired using the ultrasonic transmission and reception environment. The data were collected and classified into four cases. In the first case, only one bit is received without overlap. There are 2 1 classes with code ‘0’ or code ‘1’. In the second case, two bits are received with overlap. There are 2 2 cases of permutation with the repetition of codes ‘0’ and ‘1’. There are 2 2 × 5 classes with an overlap ratio of 0.1, 0.3, 0.5, 0.7, and 0.9 for each case. Regarding the overlap, there are in total five classes, but the data were collected with a uniform random overlap ratio in the range of ±0.1 to increase the diversity of constructive–destructive interference according to the phase difference between the two signals. In the third case, three bits are received with overlap. There are 2 3 cases of permutation with the repetition of code ‘0’ and code ‘1’. As in the second case, there are five overlap ratios and 2 3 × 5 classes. The last case is one in which a heterogeneous signal is received. There are two classes: OOK signal and FSK signal. PSK has a similar spectrogram to OOK and was not classified separately.
The dataset has 100 elements for each class, and the signal power ratio and background noise were adjusted to prevent overfitting and increase the amount of data. The signal power ratio was adjusted by fine-tuning the distance of the two transmitters in five steps, from 0.2 to 1, in increments of 0.2. White Gaussian background noise was added to adjust the SNR in five steps, from 2 to 10 in two increments. Overall, the data have 2500 elements for each class.

3.4. Grouping and Merging

The classes vary depending on the overlap ratio or overlapping signals, but if segmentation is performed properly, only the code information of the bit located at the center needs to be obtained. Therefore, each class can be grouped into the code ‘1’ set G 1 or the code ‘0’ set G 0 according to the code information of the bit located at the center. In addition, certain classes are grouped into a set G b of both detected cases in which code ‘0’ and code ‘1’ signals are received simultaneously, and cases of heterogeneous signals and cases that are not detected are grouped into a non-detected set G n . The signal C r x , 1 s n demodulated to code ‘1’ and the signal C r x , 0 s n demodulated to code ‘0’ for each group are as follows.
C r x , 1 s n = 1 , G 1 , G b F B n s 0 , O t h e r s C r x , 0 s n = 1 , G 0 , G b F B n s 0 , O t h e r s
where the superscript s indicates whether the signals separated during the segmentation process are front or rear signals. Function F x is a Faster R-CNN classifier that receives segmented spectrograms and outputs classified classes. If the n -th bit of the received signal is classified into a class included in the set G 1 corresponding to code ‘1’ and the both-detect case set G b , the n -th bit of C r x , 1 is 1; otherwise, it is 0. If the n -th bit of the received signal is classified into a class included in the set G 0 corresponding to code ‘0’ and the both detected case set G b , the n -th bit of C r x , 0 is 1; otherwise, it is 0. Therefore, for the both detected set G b , C r x , 1 and C r x , 0 are both 1 in the corresponding bit, and for the non-detected set G n , C r x , 1 and C r x , 0 are both 0 in the corresponding bit.

3.5. Code Division Multiple Access Decoding

The demodulated signal C r x , 1 indicates that code ‘1’ is received, and C r x , 0 indicates that code ‘0’ is received. Therefore, C r x , 1 is compared with the ID code C I D , and C r x , 0 is compared with the inverted ID code, 1 C I D . Since the demodulated signal and the ID code have the same dimension, the similarity between the demodulated signal and the ID code can be obtained by performing the product sum. The similarity ρ c 1 for code ‘1’, the similarity ρ c 0 for code ‘0’, and the similarity ρ c between the ID code and the demodulated signal, which can be obtained from the sum of the two, are expressed in the equations below.
ρ c 1 = n = 1 L I D C I D n C r x , 1 n ρ c 0 = n = 1 L I D 1 C I D n C r x , 0 n ρ c = ρ c 1 + ρ c 0
If the similarity ρ c between the ID code and the demodulated signal is equal to the length L c of the ID code, the corresponding code has been detected, and the TOF is calculated. The TOF is calculated when the front signal t f and the rear signal t r are detected as follows, respectively.
t f = n s / f s
t r = n e L c w c / f s

4. Experimental Results

4.1. Experimental Environment

An ultrasound transmission and reception system was configured in a laboratory environment to verify the frequency sweep keying method using the Faster R-CNN proposed in this paper. As shown in Figure 4, an ultrasonic signal crosstalk environment was configured by connecting a single ultrasonic receiver and multiple ultrasonic transmitters to the micro controller unit (MCU). Ultrasonic transmitters have an individual time delay for their transmitted signals. Therefore, when combining signals, the constructive–destructive interference according to the phase and the overlap ratio of each transmission signal can be adjusted. The received signal data are collected through the amplifier and the ADC of the MCU. The collected data are transferred to a PC, and the demodulation and decoding processes are performed.
Figure 5a shows the ultrasonic transmitting and receiving equipment used in the experiment. The used ultrasonic sensor was the Hagisonic HG-L40DC with a center frequency of 40 kHz, a bandwidth of 4 kHz, and a beam angle of 65°. In addition, ultrasonic-sound-absorbing material was attached around the ultrasonic sensor to minimize reverberation and multi-path fading other than the line of sight. As shown in Figure 5b, an ultrasonic signal crosstalk environment was configured, with one receiving device and multiple transmitting devices in an indoor environment measuring 10 m in width and 10 m in length. Table 1 lists the parameters used in the experiment.

4.2. Faster Region-Based CNN Classifier Validation

The performance of the trained Faster R-CNN classifier was verified using five-fold cross-validation [44], which involved randomly dividing the data into five equal-sized subsets and using one subset as a validation set and the remaining four subsets as a training set. The process was repeated five times. Each of the five subsets was used once as a validation set. Figure 6 shows the precision of each class as a validation result. Precision is one of the classification performance indicators that is calculated based on the true positives ( T P ) and false positives ( F P ), as follows.
P R E C I S I O N = T P T P + F P
The verification results showed an average precision of approximately 0.9. As a result of analyzing the misclassification cases, most cases included classification errors because of the overlap ratio. The classification error in the overlap ratio did not affect the final demodulation performance because the bit information at the center was used in the subsequent grouping process. This showed that the Faster R-CNN classifier was properly trained.

4.3. Robustness Comparison for Homogeneous Signal with Other Modulation Methods

The performance of the frequency sweep keying method using the Faster R-CNN proposed in this paper was compared with the traditional modulation methods OOK, FSK, and PSK and the correlation-based frequency sweep keying method previously proposed by our research team. For comparison with other modulation methods, OOK used the modulation/demodulation method of [5], FSK used the modulation/demodulation method of [33], and PSK used the modulation/demodulation method of [34]. The root mean square (RMS) and standard deviation of the TOF error were measured in the millisecond scale. Ultrasound was transmitted from two transmitting nodes, and the signal was received by one receiving node. The position of the ultrasonic node was set so that the two transmitted signals were received with overlap. The manipulation variables of the experiment were the CDMA code length and overlap ratio. The CDMA code length was 2, 4, 8, 16, 32, 64, and 128; the overlap ratio was 0 to 1 in 0.1 increments, and each case was tested 100 times. The experimental results are expressed in two forms: TOF error according to the code length and detection rates according to the overlap ratio. Table 2 lists the TOF error results of each modulation method according to the code length. FSWK-corr represents the previous correlation-based frequency sweep keying method. The code detection rate was low for OOK, FSK, and PSK, with a short code length of eight or less, resulting in a substantial TOF error. Meanwhile, for PSK, even when the code length was long, the code detection rate was low and the TOF error was large. When the frequency sweep keying method was used, the TOF error was relatively small due to the high code detection rate, even if the code length was short. Comparing the proposed method with FSWK-corr, the proposed method estimated the TOF in the time domain, so the TOF error was small. However, the TOF errors of the proposed method and OOK were similar when the code length was long.
The performance of the demodulator and the CDMA decoder was compared and evaluated against other methods for the purpose of verification. If errors occur during the demodulation process, the orthogonality factor deteriorates and the errors propagate to the detection error rate, which is the result of CDMA decoding. Therefore, in this paper, the demodulated code error rate (DCER) is defined as a quantitative measure to evaluate demodulator performance compared to other methods, as follows.
D C E R = C I D C r x L I D
DCER is a normalized value obtained by dividing the total sum of exclusive disjunctions of ID code C I D and demodulated code C r x by L I D . DCER has a value of zero when no demodulation error occurs and a value of one when errors occur in all bits.
Figure 7 presents an evaluation of the DCER of each method according to the overlap ratio of 0.2, 0.6, and 1.0. The error bar represents the standard deviation of the error rate. The proposed method had a small error rate of less than 0.05, regardless of the overlap ratio. It also led to error rates approximately 25% lower than that of the same modulation method, FSWK-corr, and had a smaller standard deviation range, making it more stable. In the case of OOK and FSK, the error rates reached approximately 0.1, and FSK had a smaller standard deviation range. In the case of PSK, errors in phase information were caused by the overlap of signals with different strengths, and the demodulation error rates were generally high.
Figure 8 shows the results of comparing the detection rates according to the overlap ratio. Methods using frequency sweep keying, which include the proposed method and FSWK-corr, showed detection rates of more than 0.93, regardless of the overlap ratio. For OOK and FSK, however, as the overlap ratio increased, the detection rates decreased from approximately 0.92 to approximately 0.84, and the detection rates were even lower for PSK. Therefore, the proposed method proved robust to overlap and had a small TOF error compared to other methods.

4.4. Robustness Comparison for Heterogeneous Signal

Table 3 compares the robustness of the demodulator to heterogeneous signals. Each keying method exhibited different characteristics associated with the demodulation of heterogeneous signals. The OOK demodulator usually estimated code ‘1’ for all heterogeneous signals. The PSK demodulator estimated the OOK signal as code ‘1’ because their phases were identical, leading to false detections. The PSK demodulator estimated the FSK and frequency sweep keying signals as random codes because of their continuously changing phases, also leading to false detections. The FSK demodulator failed to detect the OOK and PSK signals because they only had center frequency components and caused false detection for the frequency sweep keying signals. The correlation-based frequency sweep keying demodulator falsely detected codes ‘0’ and ‘1’ for the OOK signal with code ‘1’ and the PSK signal, while it failed to detect the OOK signal with code ‘0’. The correlation-based frequency sweep keying demodulator falsely detected code ‘1’ or code ‘0’ for the FSK signal because the frequency rose or fell when the code changed. For the demodulator using Faster R-CNN, which is the proposed method, the OOK and PSK signals used only the center frequency, so they are recognized as horizontal parallel bars in the spectrogram and were classified as heterogeneous signals. False detection occurred in approximately 1% of the OOK and PSK signals. For the FSK signal, when the position at which the code changes was segmented, the frequency rose or fell, resulting in an approximately 21% false detection rate as code ‘0’ or ‘1’.

5. Conclusions

As the number of devices using ultrasonic sensors increases, the crosstalk between ultrasonic nodes becomes more frequent. Crosstalk includes situations in which signals overlap and heterogeneous signals are received. To solve this problem, our research team proposed a frequency sweep keying method based on correlation in the time–frequency spectrogram in a previous study. The suggested correlation-based method was more robust to overlap than the existing method. However, it had a high false detection rate when heterogeneous signals were received. In addition, the time resolution was lower than that of other methods because it performs CDMA on the spectrogram. This study proposed a method robust to overlap using the frequency sweep keying method and demodulated the signal using Faster R-CNN to increase the detection rate of proper signals by ignoring when heterogeneous signals are received. An experiment implementing an environment in which ultrasonic crosstalk occurs showed that the detection rate was higher than other traditional modulation methods at a short code length, and the TOF error was reduced by approximately 75% compared to the correlation-based frequency sweep keying method. It showed a high non-detection rate for heterogeneous signal input and exhibited good performance in demodulating only homogeneous signals. Based on the experimental results, the proposed method was robust against overlap and successfully performed CDMA even with a short code length. Therefore, it has a short dead zone and is robust to heterogeneous signals, highlighting its potential in ultrasonic sensors for blind spot detection in automobiles.
On the other hand, several problems have yet to be overcome before this method can be applied successfully. In this study, experiments were conducted in an ideal environment to verify the method’s performance in crosstalk situations. Nevertheless, the proposed method may have limitations when well-known problems of ultrasonic sensing, such as multi-path interference, reverberation artifacts, and reflected signal distortion due to object shape, occur. The performance of the proposed method is greatly affected by the training dataset of Faster R-CNN. If the above problem occurs, the shape of the spectrogram of the received signal may change and misclassification may become more likely. To solve this problem without significantly changing the proposed method, it is possible to train Faster R-CNN more robustly by training on received data in various situations in the training dataset.

Author Contributions

G.-R.P., S.-H.P. and K.-R.B. participated in the study design and data collection, performed the statistical analyses, contributed to the interpretation of the results, wrote the manuscript, and approved the final manuscript as presented. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by BK21FOUR, Creative Human Resource Education and Research Programs for ICT Convergence in the 4th Industrial Revolution.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Kelemen, M.; Virgala, I.; Kelemenová, T.; Miková, Ľ.; Frankovský, P.; Lipták, T.; Lörinc, M. Distance measurement via using of ultrasonic sensor. J. Autom. Control 2015, 3, 71–74. [Google Scholar]
  2. Zhmud, V.A.; Kondratiev, N.O.; Kuznetsov, K.A.; Trubin, V.G.; Dimitrov, L.V. Application of ultrasonic sensor for measuring distances in robotics. J. Phys. Conf. Ser. 2018, 1015, 032189. [Google Scholar] [CrossRef]
  3. Yasin, J.N.; Mohamed, S.A.; Haghbayan, M.H.; Heikkonen, J.; Tenhunen, H.; Plosila, J. Low-cost ultrasonic based object detection and collision avoidance method for autonomous robots. Int. J. Inf. Technol. 2021, 13, 97–107. [Google Scholar] [CrossRef]
  4. Laureti, S.; Mercuri, M.; Hutchins, D.A.; Crupi, F.; Ricci, M. Modified FMCW scheme for improved ultrasonic positioning and ranging of unmanned ground vehicles at distances <50 mm. Sensors 2022, 22, 9899. [Google Scholar] [PubMed]
  5. Moon, W.S.; Cho, B.S.; Jang, J.W.; Baek, K.R. A multi-robot positioning system using a multi-code ultrasonic sensor network and a Kalman filter. Int. J. Control Autom. Syst. 2010, 8, 1349–1355. [Google Scholar] [CrossRef]
  6. Lin, Q.; An, Z.; Yang, L. Rebooting ultrasonic positioning systems for ultrasound-incapable smart devices. In Proceedings of the 25th Annual International Conference on Mobile Computing and Networking, Los Cabos, Mexico, 21–25 October 2019; pp. 1–16. [Google Scholar]
  7. Jodhani, J.; Handa, A.; Gautam, A.; Rana, R. Ultrasonic non-destructive evaluation of composites: A review. Mater. Today Proc. 2023, 78, 627–632. [Google Scholar] [CrossRef]
  8. Rehbein, J.; Lorenz, S.J.; Holtmannspötter, J.; Valeske, B. 3D-visualization of ultrasonic ndt data using mixed reality. J. Nondestruct. Eval. 2022, 41, 26. [Google Scholar] [CrossRef]
  9. Hasan, M. Applications of Ultrasonic Testing (UT) for Irregularities Detection in Human Body and Materials: A Literature Review. Int. J. Occup. Hyg. 2021, 13, 91–104. [Google Scholar]
  10. Zeng, W.; Lu, T.; Liu, Z.; Xu, Q.; Peng, H.; Li, C.; Yang, S.; Yao, F. Research on a laser ultrasonic visualization detection method for human skin tumors based on pearson correlation coefficient. Opt. Laser Technol. 2021, 141, 107117. [Google Scholar] [CrossRef]
  11. Bi, D.; Shi, L.; Liu, C.; Li, B.; Li, Y.; Le, L.H.; Luo, J.; Wang, S.; Ta, D. Ultrasonic through-transmission measurements of human musculoskeletal and fat properties. Ultrasound Med. Biol. 2023, 49, 347–355. [Google Scholar] [CrossRef]
  12. Susilo, J.; Febriani, A.; Rahmalisa, U.; Irawan, Y. Car parking distance controller using ultrasonic sensors based on arduino uno. J. Robot. Control 2021, 2, 353–356. [Google Scholar] [CrossRef]
  13. Krishnan, P. Design of collision detection system for smart car using li-fi and ultrasonic sensor. IEEE Trans. Veh. Technol. 2018, 67, 11420–11426. [Google Scholar] [CrossRef]
  14. Tonmoy, A.B.R.; Zinan, M.S.; Sultan, S.; Sarker, A. A comparative study on LIDAR and Ultrasonic Sensor for Obstacle Avoidance Robot Car. In Proceedings of the 2023 International Conference on Advances in Electronics, Communication, Computing and Intelligent Information Systems, Bangalore, India, 19–21 April 2023; pp. 582–587. [Google Scholar]
  15. Diego, C.; Hernández, A.; Jiménez, A.; Alvarez, F.J.; Sanz, R.; Aparicio, J. Ultrasonic array for obstacle detection based on CDMA with Kasami codes. Sensors 2011, 11, 11464–11475. [Google Scholar] [CrossRef] [PubMed]
  16. Li, S.E.; Li, G.; Yu, J.; Liu, C.; Cheng, B.; Wang, J.; Li, K. Kalman filter-based tracking of moving objects using linear ultrasonic sensor array for road vehicles. Mech. Syst. Signal Process. 2018, 98, 173–189. [Google Scholar] [CrossRef]
  17. Shin, S.; Kim, M.H.; Choi, S.B. Ultrasonic distance measurement method with crosstalk rejection at high measurement rate. IEEE Trans. Instrum. Meas. 2018, 68, 972–979. [Google Scholar] [CrossRef]
  18. Medina, C.; Segura, J.C.; de la Torre, Á. A synchronous TDMA ultrasonic TOF measurement system for low-power wireless sensor networks. IEEE Trans. Instrum. Meas. 2012, 62, 599–611. [Google Scholar] [CrossRef]
  19. Cheng, P.; Zhang, F.; Chen, J.; Sun, Y.; Shen, X. A distributed TDMA scheduling algorithm for target tracking in ultrasonic sensor networks. IEEE Trans. Ind. Electron. 2012, 60, 3836–3845. [Google Scholar] [CrossRef]
  20. Haigh, S.; Kulon, J.; Partlow, A.; Rogers, P.; Gibson, C. Improved obstacle mitigation and localization accuracy in narrowband ultrasonic localization systems using Robcul algorithm. IEEE Trans. Instrum. Meas. 2020, 69, 2315–2324. [Google Scholar] [CrossRef]
  21. Chen, X.; Chen, Y.; Cao, S.; Zhang, L.; Zhang, X.; Chen, X. Acoustic indoor localization system integrating TDMA+ FDMA transmission scheme and positioning correction technique. Sensors 2019, 19, 2353. [Google Scholar] [CrossRef]
  22. Khyam, M.O.; Alam, M.J.; Lambert, A.J.; Garratt, M.A.; Pickering, M.R. High-precision OFDM-based multiple ultrasonic transducer positioning using a robust optimization approach. IEEE Sens. J. 2016, 16, 5325–5336. [Google Scholar] [CrossRef]
  23. Stojanovic, M.; Freitag, L. Multichannel detection for wideband underwater acoustic CDMA communications. IEEE J. Ocean. Eng. 2006, 31, 685–695. [Google Scholar] [CrossRef]
  24. Toru, I.; Yasuda, Y.; Sato, S.; Izumi, S.; Kawaguchi, H. Millimeter-precision ultrasonic DSSS positioning technique with geometric triangle constraint. IEEE Sens. J. 2022, 22, 16202–16211. [Google Scholar] [CrossRef]
  25. Pérez-Rubio, M.C.; Hernández, Á.; Gualda-Gómez, D.; Murano, S.; Vicente-Ranera, J.; Ciudad-Fernández, F.; Nieto, R. Simulation Tool and Online Demonstrator for CDMA-Based Ultrasonic Indoor Localization Systems. Sensors 2022, 22, 1038. [Google Scholar] [CrossRef]
  26. Suzuki, A.; Kumakura, K.; Choi, Y.; Iyota, T. Accuracy of distance measurements using signal tracking of spread-spectrum ultrasonic waves with CDMA. In Proceedings of the 2014 International Conference on Indoor Positioning and Indoor Navigation, Busan, Republic of Korea, 27–30 October 2014; pp. 575–581. [Google Scholar]
  27. Oberdorfer, M.; Esslinger, D.; Benz, G.; Sawodny, O.; Tarin, C. Robustness enhancements of time-of-flight measurements in a CDMA ultrasonic channel of an opto-acoustic indoor positioning system using MEMS microphones. In Proceedings of the 2020 IEEE International Ultrasonics Symposium, Las Vegas, NV, USA, 7–11 September 2020; pp. 1–6. [Google Scholar]
  28. Oetting, J.A. comparison of modulation techniques for digital radio. IEEE Trans. Commun. 1979, 27, 1752–1762. [Google Scholar] [CrossRef]
  29. Gardner, F.A. BPSK/QPSK timing-error detector for sampled receivers. IEEE Trans. Commun. 1986, 34, 423–429. [Google Scholar] [CrossRef]
  30. Watson, B. FSK: Signals and demodulation. Watkins–Johns. Co. Tech–Notes 1980, 7, 5. [Google Scholar]
  31. Park, G.R.; Park, S.H.; Baek, K.R. Frequency Sweep Keying CDMA for Reducing Ultrasonic Crosstalk. Sensors 2022, 22, 4462. [Google Scholar] [CrossRef]
  32. Rapp, P.; Sawodny, O.; Taŕn, C. Opto-acoustic distance measurement using spread spectrum techniques and carrier phase measurements. In Proceedings of the 2015 IEEE Conference on Control Applications, Sydney, Australia, 21–23 September 2015; pp. 1461–1466. [Google Scholar]
  33. Esslinger, D.; Rapp, P.; Sawodny, O.; Tarin, C. High precision opto-acoustic BPSK-CDMA distance measurement for object tracking. In Proceedings of the 2018 IEEE International Conference on Systems, Man, and Cybernetics, Miyazaki, Japan, 7–10 October 2018; pp. 2898–2905. [Google Scholar]
  34. Segers, L.; Van Bavegem, D.; De Winne, S.; Braeken, A.; Touhafi, A.; Steenhaut, K. An ultrasonic multiple-access ranging core based on frequency shift keying towards indoor localization. Sensors 2015, 15, 18641–18665. [Google Scholar] [CrossRef] [PubMed]
  35. Ren, S.; He, K.; Girshick, R.; Sun, J. Faster r-cnn: Towards real-time object detection with region proposal networks. In Proceedings of the Advances in Neural Information Processing Systems, Montreal, Canada, 7–12 December 2015; p. 28. [Google Scholar]
  36. Fan, Z.; Rudlin, J.; Asfis, G.; Meng, H. Convolution of Barker and Golay Codes for Low Voltage Ultrasonic Testing. Technologies 2019, 7, 72. [Google Scholar] [CrossRef]
  37. Ding, Z.X.; Payne, P.A. A new Golay code system for ultrasonic pulse echo measurements. Meas. Sci. Technol. 1990, 1, 158. [Google Scholar] [CrossRef]
  38. Schröder, A.; Henning, B. Signal optimization of PSK modulated gold-sequences for narrow band transducers. In Proceedings of the 2014 IEEE International Ultrasonics Symposium, Chicago, IL, USA, 3–6 September 2014; pp. 552–555. [Google Scholar]
  39. Zhenjing, Y.; Li, H.; Yanan, L. Improvement of measurement range via chaotic binary frequency shift keying excitation sequences for multichannel ultrasonic ranging system. Int. J. Control Autom. 2016, 9, 189–200. [Google Scholar] [CrossRef]
  40. Nakahira, K.; Okuma, S.; Kodama, T.; Furuhashi, T. The use of binary coded frequency shift keyed signals for multiple user sonar ranging. IEEE Int. Conf. Netw. Sens. Control 2004, 2, 1271–1275. [Google Scholar]
  41. Le Roux, J.; Kameoka, H.; Ono, N.; Sagayama, S. Fast signal reconstruction from magnitude STFT spectrogram based on spectrogram consistency. Proc. DAFx 2010, 10, 397–403. [Google Scholar]
  42. Chen, X.; Wang, Z.; Hua, Q.; Shang, W.L.; Luo, Q.; Yu, K. AI-empowered speed extraction via port-like videos for vehicular trajectory analysis. IEEE Trans. Intell. Transp. Syst. 2022, 24, 4541–4552. [Google Scholar] [CrossRef]
  43. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
  44. Wong, T.T.; Yeh, P.Y. Reliable accuracy estimates from k-fold cross validation. IEEE Trans. Knowl. Data Eng. 2019, 32, 1586–1594. [Google Scholar] [CrossRef]
Figure 1. One-bit frequency sweep keying signal on the spectrogram. The x-axis represents time and the y-axis represents frequency. The dotted line represents the center frequency of the ultrasonic sensor and the time at that frequency.
Figure 1. One-bit frequency sweep keying signal on the spectrogram. The x-axis represents time and the y-axis represents frequency. The dotted line represents the center frequency of the ultrasonic sensor and the time at that frequency.
Sensors 23 09550 g001
Figure 2. Process of a frequency sweep keying CDMA using Faster R-CNN.
Figure 2. Process of a frequency sweep keying CDMA using Faster R-CNN.
Sensors 23 09550 g002
Figure 3. Segmentation results in the time domain (top) and spectrogram (bottom). Red represents the front signal and orange represents the rear signal.
Figure 3. Segmentation results in the time domain (top) and spectrogram (bottom). Red represents the front signal and orange represents the rear signal.
Sensors 23 09550 g003
Figure 4. Ultrasonic transceiver system block diagram.
Figure 4. Ultrasonic transceiver system block diagram.
Sensors 23 09550 g004
Figure 5. Ultrasonic transmission/reception environment: (a) ultrasonic transducer; (b) experimental environment.
Figure 5. Ultrasonic transmission/reception environment: (a) ultrasonic transducer; (b) experimental environment.
Sensors 23 09550 g005
Figure 6. Precision of each class as a validation result.
Figure 6. Precision of each class as a validation result.
Sensors 23 09550 g006
Figure 7. Results of comparing the DCER according to the overlap ratio.
Figure 7. Results of comparing the DCER according to the overlap ratio.
Sensors 23 09550 g007
Figure 8. Results of comparing the detection rates according to the overlap ratio.
Figure 8. Results of comparing the detection rates according to the overlap ratio.
Sensors 23 09550 g008
Table 1. Experimental parameters.
Table 1. Experimental parameters.
ParameterDescriptionValueUnit
f c Ultrasonic transducer center frequency40kHz
Ultrasonic transducer frequency bandwidth4kHz
f s ADC sampling rate1MHz
f Sweep frequency range8kHz
t Sweep time2ms
Table 2. Comparison of the TOF error according to the ID length of the proposed method and the existing methods. Measured in millisecond scale.
Table 2. Comparison of the TOF error according to the ID length of the proposed method and the existing methods. Measured in millisecond scale.
Code LengthProposedFSWK-CorrOOKFSKPSK
20.061 ± 0.0190.362 ± 0.07366.792 ± 29.13566.268 ± 29.84768.441 ± 30.362
40.064 ± 0.0200.324 ± 0.05932.235 ± 15.34932.379 ± 15.89433.618 ± 16.168
80.062 ± 0.0190.326 ± 0.1213.215 ± 1.5484.782 ± 2.16111.272 ± 4.558
160.064 ± 0.0190.272 ± 0.0752.371 ± 1.2994.185 ± 2.33512.185 ± 5.654
320.059 ± 0.0180.249 ± 0.0430.548 ± 0.0241.335 ± 0.88719.143 ± 8.548
640.058 ± 0.0170.314 ± 0.0540.064 ± 0.0190.248 ± 0.05221.151 ± 10.167
1280.057 ± 0.0170.291 ± 0.0390.059 ± 0.0180.206 ± 0.05716.688 ± 7.748
Table 3. Results of comparing the robustness of demodulator to heterogeneous signals.
Table 3. Results of comparing the robustness of demodulator to heterogeneous signals.
Modulation MethodDetection Rate/Wrong Detection Rate/Non-Detection Rate
Faster R-CNNFSWK-corrOOKPSKFSK
Proposed0.99/0.01/0.000.97/0.00/0.030.43/0.57/0.000.69/0.31/0.000.61/0.39/0.00
OOK0.01/0.01/0.990.26/0.30/0.440.94/0.06/0.000.53/0.47/0.000.44/0.56/0.00
PSK0.00/0.01/0.980.46/0.54/0.000.51/0.49/0.000.96/0.04/0.000.52/0.48/0.00
FSK0.14/0.07/0.790.31/0.42/0.270.47/0.53/0.000.42/0.58/0.000.97/0.03/0.00
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Park, G.-R.; Park, S.-H.; Baek, K.-R. Improved Frequency Sweep Keying CDMA Using Faster R-CNN for Extended Ultrasonic Crosstalk Reduction. Sensors 2023, 23, 9550. https://0-doi-org.brum.beds.ac.uk/10.3390/s23239550

AMA Style

Park G-R, Park S-H, Baek K-R. Improved Frequency Sweep Keying CDMA Using Faster R-CNN for Extended Ultrasonic Crosstalk Reduction. Sensors. 2023; 23(23):9550. https://0-doi-org.brum.beds.ac.uk/10.3390/s23239550

Chicago/Turabian Style

Park, Ga-Rin, Sang-Ho Park, and Kwang-Ryul Baek. 2023. "Improved Frequency Sweep Keying CDMA Using Faster R-CNN for Extended Ultrasonic Crosstalk Reduction" Sensors 23, no. 23: 9550. https://0-doi-org.brum.beds.ac.uk/10.3390/s23239550

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop