PDE-Constrained Scale Optimization Selection for Feature Detection in Remote Sensing Image Matching

Peng, Yunchao; Zhou, Bin; Qi, Feng

doi:10.3390/math12121882

by Yunchao Peng ¹, Bin Zhou ²,³,* and Feng Qi ¹

¹ PipeChina Network Corporation Eastern Oil Storage and Transportation Co., Ltd., Xuzhou 221008, China

² School of Sciences, Southwest Petroleum University, Chengdu 610500, China

³ Institute of Artificial Intelligence, Southwest Petroleum University, Chengdu 610500, China

* Author to whom correspondence should be addressed.

Mathematics 2024, 12(12), 1882; https://0-doi-org.brum.beds.ac.uk/10.3390/math12121882

Submission received: 8 May 2024 / Revised: 4 June 2024 / Accepted: 11 June 2024 / Published: 17 June 2024

(This article belongs to the Special Issue Mathematics and Its Applications in Science and Engineering, 3rd Edition)


Abstract:

Feature detection and matching is the key technique for remote sensing image processing and related applications. In this paper, a PDE-constrained optimization model is proposed to determine the scale levels advantageous for feature detection. A variance estimation technique is introduced to treat the observation optical images polluted by additive zero-mean Gaussian noise and determine the parameter of a nonlinear scale space governed by the partial differential equation. Additive Operator Splitting is applied to efficiently solve the PDE constraint, and an iterative algorithm is proposed to approximate the optimal subset of the original scale level set. The selected levels are distributed more uniformly in the total variation sense and helpful for generating more accurate and robust feature points. The experimental results show that the proposed method can achieve about a 30% improvement in the number of correct matches with only a small increase in time cost.

Keywords:

constrained optimization; scale space; noise estimation; additive operator splitting

MSC:

90C30; 35Q68; 35Q90

1. Introduction

Remote sensing refers to the detection of targets through various sensors mounted on different aircraft, and the obtained remote sensing image data often carry a large amount of information, requiring the use of different efficient processing techniques for practical purposes [1,2]. Image matching is an important technical foundation in remote sensing image processing, and the effectiveness of registration often directly affects the subsequent image processing results [3,4,5,6].

Work on image matching can be traced back to P.E. Anuta’s work on cross-correlation image detection [3] and Brown’s survey of earlier work [4], and the field has since produced numerous influential contributions. For image pairs with significant deformation, Flusser proposed an adaptive mapping method that segments the image into multiple sub-blocks, calculates the similarity between the sub-blocks, matches them using that similarity, and registers the original image using the coordinate relationship between the sub-blocks [7]. Studholme proposed a normalized mutual information method that overcomes the sensitivity of traditional mutual information methods to the size of the overlapping region between images and achieves good matching results [8]. David Lowe proposed the classic Scale Invariant Feature Transform (SIFT), which combines a scale space pyramid with the Gaussian kernel to improve the invariance and detection rate of feature points [9,10]. The SIFT feature remains stable under changes in image grayscale, angle, scale, etc., and performs well in natural image matching. Several techniques based on SIFT have been developed in subsequent years. Ke and Sukthankar introduced principal component analysis to replace the weighted histograms and reduce the dimension of the features to 20 [11]. Speeded-up robust features (SURF) were presented based on a Hessian matrix and image convolutions to speed up the computation [12]. Dellinger et al. proposed an improved SIFT registration algorithm for Synthetic Aperture Radar (SAR) images called SAR-SIFT [13]. This algorithm uses a new gradient calculation method to effectively suppress the impact of noise in SAR images. Alcantarilla et al. proposed a novel multiscale 2D feature detection and description algorithm in nonlinear scale spaces, named KAZE [14,15]. The results showed a step forward in both detection and description performance compared with some traditional methods.

Gong et al. combined two matching strategies, feature matching and regional matching, in a coarse-to-fine registration method, which first uses the SIFT algorithm to coarsely register the image and then uses mutual information to evaluate the initial matching points and find accurate matching point pairs [16]. Wu et al. improved the Random Sample Consensus (RANSAC) algorithm and proposed a fast iterative strategy to filter feature point pairs and obtain more accurate transformation parameters, thereby achieving accurate matching of remote sensing images [17]. To address the failure of traditional registration algorithms under significant changes in image grayscale, Ma et al. proposed a Position Scale and Orientation Scale Invariant Feature Transform (PSO-SIFT) algorithm based on SIFT, which uses a new gradient definition to calculate feature point descriptors [6]. With it, accurate matching between remote sensing image pairs with significant grayscale changes has been achieved. Wu et al. proposed the Particle Swarm Optimization Sample Consensus (PSOSAC) algorithm, which builds on RANSAC and uses particle swarm optimization to refine the model parameters solved by RANSAC to obtain more accurate matching results [18]. Other techniques have also been introduced to achieve better performance [19,20,21,22,23,24].

Most of these studies are based on a Gaussian scale space and have been successful in many problems. However, further exploration is still needed for more complex scenes, such as heterogeneous image matching, multi-modal image matching, and medical image matching [25,26,27]. A remote sensing image differs considerably from a natural image, not only in acquisition and transmission but also in information presentation and visual characteristics [1,28,29]. It is therefore necessary to develop more suitable and efficient feature description methods for remote sensing image matching. Scale selection techniques can be found in various image processing tasks. Olson presented a method in which the scale of the filtering and feature detection is varied locally according to the distance to the scene pixel estimated through stereoscopy [30]. Energy models inspired by Markov random fields and segmentation quality measures can be introduced to determine scales for different purposes [31,32]. PDE-constrained optimization has attracted a great deal of attention in many fields [33,34]. In such problems, a certain functional is optimized with one or more PDEs serving as constraints.

This paper presents a novel scale optimization selection technique for local feature description in remote sensing image matching. A nonlinear partial differential equation is configured with the noise estimation results to generate a continuous nonlinear scale space. Then, a PDE-constrained model is proposed to determine the optimized scale levels. It can be solved numerically based on Additive Operator Splitting (AOS) and subset selection. After calculation steps similar to those of KAZE and SIFT are performed on the scale space, the feature points can be found for remote sensing image matching.

The rest of this paper is arranged as follows. In Section 2, the linear scale space and the nonlinear scale space are introduced. In Section 3, an evolution partial differential equation configured with noise estimation is applied to describe a nonlinear scale space, and a PDE-constrained model is then presented to optimally select the scale levels. In Section 4, several experiments carried out to verify the accuracy and efficiency of the proposed algorithm are discussed.

2. Fundamentals and Basis

2.1. Linear Scale Space

Scale space theory has been introduced in many models, and it can help to distinguish the features at different levels. After a scale parameter is added to the related model, a multi-scale representation sequence can be computed following a change in the parameter. Many tasks can be achieved by analyzing or processing the sequence, such as extracting features.

The scale space method can simulate the process of people observing objects from near to far, and the continuous change in scale can effectively describe the essential features of the image. The Scale Invariant Feature Transform (SIFT) uses a Gaussian pyramid to represent the linear scale space; that is, it convolves a series of Gaussian kernel functions with the original image function to obtain smoothed images at different scales [10,35]. It can be formulated as

$$ I(x, t) = I(x) * g_{t}(x) = I(x) * \frac{1}{4 \pi t^{2}} \exp\left(-\frac{|x|^{2}}{4 t^{2}}\right), \quad (x, t) \in \Omega \times \Gamma. \tag{1} $$

Here, $t$ denotes the scale parameter, and $x$ denotes the pixel position. $\Omega \subset \mathbb{R}^{2}$ denotes the image domain, and $\Gamma$ denotes the time interval. $I(x)$ is the original image, and $I(x, t)$ is the corresponding linear scale space. The value of $t$ represents the degree of smoothing applied to the original image: the larger the value, the stronger the smoothing. Image features at different scales can then be obtained through the following steps: calculating the Difference of Gaussian (DoG), searching for extreme points, determining the key points (interest points) and the main direction, and generating feature descriptors.

The discrete difference of Gaussian space can be denoted as

$$ \mathrm{DoG}(x, t) = I(x, t + \Delta t) - I(x, t). \tag{2} $$

Here, $\Delta t$ can be preset or determined according to the image data. The first two rows in Figure 1 show different Gaussian kernels and the corresponding smoothed images. The last row shows the difference between two neighboring images. As the scale parameter $t$ increases, the original image is gradually smoothed, and the feature information becomes redundant. A downsampling technique is adopted in SIFT to obtain a more compact (smaller) version for later calculations.
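As a minimal sketch of Equations (1) and (2), the following Python code builds one level of the Gaussian scale space and one DoG slice using NumPy only. The function names, the truncation of the kernel at three standard deviations, and the reflective border handling are illustrative assumptions; they are not taken from the SIFT implementation. Note that the kernel in Equation (1) corresponds to a Gaussian with standard deviation $\sqrt{2} t$.

```python
import numpy as np


def gaussian_kernel_1d(t: float) -> np.ndarray:
    """Sampled 1D factor of g_t in Equation (1); its standard deviation is sqrt(2) * t."""
    sigma = np.sqrt(2.0) * t
    radius = max(1, int(np.ceil(3.0 * sigma)))  # assumed truncation at 3 sigma
    x = np.arange(-radius, radius + 1, dtype=float)
    kernel = np.exp(-(x ** 2) / (4.0 * t ** 2))
    return kernel / kernel.sum()  # normalise so constant images stay constant


def smooth(image: np.ndarray, t: float) -> np.ndarray:
    """Approximate I(x, t) = I * g_t with two separable 1D convolutions."""
    kernel = gaussian_kernel_1d(t)
    radius = len(kernel) // 2
    padded = np.pad(image.astype(float), radius, mode="reflect")
    # Filter along rows, then along columns; "valid" removes the padding again.
    rows = np.apply_along_axis(lambda v: np.convolve(v, kernel, mode="valid"), 1, padded)
    return np.apply_along_axis(lambda v: np.convolve(v, kernel, mode="valid"), 0, rows)


def dog(image: np.ndarray, t: float, dt: float) -> np.ndarray:
    """Discrete Difference of Gaussian from Equation (2)."""
    return smooth(image, t + dt) - smooth(image, t)


if __name__ == "__main__":
    demo = np.random.default_rng(0).random((32, 32))  # synthetic stand-in for an image
    print(dog(demo, t=1.0, dt=0.5).shape)
```

The separable form is chosen because a 2D Gaussian factorizes into two 1D Gaussians, which keeps the cost linear in the kernel radius.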

Based on the scale space and the difference space, the original SIFT can be summarized as two main steps (see [10,36]).

2.1.1. Calculating the DoG and Determining the Key Points

$$ K = \left\{ x \in P : \left| -\left( \frac{\partial^{2} \mathrm{DoG}}{\partial x^{2}} \right)^{-1} \frac{\partial \mathrm{DoG}}{\partial x} \right| < \frac{1}{2} \;\land\; |\mathrm{DoG}(x)| \geq 0.03 \;\land\; \frac{\mathrm{Tr}(H)^{2}}{|H|} < \frac{(r + 1)^{2}}{r} \right\}. \tag{3} $$

Here, $H$ denotes the Hessian of the DoG, and $P = \{\arg\max_{z \in B} \mathrm{DoG}(z)\}$ denotes the set of extreme points found in basic subareas of the image domain. For example, a subarea can be a disc $B(x, d)$ centered at pixel $x$ with radius $d$. The second derivative of the DoG in the first condition is the Hessian in 3D space (two spatial dimensions and one scale dimension), which differs from the 2D spatial Hessian $H$ in the third condition. The second condition keeps only extrema with sufficient contrast; extrema with a DoG magnitude below 0.03 are sensitive to noise and are discarded [10]. $\land$ denotes the Boolean operator AND, $\mathrm{Tr}$ denotes the trace operator, and $|H|$ denotes the determinant. $r$ is a threshold used to determine good key points.
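The three tests that define a key point can be sketched in Python as follows. This is an illustration under stated assumptions rather than the SIFT reference code: the function names are hypothetical, the default `r = 10` is the value reported by Lowe [10], and, following Lowe's description, low-contrast extrema (DoG magnitude below 0.03) and extrema with a non-positive Hessian determinant are rejected.

```python
import numpy as np


def subpixel_offset(gradient_3d: np.ndarray, hessian_3d: np.ndarray) -> np.ndarray:
    """Offset -(d^2 DoG / dx^2)^(-1) (d DoG / dx) in (x, y, scale) coordinates."""
    return -np.linalg.solve(hessian_3d, gradient_3d)


def is_keypoint(dog_value: float, offset: np.ndarray, hessian_2d: np.ndarray,
                r: float = 10.0, contrast: float = 0.03) -> bool:
    """Apply the offset, contrast, and edge-response tests to one extreme point."""
    offset_ok = bool(np.all(np.abs(offset) < 0.5))   # extremum lies closest to this sample
    contrast_ok = abs(dog_value) >= contrast         # reject low-contrast extrema
    trace = np.trace(hessian_2d)
    det = np.linalg.det(hessian_2d)
    # A non-positive determinant means curvatures of opposite sign: not a stable extremum.
    edge_ok = det > 0 and trace ** 2 / det < (r + 1) ** 2 / r
    return offset_ok and contrast_ok and bool(edge_ok)


if __name__ == "__main__":
    blob_like = np.array([[-2.0, 0.0], [0.0, -2.0]])    # similar curvature in both directions
    edge_like = np.array([[-50.0, 0.0], [0.0, -1.0]])   # one dominant curvature
    print(is_keypoint(0.05, np.zeros(3), blob_like))
    print(is_keypoint(0.05, np.zeros(3), edge_like))
```

The ratio test uses only the trace and determinant of the 2D Hessian, so the eigenvalues never have to be computed explicitly.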

2.1.2. Orientation Assignment and Generating Feature Descriptors

$$ \mathrm{Feats}_{SIFT}(x) = \left[ \mathrm{histogram}\left( \nabla R_{\theta} I(B_{i,j}), \theta_{d} \right) \right]_{4 \times 4 \times 8} \tag{4} $$

$$ \theta(B) = \underset{\theta_{d}}{\arg\max} \; \mathrm{histogram}(\nabla B(x), \theta_{d}). $$

Here, $\theta_{d}$, $d = 1, 2, \dots, 8$, are the eight preset direction angles. $B_{i,j}$ is a subdomain of $B(x)$, which is the neighborhood centered at keypoint $x$. $R_{\theta}$ denotes the rotation matrix with angle $\theta$, where $\theta = \theta(B)$ is the main direction, i.e., the direction with the largest histogram value. $\nabla$ denotes the gradient operator. The neighborhood is divided into $4 \times 4$ sub-regions, and eight direction statistics are calculated on each corresponding gradient field. Then, the statistics on the 16 sub-regions are collected to form a feature vector with dimensions $4 \times 4 \times 8$.
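The descriptor layout in Equation (4) can be illustrated with a simplified Python sketch. It assumes a square patch whose side is divisible by 4, weights each histogram vote by the gradient magnitude, and rotates the gradient vectors by the main direction instead of resampling the patch; the Gaussian weighting and trilinear interpolation of the original SIFT are omitted. All function names are hypothetical.

```python
import numpy as np


def orientation_histogram(gx: np.ndarray, gy: np.ndarray, n_bins: int) -> np.ndarray:
    """Histogram of gradient directions, each vote weighted by the gradient magnitude."""
    magnitude = np.hypot(gx, gy)
    angle = np.arctan2(gy, gx) % (2.0 * np.pi)
    bins = np.floor(angle / (2.0 * np.pi) * n_bins).astype(int) % n_bins
    return np.bincount(bins.ravel(), weights=magnitude.ravel(), minlength=n_bins)


def sift_like_descriptor(patch: np.ndarray) -> np.ndarray:
    """Return a normalised 4 x 4 x 8 = 128-dimensional descriptor of a square patch."""
    gy, gx = np.gradient(patch.astype(float))
    # Main direction: the preset angle whose histogram bin is largest.
    theta = np.argmax(orientation_histogram(gx, gy, 8)) * (2.0 * np.pi / 8.0)
    # Express the gradients relative to the main direction (rotation by -theta).
    c, s = np.cos(theta), np.sin(theta)
    gx_rot = c * gx + s * gy
    gy_rot = -s * gx + c * gy
    cell = patch.shape[0] // 4
    descriptor = np.zeros((4, 4, 8))
    for i in range(4):
        for j in range(4):
            window = (slice(i * cell, (i + 1) * cell), slice(j * cell, (j + 1) * cell))
            descriptor[i, j] = orientation_histogram(gx_rot[window], gy_rot[window], 8)
    vector = descriptor.ravel()
    norm = np.linalg.norm(vector)
    return vector / norm if norm > 0 else vector


if __name__ == "__main__":
    patch = np.random.default_rng(1).random((16, 16))  # synthetic neighbourhood B(x)
    print(sift_like_descriptor(patch).shape)
```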

2.2. Nonlinear Scale Space

Stable feature points at different scales can be extracted from the linear scale space established above. However, while smoothing noise, Gaussian convolution also smooths image details and edges, which affects the accuracy and uniqueness of the feature points. Several methods have been introduced in recent years to remedy this defect and have achieved good results in applications [11,14,19,37,38]. The KAZE features are extracted from a nonlinear scale space that is generated using a partial differential equation. Such equations have also been validated on problems in image denoising and restoration. In general, a nonlinear scale space can be defined as the solution of a nonlinear partial differential equation:

$$ \frac{\partial I}{\partial t}(x, t) = \mathrm{div}\left( c(|\nabla I|) \cdot \nabla I \right) \tag{5} $$

Here, $c(|\nabla I|)$ is the diffusion velocity, and the equation degenerates into the linear version if $c$ is a constant. Unlike the linear scale space, the nonlinear scale space has no explicit formulation; it is defined only implicitly by Equation (5).

The continuous nonlinear scale space is determined by the diffusion velocity and the evolution time, and many configurations exist for both. For the velocity, several useful formulations are the following [14]:

$$ c_{1}(s) = \exp\left(-\frac{s^{2}}{b^{2}}\right), \quad c_{2}(s) = \frac{1}{1 + \frac{s^{2}}{b^{2}}}, \quad c_{3}(s) = \begin{cases} 1, & s = 0 \\ 1 - \exp\left(-\frac{3.315}{s^{8} / b^{8}}\right), & s > 0 \end{cases} \tag{6} $$
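For reference, the three velocities in Equation (6) can be written directly in Python. The function names mirror the symbols of the equation; the vectorized NumPy form and the guard against division by zero in `c3` are implementation choices made for this sketch.

```python
import numpy as np


def c1(s, b):
    """c_1(s) = exp(-s^2 / b^2)."""
    s = np.asarray(s, dtype=float)
    return np.exp(-(s ** 2) / b ** 2)


def c2(s, b):
    """c_2(s) = 1 / (1 + s^2 / b^2)."""
    s = np.asarray(s, dtype=float)
    return 1.0 / (1.0 + s ** 2 / b ** 2)


def c3(s, b):
    """c_3(s) = 1 for s = 0 and 1 - exp(-3.315 / (s / b)^8) for s > 0."""
    s = np.asarray(s, dtype=float)
    ratio8 = (s / b) ** 8
    safe = np.where(ratio8 > 0, ratio8, 1.0)  # placeholder value avoids dividing by zero
    with np.errstate(over="ignore"):
        return np.where(ratio8 > 0, 1.0 - np.exp(-3.315 / safe), 1.0)


if __name__ == "__main__":
    gradient_magnitudes = np.array([0.0, 0.5, 1.0, 2.0, 10.0])
    for velocity in (c1, c2, c3):
        print(velocity.__name__, velocity(gradient_magnitudes, b=1.0))
```

All three functions equal 1 at $s = 0$ and decrease as $s$ grows, so diffusion is strongest in flat regions and weakest across strong edges; $b$ sets the gradient magnitude at which the transition happens.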

As there is no analytical solution for Equation (5), a numerical technique such as Additive Operator Splitting (AOS) is needed to approximate the solution at different scales $t$. The discretization of the equation can be written as

$$ \frac{I^{n+1} - I^{n}}{\tau} = \sum_{l=1}^{m} A_{l}(I^{n}) I^{n+1} \tag{7} $$

and an efficient semi-implicit scheme can be obtained as

$$ I^{n+1} = \left[ 1 - \tau \sum_{l=1}^{m} A_{l}(I^{n}) \right]^{-1} I^{n}. \tag{8} $$

Here, $A_{l}$ is a matrix that encodes the image conductivities for each dimension, and this scheme is advantageous for large time steps. After taking some steps similar to those in SIFT, the image features can be generated.
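One AOS step can be sketched in Python as below. The sketch uses the standard AOS form $I^{n+1} = \frac{1}{m} \sum_{l=1}^{m} [1 - m \tau A_{l}(I^{n})]^{-1} I^{n}$, which approximates Equation (8) by averaging one-dimensional implicit solves; this splitting is not spelled out in the text above. It further assumes $m = 2$, the velocity $c_{2}$ from Equation (6), zero-flux boundaries, conductivities averaged between neighboring pixels, and no Gaussian pre-smoothing of the gradient. All function names are hypothetical.

```python
import numpy as np


def thomas(lower, diag, upper, rhs):
    """Solve many tridiagonal systems at once along axis 0 (arrays of shape (n, batch)).

    lower[0] and upper[-1] are ignored.
    """
    n = diag.shape[0]
    c_prime = np.zeros_like(diag)
    d_prime = np.zeros_like(diag)
    c_prime[0] = upper[0] / diag[0]
    d_prime[0] = rhs[0] / diag[0]
    for i in range(1, n):  # forward elimination
        denom = diag[i] - lower[i] * c_prime[i - 1]
        c_prime[i] = upper[i] / denom
        d_prime[i] = (rhs[i] - lower[i] * d_prime[i - 1]) / denom
    x = np.empty_like(d_prime)
    x[-1] = d_prime[-1]
    for i in range(n - 2, -1, -1):  # back substitution
        x[i] = d_prime[i] - c_prime[i] * x[i + 1]
    return x


def diffuse_axis0(u, g, step):
    """Solve (1 - step * A(g)) v = u along axis 0 with zero-flux boundaries."""
    g_half = 0.5 * (g[1:] + g[:-1])              # conductivity between neighbouring rows
    zero = np.zeros((1,) + u.shape[1:])
    below = np.concatenate([zero, g_half])       # coupling of row i with row i - 1
    above = np.concatenate([g_half, zero])       # coupling of row i with row i + 1
    return thomas(-step * below, 1.0 + step * (below + above), -step * above, u)


def aos_step(image, tau, b):
    """One AOS step of Equation (5) with velocity c_2 and m = 2 dimensions."""
    u = image.astype(float)
    gy, gx = np.gradient(u)
    g = 1.0 / (1.0 + (gx ** 2 + gy ** 2) / b ** 2)
    m = 2
    along_rows = diffuse_axis0(u, g, m * tau)
    along_cols = diffuse_axis0(u.T, g.T, m * tau).T
    return (along_rows + along_cols) / m


if __name__ == "__main__":
    demo = np.random.default_rng(0).random((24, 20))  # synthetic stand-in for an image
    out = aos_step(demo, tau=5.0, b=0.1)
    print(demo.var(), out.var())
```

Each one-dimensional system is tridiagonal and diagonally dominant, so it can be solved in linear time for any $\tau > 0$; this is why the scheme tolerates the large time steps mentioned above.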

Figure 2 shows three nonlinear scale spaces based on the velocity $c_{2}(s)$ with different values of the contrast parameter $k$ (denoted $b$ in Equation (6)). The total variation decreases at different rates. If such a space is divided according to a fixed mode (such as uniformly or proportionally in the scale parameter), the differences between adjacent parts vary and are difficult to balance. This is disadvantageous for detecting image feature points and for achieving satisfactory performance in image matching.

3. Proposed Method

As shown above, the continuous nonlinear scale space $I(x, t)$ is completely determined by the velocity and the evolution time. Different diffusion velocities differ greatly in how they smooth or preserve potential features, so a proper velocity should be selected for each image. The noise level is another important factor in this choice. The number of scale levels is a further important parameter affecting feature point detection: if more scale levels are set, fewer feature points may be lost, but the computation cost is higher. Even if a proper number of levels is set, it is still difficult to determine which levels should be chosen. Although previous works offer some empirical and recommended selection methods, we present an optimization model to address these difficulties.

The surroundings captured in remote sensing images vary in complex ways, and the signal is susceptible to contamination. The type and intensity of image features also vary widely; a flexible, adjustable scale space is advantageous for detecting more characteristics, which benefits later applications.

It is important to obtain some information about the noise and the feature intensity before the parameters of the scale space are determined. Here, we introduce a filter to compute a noise estimate $\hat{n}$ from the image [39]:

$$ \hat{n} = I * h = I * \begin{bmatrix} 1 & -2 & 1 \\ -2 & 4 & -2 \\ 1 & -2 & 1 \end{bmatrix} \tag{9} $$

Then, the noise mean, noise variance, and Signal-to-Noise Ratio (SNR) can be estimated by

$$ \mu_{est} = \mathrm{Mean}(\hat{n}), \quad \sigma_{est}^{2} = \mathrm{Var}(\hat{n}), \quad SNR_{est} = 10 \log_{10} \frac{\sum [I - \hat{n}]^{2}}{\sum \hat{n}^{2}}. \tag{10} $$

Based on these, a new parameter configuration (11) is formulated for Equation (5) to generate a continuous nonlinear scale space:

$$ b = \frac{\mu_{est} \cdot SNR_{0}}{SNR_{est}}. \tag{11} $$

Here, $SNR_{0}$ is a reference SNR value that can be preset using prior knowledge of the image data.
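Equations (9) to (11) translate into a few lines of Python. The sketch below is a literal transcription under stated assumptions: the image border is handled by edge replication, the filtered values are used without rescaling, and the function names and the reference value passed as `snr_ref` are hypothetical.

```python
import numpy as np

# Mask h from Equation (9).
H_MASK = np.array([[1.0, -2.0, 1.0],
                   [-2.0, 4.0, -2.0],
                   [1.0, -2.0, 1.0]])


def filter3x3(image: np.ndarray, mask: np.ndarray) -> np.ndarray:
    """Apply a 3 x 3 mask with edge replication (the mask is symmetric, so
    correlation and convolution coincide)."""
    padded = np.pad(image.astype(float), 1, mode="edge")
    rows, cols = image.shape
    out = np.zeros((rows, cols))
    for di in range(3):
        for dj in range(3):
            out += mask[di, dj] * padded[di:di + rows, dj:dj + cols]
    return out


def estimate_noise(image: np.ndarray):
    """Return (mu_est, sigma_est^2, SNR_est) as defined in Equation (10)."""
    n_hat = filter3x3(image, H_MASK)
    snr = 10.0 * np.log10(np.sum((image - n_hat) ** 2) / np.sum(n_hat ** 2))
    return n_hat.mean(), n_hat.var(), snr


def contrast_parameter(image: np.ndarray, snr_ref: float) -> float:
    """Parameter b from Equation (11)."""
    mu, _, snr = estimate_noise(image)
    return mu * snr_ref / snr


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    demo = rng.uniform(0.0, 255.0, (64, 64)) + rng.normal(0.0, 5.0, (64, 64))
    print(estimate_noise(demo), contrast_parameter(demo, snr_ref=20.0))
```

The entries of the mask sum to zero, so constant regions are mapped to zero and mainly noise and fine detail remain. The squared entries sum to 36, so for unit-variance white noise the variance of $\hat{n}$ is close to 36; the sketch keeps this factor because Equation (10) applies no normalization.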

However, it is still difficult to determine which scale levels should be selected to compute the feature descriptors for the interest points. In this paper, it is assumed that the interest points are uniformly distributed in the scale space, so we divide the scale space into several parts over which the change in a chosen metric is equal. The related PDE-constrained optimization problem can then be expressed as

$$ (t_{1}, t_{2}, \dots, t_{N}) = \underset{t_{1}, t_{2}, \dots, t_{N}}{\arg\min} \sum_{p=0}^{N} \left[ \mathrm{Ene}(I(x, t_{p+1})) - \mathrm{Ene}(I(x, t_{p})) \right]^{2} \tag{12} $$

$$ \text{s.t.} \; \begin{cases} \frac{\partial I}{\partial t}(x, t) = \mathrm{div}\left( \frac{\nabla I}{1 + \left( \frac{SNR_{est}}{\mu_{est} \cdot SNR_{0}} \right)^{2} |\nabla I|^{2}} \right), & (x, t) \in \Omega \times (0, T), \\ I(x, 0) = I_{0}(x), & x \in \Omega, \\ \frac{\partial I}{\partial n}(x, t) = 0, & x \in \partial \Omega. \end{cases} \tag{13} $$

Here, $\mathrm{Ene}(I) = \| \nabla I \|_{2}^{2}$ denotes the image energy used to measure the variation of $I(x, t)$. The parameters $t_{0} = 0$ and $t_{N+1} = T$ are preset. The initial condition means that the scale space is generated from the observed image $I_{0}$. $\mathrm{div}$ denotes the divergence operator. This model determines $N$ scale levels $t_{1}, t_{2}, \dots, t_{N}$ that uniformly divide the scale space $I(x, t)$, $t \in [t_{0}, t_{N+1}]$.
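A short argument, added here as a sketch, shows why minimizing (12) leads to a uniform division. Write $d_{p}$ for the energy change between consecutive levels. Because $t_{0}$ and $t_{N+1}$ are fixed, the sum of the $d_{p}$ telescopes to a constant $D$, and the Cauchy-Schwarz inequality bounds the objective from below:

```latex
d_{p} = \mathrm{Ene}(I(x, t_{p+1})) - \mathrm{Ene}(I(x, t_{p})), \qquad
\sum_{p=0}^{N} d_{p} = \mathrm{Ene}(I(x, T)) - \mathrm{Ene}(I(x, 0)) =: D,
\qquad
\sum_{p=0}^{N} d_{p}^{2} \;\ge\; \frac{1}{N+1} \Big( \sum_{p=0}^{N} d_{p} \Big)^{2} = \frac{D^{2}}{N+1}.
```

Equality holds exactly when every $d_{p}$ equals $D / (N+1)$. Under the additional assumption that the energy is continuous and monotone in $t$, such levels exist, so the minimizer of (12) splits the total energy change into $N + 1$ equal parts.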

To find the optimal solution, the continuous scale space $I(x, t)$ must be solved from Equation (13), which requires a considerable amount of computation. To reduce the cost of solving for the whole scale space, a discrete version is introduced to approximate it, and model (12) can then be rewritten as

$$ (k_{1}, k_{2}, \dots, k_{N}) = \underset{k_{1}, k_{2}, \dots, k_{N}}{\arg\min} \sum_{p=0}^{N} \left[ \mathrm{Ene}(I^{k_{p+1}}) - \mathrm{Ene}(I^{k_{p}}) \right]^{2} \tag{14} $$

$$ \text{s.t.} \quad I^{n+1} = \left[ 1 - \tau \sum_{l=1}^{m} A_{l}(I^{n}) \right]^{-1} I^{n} \tag{15} $$

where $\tau$ denotes a sufficiently small time step, and (15) means that Equation (13) is solved using the AOS scheme.

Owing to the reference step size $\tau$, the original optimized scale levels $t_{1}, t_{2}, \dots, t_{N}$ are approximated by $k_{1} \tau, k_{2} \tau, \dots, k_{N} \tau$. The variables to be solved change from continuous to integer type, which greatly reduces the cost.

However, it is still not easy to properly choose $N$ levels from a level set with about $\frac{T}{\tau}$ elements; the task resembles a subset selection problem. To solve it efficiently, an iterative approximation is introduced in the proposed algorithm, Algorithm 1.

After setting the initial image $I^{0} = I(x, 0)$, the maximal scale level $T$, the maximal allowed error $\varepsilon$, the maximal iteration number $maxIter$, and the initial solution $k_{1}^{0}, k_{2}^{0}, \dots, k_{N}^{0}$, and then applying Algorithm 1, the optimal scale levels $k_{1}, \dots, k_{N}$ are determined, and the nonlinear scale space is divided into several parts that are uniform in terms of total variation. As shown in Figure 3, the nonlinear scale space of a remote sensing image is divided by five optimal scale levels, and the differences between adjacent scales are almost uniform.

Algorithm 1 Optimal Scale Iterative Approximation

Require: $I^{0} = I(x, 0)$, $T$, $\tau$, $\varepsilon$, $maxIter$, initial solution $k_{1}^{0}, k_{2}^{0}, \dots, k_{N}^{0}$.
Ensure: Optimal solution $k_{1}, k_{2}, \dots, k_{N}$
for $i = 1$ to $N$ do
    $k_{i} = k_{i}^{0}$;
end for
$E_{0} = \| \nabla I^{0} \|$, $k_{0} = 0$, $k_{N+1} = [\frac{T}{\tau}]$, $\Delta E = \infty$, $j = 0$;
while $\Delta E > \varepsilon$ and $j < maxIter$ do
    for $i = 1$ to $N + 1$ do
        $I^{k_{i}} = {[1 - \tau (k_{i} - k_{i-1}) \sum_{l=1}^{m} A_{l}(I^{k_{i-1}})]}^{-1} I^{k_{i-1}}$;
        $E_{i} = \| \nabla I^{k_{i}} \|$;
    end for
    $\Delta E = \sqrt{\frac{\sum_{i=1}^{N+1} {(E_{i} - E_{i-1})}^{2}}{N + 1}}$, $j = j + 1$;
    for $i = 1$ to $N$ do
        $k_{i} = k_{i} - [(k_{i+1} - k_{i-1}) \frac{E_{i} - (E_{0} + i \Delta E)}{E_{i+1} - E_{i-1}}]$;
    end for
end while
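The index update of Algorithm 1 can be illustrated with the following Python sketch. It is a simplified variant under loudly stated assumptions and not the authors' implementation: the energies come from a hypothetical callable `energy_at(k)` instead of an AOS solve; the target energy of level $i$ is taken as $E_{0} + i (E_{N+1} - E_{0}) / (N + 1)$, i.e., a signed uniform step; the loop stops when the deviation from that uniform step falls below `eps` or the indices no longer change; and a relaxation factor `relax` and a clamping step, neither of which appears in Algorithm 1, keep the indices ordered and damp oscillations.

```python
import math


def select_levels(energy_at, k_max, k_init, eps=1e-3, max_iter=50, relax=0.5):
    """Iteratively move N integer levels so that energy drops between levels are uniform."""
    n = len(k_init)
    k = [0] + list(k_init) + [k_max]          # k_0 = 0 and k_{N+1} = k_max stay fixed
    for _ in range(max_iter):
        e = [energy_at(ki) for ki in k]
        step = (e[-1] - e[0]) / (n + 1)       # signed uniform energy step (assumption)
        deviation = math.sqrt(
            sum((e[i] - e[i - 1] - step) ** 2 for i in range(1, n + 2)) / (n + 1))
        if deviation < eps:
            break
        new_k = list(k)
        for i in range(1, n + 1):
            # Secant estimate of dE/dk around level i, as in the update of Algorithm 1.
            slope = (e[i + 1] - e[i - 1]) / (k[i + 1] - k[i - 1])
            shift = 0 if slope == 0.0 else round(relax * (e[i] - (e[0] + i * step)) / slope)
            lowest = new_k[i - 1] + 1          # stay above the previous level
            highest = k_max - (n - i) - 1      # leave room for the remaining levels
            new_k[i] = min(max(k[i] - shift, lowest), highest)
        if new_k == k:
            break
        k = new_k
    return k[1:-1]


def objective(energy_at, levels, k_max):
    """Discrete objective of Equation (14) for a given list of levels."""
    ks = [0] + list(levels) + [k_max]
    return sum((energy_at(b) - energy_at(a)) ** 2 for a, b in zip(ks, ks[1:]))


def energy(k):
    """Synthetic, monotonically decaying energy curve used only for this demo."""
    return 100.0 * math.exp(-0.05 * k)


levels = select_levels(energy, 100, [20, 40, 60, 80])

if __name__ == "__main__":
    print(levels, objective(energy, levels, 100))
```

On the synthetic curve the selected levels crowd toward small $k$, where the energy changes fastest, which matches the intent of dividing the scale space by energy change rather than by the scale parameter.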

4. Experiments and Results

In this study, we arranged several experiments to illustrate the performance and advantages of the proposed method. In real-world remote sensing, the same scene is often observed from different positions or viewpoints with different surroundings, which can make it more difficult to detect the feature points and search for matches. Here, we focused on remote sensing images collected with optical sensors. For a comprehensive comparison, the number of feature points was used to evaluate the cost, and the number of matches and the number of correct matches were used to evaluate the efficiency. We also present the time cost of the compared methods.

In the first experiment, we prepared one remote sensing image and generated four pairs for the test. One image of each pair was the original image, and the other was generated by rotating and re-scaling the original image. The detected feature points and matching results are shown in Figure 4 and Figure 5. The detailed data (including time cost) can be found in Table 1 and Table 2.

For the first column of Figure 4, the rotation angle was $\frac{25 \pi}{180}$, and the scaling factor was $0.75$. For the next three columns, the scaling factor was fixed, and the rotation angle was set to $\frac{55 \pi}{180}$, $\frac{85 \pi}{180}$, and $\frac{115 \pi}{180}$, respectively. The rows indicate different methods: SIFT, SURF, BRISK, KAZE, and the proposed method.
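Because the second image of each pair is generated by a known rotation and scaling, a match can be checked against the ground-truth transform. The sketch below shows one way to count correct matches; the text does not state the criterion that was actually used, so the rotation about a given center, the pixel tolerance `tol`, and the function names are all assumptions.

```python
import numpy as np


def similarity_transform(angle: float, scale: float, center):
    """Rotation by `angle` (radians) and scaling by `scale` about `center`."""
    c, s = np.cos(angle), np.sin(angle)
    matrix = scale * np.array([[c, -s], [s, c]])
    center = np.asarray(center, dtype=float)
    offset = center - matrix @ center   # keeps the center fixed
    return matrix, offset


def count_correct(points_ref, points_test, angle, scale, center, tol=3.0) -> int:
    """Count matches whose reprojection error is at most `tol` pixels."""
    matrix, offset = similarity_transform(angle, scale, center)
    projected = np.asarray(points_ref, dtype=float) @ matrix.T + offset
    error = np.linalg.norm(projected - np.asarray(points_test, dtype=float), axis=1)
    return int(np.sum(error <= tol))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    ref = rng.uniform(0.0, 100.0, (10, 2))          # synthetic matched key points
    matrix, offset = similarity_transform(25 * np.pi / 180, 0.75, (50.0, 50.0))
    matched = ref @ matrix.T + offset
    matched[:3] += 20.0                             # simulate three wrong matches
    print(count_correct(ref, matched, 25 * np.pi / 180, 0.75, (50.0, 50.0)))
```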

SIFT finds the most feature points but few matches. SURF finds fewer feature points but obtains more matches than SIFT on average. BRISK finds the fewest matches, but none of them is erroneous. KAZE finds more correct matches than the three previously mentioned methods but fewer than the proposed method. In general, because of the optimized scale space, the proposed method achieves the best performance, with cost and efficiency well balanced.

In the second experiment, we tested the performance of the different methods under scaling. The rotation angle was fixed at $\frac{25 \pi}{180}$. The original image was scaled by different factors to generate four image pairs. The detected feature points and matching results are shown in Figure 6 and Figure 7. The detailed data (including time cost) can be found in Table 3 and Table 4.

The columns correspond to different scaling factors ($55\%$, $65\%$, $75\%$, $85\%$), and the rows correspond to different methods (SIFT, SURF, BRISK, KAZE, proposed). SIFT finds more feature points but fewer matches. SURF finds more matches than SIFT but is less accurate, especially when the scaling factor is smaller. BRISK finds the second most feature points but is not advantageous in terms of the number of correct matches. KAZE and the proposed method still find the most correct matches, but KAZE holds no obvious advantage in the number of matches compared with the other three methods. Our method does not find as many feature points but finds the most correct matches.

In the third experiment, the rotation angle was fixed at $\frac{25 \pi}{180}$, and the scale factor was fixed at $75\%$. Gaussian noise was added to the original image to prepare one image of the pair, and the polluted image was then rotated and scaled to generate the other image of the pair. The detected feature points and matching results are shown in Figure 8 and Figure 9. The detailed data (including time cost) can be found in Table 5 and Table 6.

The columns correspond to different noise intensities (SNR = 20 dB, 18 dB, 16 dB, 14 dB), and the rows correspond to different methods (SIFT, SURF, BRISK, KAZE, proposed).
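A noisy test image at a prescribed SNR can be produced as in the following sketch. It assumes that the SNR is defined as the ratio of the mean squared image intensity to the noise variance, expressed in dB; the text does not state which definition was used, and the function name is hypothetical.

```python
import numpy as np


def add_noise_at_snr(image: np.ndarray, snr_db: float, rng=None) -> np.ndarray:
    """Add zero-mean Gaussian noise so that the result has roughly the requested SNR."""
    rng = np.random.default_rng() if rng is None else rng
    signal_power = np.mean(image.astype(float) ** 2)
    noise_power = signal_power / 10.0 ** (snr_db / 10.0)
    return image + rng.normal(0.0, np.sqrt(noise_power), image.shape)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    demo = rng.uniform(0.0, 255.0, (200, 200))  # synthetic stand-in for an optical image
    for snr in (20.0, 18.0, 16.0, 14.0):
        noisy = add_noise_at_snr(demo, snr, rng)
        measured = 10.0 * np.log10(np.mean(demo ** 2) / np.mean((noisy - demo) ** 2))
        print(snr, measured)
```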

As shown in Figure 9, SIFT and BRISK find the most key points in each pair but have no advantage in finding correct matches. SURF finds the fewest key points but shows no obvious disadvantage in finding correct matches. KAZE finds a few more key points than SURF but achieves a similar number of correct matches. The proposed method finds slightly more key points than KAZE but achieves the largest number of correct matches.

In the fourth experiment, the rotation angle was fixed at $\frac{55 \pi}{180}$, and the scale factor was fixed at $65\%$. Four optical remote sensing images were prepared to be scaled and rotated. Feature detection and matching were performed on the four image pairs. The results are shown in Figure 10 and Figure 11 and in Table 7 and Table 8.

Similarly, the columns correspond to different remote sensing images (rs1, rs2, rs3, rs4), and the rows correspond to different methods (SIFT, SURF, BRISK, KAZE, proposed).

As shown in Figure 11, SIFT finds the most feature points in each pair but no more matches than the other methods. BRISK finds the second most feature points and achieves the fewest correct matches on average. SURF finds more feature points and correct matches than KAZE. The proposed method finds slightly more key points than KAZE and achieves the largest number of correct matches.

Across the above experiments, the proposed method shows the best overall performance among the five methods compared.

5. Conclusions

In this paper, a constrained optimization model is introduced to determine scale slices of a nonlinear scale space for remote sensing image feature detection and matching. With noise estimation used to determine the parameter values, a partial differential equation is applied to generate a continuous nonlinear scale space that is advantageous for representing the important features. The continuous scale space is approximated by a discrete one by solving the PDE with Additive Operator Splitting. Then, a discrete model is presented to quickly approach the optimal scale levels for feature detection and image matching. Four experiments were arranged to test the performance of the proposed method under rotation, scaling, noise, and a mixed setting. The results show the accuracy and efficiency of the proposed method: compared with the other methods, it achieves about a 30% improvement in the number of correct matches with only a small increase in time cost. In future work, more effort will be made to improve the proposed method and to develop more efficient and robust methods for complex scenes.

Author Contributions

Conceptualization, Y.P. and B.Z.; resources, F.Q.; data curation, Y.P.; writing—original draft preparation, F.Q. and B.Z.; writing—review and editing, Y.P. and F.Q.; visualization, B.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported in part by PipeChina Select the Best Candidates to Undertake Key Research Projects (WZXGL202106).

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

Authors Yunchao Peng and Feng Qi were employed by the company PipeChina Network Corporation Eastern Oil Storage and Transportation Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Li, X.; Zhang, L.; Du, B.; Zhang, L.; Shi, Q. Iterative reweighting heterogeneous transfer learning framework for supervised remote sensing image classification. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2017, 10, 2022–2035. [Google Scholar] [CrossRef]
Li, Z.; Yue, J.; Fang, L. Adaptive regional multiple features for large-scale high-resolution remote sensing image registration. IEEE Trans. Geosci. Remote Sens. 2022, 60, 1–13. [Google Scholar] [CrossRef]
Anuta, P.E. Spatial registration of multispectral and multitemporal digital imagery using fast Fourier transform techniques. IEEE Trans. Geosci. Electron. 1970, 8, 353–368. [Google Scholar] [CrossRef]
Brown, L.G. A survey of image registration techniques. ACM Comput. Surv. (CSUR) 1992, 24, 325–376. [Google Scholar] [CrossRef]
Zhang, X.; Zhou, Y.; Qiao, P.; Lv, X.; Li, J.; Du, T.; Cai, Y. Image Registration Algorithm for Remote Sensing Images Based on Pixel Location Information. Remote Sens. 2023, 15, 436. [Google Scholar] [CrossRef]
Ma, W.; Wen, Z.; Wu, Y.; Jiao, L.; Gong, M.; Zheng, Y.; Liu, L. Remote sensing image registration with modified SIFT and enhanced feature matching. IEEE Geosci. Remote Sens. Lett. 2016, 14, 3–7. [Google Scholar] [CrossRef]
Flusser, J. An adaptive method for image registration. Pattern Recognit. 1992, 25, 45–54. [Google Scholar] [CrossRef]
Studholme, C.; Hill, D.L.; Hawkes, D.J. An overlap invariant entropy measure of 3D medical image alignment. Pattern Recognit. 1999, 32, 71–86. [Google Scholar] [CrossRef]
Lowe, D.G. Object recognition from local scale-invariant features. In Proceedings of the Seventh IEEE International Conference on Computer Vision, Kerkyra, Greece, 20–27 September 1999; Volume 2, pp. 1150–1157. [Google Scholar]
Lowe, D.G. Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 2004, 60, 91–110. [Google Scholar] [CrossRef]
Ke, Y.; Sukthankar, R. PCA-SIFT: A more distinctive representation for local image descriptors. In Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), Washington, DC, USA, 27 June–2 July 2004; Volume 2, p. II. [Google Scholar]
Bay, H.; Tuytelaars, T.; Van Gool, L. Surf: Speeded up robust features. In Proceedings of the European Conference on Computer Vision, Graz, Austria, 7–13 May 2006; Springer: Berlin/Heidelberg, Germany, 2006; pp. 404–417. [Google Scholar]
Dellinger, F.; Delon, J.; Gousseau, Y.; Michel, J.; Tupin, F. SAR-SIFT: A SIFT-like algorithm for SAR images. IEEE Trans. Geosci. Remote Sens. 2014, 53, 453–466.
Alcantarilla, P.F.; Bartoli, A.; Davison, A.J. KAZE features. In Proceedings of the Computer Vision—ECCV 2012: 12th European Conference on Computer Vision, Florence, Italy, 7–13 October 2012; Proceedings, Part VI 12. Springer: Berlin/Heidelberg, Germany, 2012; pp. 214–227.
Ma, J.; Jiang, X.; Fan, A.; Jiang, J.; Yan, J. Image matching from handcrafted to deep features: A survey. Int. J. Comput. Vis. 2021, 129, 23–79.
Gong, M.; Zhao, S.; Jiao, L.; Tian, D.; Wang, S. A novel coarse-to-fine scheme for automatic image registration based on SIFT and mutual information. IEEE Trans. Geosci. Remote Sens. 2013, 52, 4328–4338.
Wu, Y.; Ma, W.; Gong, M.; Su, L.; Jiao, L. A novel point-matching algorithm based on fast sample consensus for image registration. IEEE Geosci. Remote Sens. Lett. 2014, 12, 43–47.
Wu, Y.; Miao, Q.; Ma, W.; Gong, M.; Wang, S. PSOSAC: Particle swarm optimization sample consensus algorithm for remote sensing image registration. IEEE Geosci. Remote Sens. Lett. 2017, 15, 242–246.
Leutenegger, S.; Chli, M.; Siegwart, R. BRISK: Binary robust invariant scalable keypoints. In Proceedings of the 2011 IEEE International Conference on Computer Vision (ICCV), Barcelona, Spain, 6–13 November 2011; pp. 2548–2555.
Dou, J.; Qin, Q.; Tu, Z. Robust image matching based on the information of SIFT. Optik 2018, 171, 850–861.
Li, Y.; Li, Q.; Liu, Y.; Xie, W. A spatial-spectral SIFT for hyperspectral image matching and classification. Pattern Recognit. Lett. 2018, 127, 18–26.
He, Y.; Deng, G.; Wang, Y.; Wei, L.; Yang, J.; Li, X.; Zhang, Y. Optimization of SIFT algorithm for fast-image feature extraction in line-scanning ophthalmoscope. Optik 2018, 152, 21–28.
Chen, Z.; Sun, S.K. A Zernike moment phase-based descriptor for local image representation and matching. IEEE Trans. Image Process. 2009, 19, 205–219.
Zhou, B.; Duan, X.; Ye, D.; Wei, W.; Woźniak, M.; Damaševičius, R. Heterogeneous Image Matching via a Novel Feature Describing Model. Appl. Sci. 2019, 9, 4792.
Klare, B.; Jain, A.K. Heterogeneous Face Recognition: Matching NIR to Visible Light Images. In Proceedings of the 2010 20th International Conference on Pattern Recognition, Istanbul, Turkey, 23–26 August 2010.
Wang, N.; Li, J.; Tao, D.; Li, X.; Gao, X. Heterogeneous image transformation. Pattern Recognit. Lett. 2013, 34, 77–84.
Peng, C.; Gao, X.; Wang, N.; Li, J. Graphical representation for heterogeneous face recognition. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 39, 301–312.
Wang, T.; Kemao, Q.; Seah, H.S.; Lin, F. A flexible heterogeneous real-time digital image correlation system. Opt. Lasers Eng. 2018, 110, 7–17.
Lin, G.; Fan, C.; Zhu, H.; Miu, Y.; Kang, X. Visual feature coding based on heterogeneous structure fusion for image classification. Inf. Fusion 2017, 36, 275–283.
Olson, C.F. Adaptive-scale filtering and feature detection using range data. IEEE Trans. Pattern Anal. Mach. Intell. 2000, 22, 983–991.
Lezoray, O.; Charrier, C. Color image segmentation using morphological clustering and fusion with automatic scale selection. Pattern Recognit. Lett. 2009, 30, 397–406.
Sun, J.; Xu, Z. Scale selection for anisotropic diffusion filter by Markov random field model. Pattern Recognit. 2010, 43, 2630–2645.
Biegler, L.T.; Ghattas, O.; Heinkenschloss, M.; van Bloemen Waanders, B. Large-scale PDE-constrained optimization: An introduction. In Large-Scale PDE-Constrained Optimization; Springer: Berlin/Heidelberg, Germany, 2003; pp. 3–13.
Mang, A.; Gholami, A.; Davatzikos, C.; Biros, G. PDE-constrained optimization in medical image analysis. Optim. Eng. 2018, 19, 765–812.
Witkin, A.P. Scale-space filtering. In Readings in Computer Vision; Elsevier: Amsterdam, The Netherlands, 1987; pp. 329–332.
Zhou, B.; Duan, X.M.; Wei, W.; Ye, D.J.; Woźniak, M.; Damaševičius, R. An adaptive local descriptor embedding Zernike moments for image matching. IEEE Access 2019, 7, 183971–183984.
Fergus, R.; Perona, P.; Zisserman, A. Object class recognition by unsupervised scale-invariant learning. In Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), Madison, WI, USA, 18–20 June 2003; pp. 264–271.
Jolliffe, I. Principal Component Analysis; Springer: Berlin/Heidelberg, Germany, 2011.
Immerkaer, J. Fast noise variance estimation. Comput. Vis. Image Underst. 1996, 64, 300–302.

Figure 1. Gaussian scale space.

Figure 2. Several nonlinear scale spaces.

Figure 3. Optimized scale levels.

Figure 4. Feature detection and matching under different rotations.

Figure 5. Feature numbers and match numbers under different rotations.

Figure 6. Feature detection and matching under different scales.

Figure 7. Feature numbers and match numbers under different scales.

Figure 8. Feature detection and matching under different noise levels.

Figure 9. Feature numbers and match numbers under different noise levels.

Figure 10. Feature detection and matching for different images.

Figure 11. Feature numbers and match numbers for different remote sensing images.

Table 1. Feature numbers under different rotations.

Method	Keypoint Number 1 (×1000)				Keypoint Number 2 (×1000)
Method	0	0	0	0	$\frac{25\pi}{180}$	$\frac{55\pi}{180}$	$\frac{85\pi}{180}$	$\frac{115\pi}{180}$
SIFT	1.146	1.146	1.146	1.146	0.641	0.645	0.624	0.638
SURF	0.170	0.170	0.170	0.170	0.089	0.099	0.091	0.084
BRISK	0.813	0.813	0.813	0.813	0.264	0.271	0.252	0.273
KAZE	0.303	0.303	0.303	0.303	0.168	0.166	0.164	0.172
Proposed	0.402	0.402	0.402	0.402	0.242	0.240	0.238	0.248

Table 2. Match numbers under different rotations.

Method	Number of Matches				Number of Correct Matches				Total Time Cost (s)
Method	$\frac{25\pi}{180}$	$\frac{55\pi}{180}$	$\frac{85\pi}{180}$	$\frac{115\pi}{180}$	$\frac{25\pi}{180}$	$\frac{55\pi}{180}$	$\frac{85\pi}{180}$	$\frac{115\pi}{180}$	$\frac{25\pi}{180}$	$\frac{55\pi}{180}$	$\frac{85\pi}{180}$	$\frac{115\pi}{180}$
SIFT	19	21	22	20	19	19	18	16	1.034	1.141	1.114	1.115
SURF	22	16	39	26	20	14	37	19	0.014	0.016	0.016	0.015
BRISK	11	15	18	11	11	15	18	11	0.418	0.441	0.439	0.464
KAZE	30	25	23	33	29	24	23	31	0.063	0.062	0.064	0.067
Proposed	50	44	44	56	48	43	42	53	0.061	0.066	0.064	0.078

Table 3. Feature numbers under different scales.

Method	Keypoint Number 1 (×1000)				Keypoint Number 2 (×1000)
Method	$100\%$	$100\%$	$100\%$	$100\%$	$55\%$	$65\%$	$75\%$	$85\%$
SIFT	1.146	1.146	1.146	1.146	0.303	0.421	0.606	0.798
SURF	0.170	0.170	0.170	0.170	0.062	0.078	0.085	0.105
BRISK	0.813	0.813	0.813	0.813	0.111	0.199	0.263	0.410
KAZE	0.303	0.303	0.303	0.303	0.110	0.129	0.173	0.194
Proposed	0.402	0.402	0.402	0.402	0.143	0.188	0.246	0.283

Table 4. Match numbers under different scales.

Method	Number of Matches				Number of Correct Matches				Total Time Cost (s)
Method	$55\%$	$65\%$	$75\%$	$85\%$	$55\%$	$65\%$	$75\%$	$85\%$	$55\%$	$65\%$	$75\%$	$85\%$
SIFT	8	9	18	32	8	8	17	29	0.755	1.018	1.120	1.224
SURF	14	16	19	35	8	11	18	30	0.013	0.017	0.016	0.016
BRISK	12	7	16	29	12	7	16	29	0.466	0.449	0.455	0.406
KAZE	5	21	30	35	5	18	29	32	0.050	0.064	0.064	0.069
Proposed	12	38	51	56	11	35	49	52	0.056	0.067	0.062	0.076

Table 5. Feature numbers under different noise levels.

Method	Keypoint Number 1 (×1000)				Keypoint Number 2 (×1000)
Method	-	-	-	-	20 dB	18 dB	16 dB	14 dB
SIFT	1.001	0.969	0.945	0.873	0.801	0.810	0.780	0.753
SURF	0.179	0.192	0.213	0.219	0.125	0.118	0.124	0.134
BRISK	0.996	1.219	1.636	2.155	0.440	0.442	0.497	0.570
KAZE	0.303	0.307	0.303	0.301	0.199	0.197	0.195	0.188
Proposed	0.412	0.420	0.414	0.398	0.292	0.285	0.295	0.293

Table 6. Match numbers under different noise levels.

Method	Number of Matches				Number of Correct Matches				Total Time Cost (s)
Method	20 dB	18 dB	16 dB	14 dB	20 dB	18 dB	16 dB	14 dB	20 dB	18 dB	16 dB	14 dB
SIFT	25	22	23	22	25	21	23	22	0.957	0.967	0.948	0.922
SURF	35	33	39	37	28	30	26	32	0.017	0.015	0.015	0.016
BRISK	35	24	22	23	35	24	22	23	0.392	0.375	0.383	0.395
KAZE	28	29	29	35	26	27	26	34	0.057	0.055	0.056	0.055
Proposed	41	46	52	49	38	44	47	47	0.056	0.058	0.057	0.058

Table 7. Feature numbers for different remote sensing images.

Method	Keypoint Number 1 (×1000)				Keypoint Number 2 (×1000)
Method	rs1	rs2	rs3	rs4	rs1	rs2	rs3	rs4
SIFT	1.523	1.604	1.896	1.496	0.631	0.698	0.686	0.663
SURF	0.241	0.157	0.188	0.176	0.095	0.049	0.038	0.012
BRISK	0.796	0.697	0.972	0.621	0.252	0.208	0.288	0.133
KAZE	0.163	0.084	0.12	0.053	0.051	0.018	0.015	0.008
Proposed	0.413	0.255	0.349	0.194	0.128	0.07	0.077	0.026

Table 8. Match numbers for different remote sensing images.

Method	Number of Matches				Number of Correct Matches				Total Time Cost (s)
Method	rs1	rs2	rs3	rs4	rs1	rs2	rs3	rs4	rs1	rs2	rs3	rs4
SIFT	20	6	8	2	17	5	8	2	1.035	0.911	0.963	0.915
SURF	24	13	10	6	21	12	10	6	0.014	0.016	0.012	0.012
BRISK	10	3	2	5	10	3	2	5	0.367	0.362	0.368	0.359
KAZE	13	5	4	2	13	5	4	2	0.044	0.043	0.044	0.043
Proposed	27	20	27	8	26	20	27	8	0.051	0.046	0.049	0.045

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
