Article

On the Accessibility of Newton’s Method under a Hölder Condition on the First Derivative

by José Antonio Ezquerro * and Miguel Ángel Hernández-Verón
Department of Mathematics and Computation, University of La Rioja, Calle Luis de Ulloa s/n, 26004 Logroño, Spain
* Author to whom correspondence should be addressed.
Algorithms 2015, 8(3), 514-528; https://0-doi-org.brum.beds.ac.uk/10.3390/a8030514
Submission received: 22 June 2015 / Revised: 15 July 2015 / Accepted: 17 July 2015 / Published: 23 July 2015
(This article belongs to the Special Issue Numerical Algorithms for Solving Nonlinear Equations and Systems)

Abstract: We show how to improve the accessibility of Newton’s method for approximating a solution of a nonlinear equation in Banach spaces when a center Hölder condition on the first derivative is used to prove its semi-local convergence.

1. Introduction

Many scientific and engineering problems require finding a root of a nonlinear equation, and iterative methods are the usual tool for solving such equations. To give sufficient generality to the problem, we are concerned in this work with approximating a locally-unique solution $x^*$ of the equation $F(x) = 0$, where $F$ is a nonlinear operator defined on a nonempty open convex subset $\Omega$ of a Banach space $X$ with values in a Banach space $Y$; many scientific and engineering problems can be written as a nonlinear equation in Banach spaces in this way.
Newton’s method is probably the best-known iterative method, and it is well known that it converges quadratically. Newton’s method for solving scalar equations was extended to nonlinear equations in Banach spaces by Kantorovich in 1948 [1] in the following way:
$$x_0 \ \text{given in}\ \Omega, \qquad x_n = x_{n-1} - [F'(x_{n-1})]^{-1} F(x_{n-1}), \quad n \in \mathbb{N}, \qquad (1)$$
where $F'$ is the Fréchet derivative of the operator $F$. For this reason, many authors refer to it as the Newton–Kantorovich method.
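For a finite-dimensional illustration of (1) (that is, $X = Y = \mathbb{R}^m$), the iteration can be sketched in a few lines. This is a minimal sketch of our own; the function names, tolerance and iteration cap are illustrative choices, not part of the original formulation.

```python
import numpy as np

def newton(F, dF, x0, tol=1e-12, max_iter=50):
    """Newton's method x_n = x_{n-1} - [F'(x_{n-1})]^{-1} F(x_{n-1}) in R^m.

    F  : callable returning F(x) as a 1-D array,
    dF : callable returning the Jacobian F'(x) as an m x m array,
    x0 : starting point, assumed to lie in the domain Omega.
    """
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        # Solve F'(x) * step = F(x) instead of forming the inverse explicitly.
        step = np.linalg.solve(dF(x), F(x))
        x_new = x - step
        if np.linalg.norm(x_new - x, np.inf) < tol:
            return x_new
        x = x_new
    return x
```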
Kantorovich proved the semi-local convergence of Newton’s method under the hypothesis that the operator $F$ involved is twice Fréchet differentiable with bounded second derivative:
There exists a constant $C \ge 0$, such that $\|F''(x)\| \le C$ for $x \in \Omega$.
It is well known that this condition can be replaced by a Lipschitz condition on the first derivative $F'$ of the operator involved [2]:
There exists a constant $L \ge 0$, such that $\|F'(x) - F'(y)\| \le L\|x - y\|$ for $x, y \in \Omega$.
In view of the applications, numerous papers have recently appeared where the convergence of the method is proven under different conditions on the derivatives of the operator $F$. A known variant is the study of the convergence of the method under a Hölder condition on the first derivative [3,4,5]:
There exist two constants $K \ge 0$ and $p \in (0,1]$, such that $\|F'(x) - F'(y)\| \le K\|x - y\|^p$ for $x, y \in \Omega$.
In order to improve the accessibility of Newton’s method under the last condition, we can use different strategies that modify the condition. In this work, we use one similar to that used by Argyros in [6] for the Lipschitz condition on the first derivative $F'$, which consists of noticing that, as a consequence of the last condition being satisfied in $\Omega$, we have, for the starting point $x_0$, that:
  • There exist two constants $K_0 \ge 0$ and $p \in (0,1]$, such that $\|F'(x) - F'(x_0)\| \le K_0\|x - x_0\|^p$ for $x \in \Omega$,
with $K_0 \le K$. We then say that $F'$ is center Hölder at $x_0$.
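The fact that the center constant satisfies $K_0 \le K$ can be checked numerically in simple cases. The following sketch is entirely our own illustration: the scalar example $F(x) = x^{1+p}/(1+p)$ on $\Omega = (0,2)$ with $x_0 = 1$ and the sampling grid are assumptions made only for this snippet.

```python
import numpy as np

p = 0.5                            # Hoelder exponent in (0, 1]
dF = lambda x: x**p                # F'(x) for F(x) = x^(1+p)/(1+p)
xs = np.linspace(1e-3, 2.0, 400)   # sample points in Omega = (0, 2)
x0 = 1.0                           # chosen starting point

# Estimate K: the Hoelder constant over all pairs (x, y) in Omega.
X, Y = np.meshgrid(xs, xs)
mask = X != Y
K_est = np.max(np.abs(dF(X[mask]) - dF(Y[mask])) / np.abs(X[mask] - Y[mask])**p)

# Estimate K_0: the center Hoelder constant, pairing every x only with x_0.
mask0 = xs != x0
K0_est = np.max(np.abs(dF(xs[mask0]) - dF(x0)) / np.abs(xs[mask0] - x0)**p)

print(K_est, K0_est)   # here K0_est <= K_est, illustrating K_0 <= K
```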
In this paper, we focus our attention on the analysis of the semi-local convergence of Sequence (1), which is based on demanding the last condition at the initial approximation $x_0$ and provides the so-called domain of parameters corresponding to the conditions required for the initial approximation that guarantee the convergence of Sequence (1) to the solution $x^*$.
In this work, we carry out an analysis of the domain of parameters for Newton’s method under the last two conditions on $F'$ and use a technique based on recurrence relations. As a consequence, we improve the domains of parameters associated with the semi-local convergence result given for Newton’s method by Hernández in [4].
We prove in this paper that center conditions on the first derivative of the operator involved play an important role in the semi-local convergence of Newton’s method, since the accessibility of Newton’s method can be improved from them.
Throughout the paper, we denote $\overline{B(x,\varrho)} = \{y \in X : \|y - x\| \le \varrho\}$ and $B(x,\varrho) = \{y \in X : \|y - x\| < \varrho\}$.

2. Preliminaries

The best-known semi-local convergence result for Newton’s method under a Hölder condition on the first derivative of the operator involved, when the technique of recurrence relations is used to prove it, is that given by Hernández in [4], which is established under the following conditions:
(A1)
There exists $\Gamma_0 = [F'(x_0)]^{-1} \in \mathcal{L}(Y,X)$, for some $x_0 \in \Omega$, with $\|\Gamma_0\| \le \beta$ and $\|\Gamma_0 F(x_0)\| \le \eta$, where $\mathcal{L}(Y,X)$ is the set of bounded linear operators from $Y$ to $X$.
(A2)
There exist two constants $K \ge 0$ and $p \in (0,1]$, such that $\|F'(x) - F'(y)\| \le K\|x - y\|^p$ for $x, y \in \Omega$.
(A3)
If $\xi(p)$ is the unique zero of the auxiliary function:
$$\phi(x;p) = (1+p)^p(1-x)^{1+p} - x^p, \quad p \in (0,1], \qquad (2)$$
in the interval $\left(0, \tfrac{1}{2}\right]$, it is satisfied that $h = K\beta\eta^p \le \xi(p)$ and $B(x_0, R) \subset \Omega$, where $R = \frac{(1+p)(1-h)}{(1+p)-(2+p)h}\,\eta$.
Theorem 1. (Theorem 2.1 of [4]) Let $F: \Omega \subseteq X \to Y$ be a continuously-differentiable operator defined on a nonempty open convex domain $\Omega$ of a Banach space $X$ with values in a Banach space $Y$. Suppose that Conditions (A1)–(A3) are satisfied. Then, Newton’s sequence, given by (1), converges to a solution $x^*$ of the equation $F(x) = 0$, starting at $x_0$, and $x_n, x^* \in \overline{B(x_0, R)}$, for all $n = 0, 1, 2, \ldots$
Note that Condition (A1), required for the initial approximation $x_0$, defines the parameters $\beta$ and $\eta$, and Condition (A2), required for the operator $F$, defines the parameter $K$, fixed for all points of $\Omega$. Observe that every point $z \in \Omega$, such that the operator $[F'(z)]^{-1}$ exists with $\|[F'(z)]^{-1}\| \le \beta$ and $\|[F'(z)]^{-1}F(z)\| \le \eta$, has associated the pair $(K, \beta\eta^p)$ of the $xy$-plane, where $x = K$ and $y = \beta\eta^p$. In addition, for fixed $p \in (0,1]$, if we consider the set:
$$D_T = \left\{(x,y) \in \mathbb{R}^2 : xy \le \xi(p)\right\},$$
we can observe that every point $z$, such that its associated pair $(K, \beta\eta^p)$ belongs to $D_T$, can be chosen as the starting point for Newton’s method, so that the method converges to a solution $x^*$ of the equation $F(x) = 0$ when it starts at $z$. The set $D_T$ is then called the domain of parameters associated with Theorem 1 and can be drawn by choosing $x = K$, $y = \beta\eta^p$ and coloring the region of the $xy$-plane whose points satisfy the condition $h \le \xi(p)$ (namely, $xy \le \xi(p)$) of Theorem 1. In Figure 1, we see the domain of parameters $D_T$ (blue region).
In relation to the above, the larger the domain of parameters is, the more possibilities we have of choosing good starting points for Newton’s method. As a consequence, we are interested in $D_T$ being as large as possible, since this allows us to find a greater number of good starting points.
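In practice, $\xi(p)$ and membership in $D_T$ are easy to compute numerically. A minimal sketch of our own (using SciPy’s brentq root finder; the bracketing interval and helper names are implementation choices):

```python
from scipy.optimize import brentq

def xi(p):
    """Unique zero of phi(x; p) = (1+p)^p (1-x)^(1+p) - x^p in (0, 1/2]."""
    phi = lambda x: (1.0 + p)**p * (1.0 - x)**(1.0 + p) - x**p
    return brentq(phi, 1e-12, 0.5)

def in_DT(K, beta, eta, p):
    """Condition of Theorem 1: h = K*beta*eta^p <= xi(p),
    i.e., the pair (K, beta*eta^p) belongs to D_T."""
    return K * beta * eta**p <= xi(p)

print(xi(1.0))        # 0.5 for p = 1
print(xi(1.0 / 3.0))  # approximately 0.3071, the value used later in Section 5
```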

3. Semi-Local Convergence of Newton’s Method

To improve the semi-local convergence of Newton’s method by increasing the domain of parameters $D_T$, we consider a procedure that consists of observing that, as a consequence of Condition (A2), once $x_0 \in \Omega$ is fixed, the condition:
$$\|F'(x) - F'(x_0)\| \le K_0\|x - x_0\|^p, \quad x \in \Omega, \qquad (3)$$
is satisfied with $K_0 \le K$. Then, by considering jointly the parameters $K$ and $K_0$, we can relax the semi-local convergence conditions of Newton’s method given in Theorem 1 and obtain a larger domain of parameters.
Now, we present a semi-local convergence result for Newton’s method under Condition (A1) for the starting point $x_0$ and Condition (A2) for the first derivative $F'$. Note that Condition (3) follows from Condition (A2) for the starting point $x_0$. In addition, we obtain a semi-local convergence result by combining Conditions (A2) and (3), which allows increasing the domain of parameters $D_T$, so that the possibility of choosing good starting points for Newton’s method is increased, as we can see later in the applications. In particular, we study the convergence of Newton’s method to a solution of the equation $F(x) = 0$ under certain conditions for the pair $(F, x_0)$. From some real parameters, a system of four recurrence relations is constructed in which two sequences of positive real numbers are involved. The convergence of Newton’s method is then guaranteed from them.

3.1. Auxiliary Scalar Sequences

From Conditions (A1)–(A2), we define $\gamma = \varepsilon h$, $b_0 = h$ and $\delta = \frac{\gamma}{(1+p)(1-\gamma)}$, where $\varepsilon = \frac{K_0}{K} \in [0,1]$. Now, we define $b_1 = \frac{\delta^p}{1-\gamma}\,b_0$ and:
$$a_n = \frac{b_n}{(1+p)(1-b_n)}, \quad n \ge 1, \qquad (4)$$
$$b_n = \frac{b_{n-1}\,a_{n-1}^p}{1-b_{n-1}}, \quad n \ge 2. \qquad (5)$$
Observe that we consider the case $b_0 > 0$, since if $b_0 = 0$, a trivial problem results, as the solution of the equation $F(x) = 0$ is $x_0$.
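The scalar sequences (4) and (5) are straightforward to generate from $h$, $\varepsilon$ and $p$. The following sketch is our own; the function name and the sample values (taken from the application in Section 5) are illustrative:

```python
def auxiliary_sequences(h, eps, p, n_terms=10):
    """Generate the sequences (a_n) and (b_n) of (4)-(5) with gamma = eps*h,
    b_0 = h, delta = gamma/((1+p)*(1-gamma)) and b_1 = delta^p * b_0/(1-gamma)."""
    gamma = eps * h
    delta = gamma / ((1.0 + p) * (1.0 - gamma))
    b = [h, delta**p * h / (1.0 - gamma)]          # b_0, b_1
    a = [None, b[1] / ((1.0 + p) * (1.0 - b[1]))]  # a_1 (a_0 is not used)
    for n in range(2, n_terms + 1):
        b.append(b[n - 1] * a[n - 1]**p / (1.0 - b[n - 1]))   # (5)
        a.append(b[n] / ((1.0 + p) * (1.0 - b[n])))           # (4)
    return a, b

# Values of the application in Section 5: h = 0.3429, eps = 0.7656, p = 1/3.
a, b = auxiliary_sequences(0.3429, 0.7656, 1.0 / 3.0, n_terms=6)
print(b[1:4])   # decreasing, as Lemma 2 below predicts
print(a[1:4])
```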
Next, we prove the following four recurrence relations for Sequences (1), (4) and (5):
$$\|\Gamma_1\| = \left\|[F'(x_1)]^{-1}\right\| \le \frac{\|\Gamma_0\|}{1-\gamma}, \qquad (6)$$
$$\|x_2 - x_1\| \le \delta\|x_1 - x_0\|, \qquad (7)$$
$$K\|\Gamma_1\|\,\|x_2 - x_1\|^p \le b_1, \qquad (8)$$
$$\|x_2 - x_0\| \le (1+\delta)\|x_1 - x_0\|, \qquad (9)$$
provided that:
$$x_1 \in \Omega \quad \text{and} \quad \gamma < 1. \qquad (10)$$
If $x_1 \in \Omega$, then:
$$\|I - \Gamma_0 F'(x_1)\| \le \|\Gamma_0\|\,\|F'(x_0) - F'(x_1)\| \le K_0\beta\|x_1 - x_0\|^p \le K_0\beta\eta^p = \gamma < 1.$$
Then, by the Banach lemma on invertible operators, it follows that the operator $\Gamma_1$ exists and:
$$\|\Gamma_1\| \le \frac{\|\Gamma_0\|}{1 - \|I - \Gamma_0 F'(x_1)\|} \le \frac{\|\Gamma_0\|}{1-\gamma}.$$
After that, from Taylor’s series and Sequence (1), we have:
$$F(x_1) = \int_0^1 \left(F'(x_0 + \tau(x_1 - x_0)) - F'(x_0)\right)(x_1 - x_0)\,d\tau, \qquad \text{so that} \quad \|F(x_1)\| \le \frac{K_0\,\eta^p}{1+p}\,\|x_1 - x_0\|.$$
Thus,
$$\|x_2 - x_1\| \le \|\Gamma_1\|\,\|F(x_1)\| \le \delta\|x_1 - x_0\|,$$
$$K\|\Gamma_1\|\,\|x_2 - x_1\|^p \le \frac{K\delta^p}{1-\gamma}\,\|\Gamma_0\|\,\|x_1 - x_0\|^p \le b_1,$$
$$\|x_2 - x_0\| \le \|x_2 - x_1\| + \|x_1 - x_0\| \le (1+\delta)\|x_1 - x_0\| < \frac{1+\delta}{1-a_1}\,\eta = R,$$
provided that $\delta < 1$ and $a_1 < 1$.
Now, we prove in the same way as above the following four recurrence relations for Sequences (1), (4) and (5):
$$\|\Gamma_2\| = \left\|[F'(x_2)]^{-1}\right\| \le \frac{\|\Gamma_1\|}{1-b_1}, \qquad (11)$$
$$\|x_3 - x_2\| \le a_1\|x_2 - x_1\|, \qquad (12)$$
$$K\|\Gamma_2\|\,\|x_3 - x_2\|^p \le b_2, \qquad (13)$$
$$\|x_3 - x_0\| \le \left(1 + \delta(1 + a_1)\right)\|x_1 - x_0\|, \qquad (14)$$
provided that:
$$x_2 \in \Omega \quad \text{and} \quad b_1 < 1. \qquad (15)$$
In addition, we generalize the last recurrence relations to every point of Sequence (1), so that we can guarantee from them that (1) is a Cauchy sequence. For this, we analyze the scalar sequences defined in (4) and (5) in order to prove later the semi-local convergence of Sequence (1); it suffices to see that (1) is a Cauchy sequence and that (10) and (15) are true for all $x_n$ and $b_{n-1}$ with $n \ge 3$. We begin by presenting a technical lemma.
Lemma 2. If $\gamma \le \frac{1+p}{2+p}$ and $b_1$ is such that:
$$b_1 < \frac{1+p}{2+p} \quad \text{and} \quad b_1 + a_1^p < 1, \qquad (16)$$
then:
(a)
the sequences $\{b_n\}$ and $\{a_n\}$ are strictly decreasing,
(b)
$a_n < 1$ and $b_n < 1$, for all $n \ge 1$.
If $b_1 = 1 - a_1^p < \frac{1+p}{2+p}$, then $a_n = a_1 < 1$ and $b_n = b_1 < 1$ for all $n \ge 2$.
Proof. We first consider the case in which $b_1$ satisfies (16). Item (a) is proven by mathematical induction on $n$. As $b_1 + a_1^p < 1$, then $b_2 < b_1$ and $a_2 < a_1$. If we now suppose that $b_j < b_{j-1}$ and $a_j < a_{j-1}$, for all $j = 2, 3, \ldots, n$, then:
$$b_{n+1} = \frac{b_n\,a_n^p}{1-b_n} < \frac{b_n\,a_1^p}{1-b_1} < b_n \qquad \text{and} \qquad a_{n+1} = \frac{b_{n+1}}{(1+p)(1-b_{n+1})} < a_n.$$
As a result, the sequences $\{a_n\}$ and $\{b_n\}$ are strictly decreasing for all $n \ge 2$.
To see Item (b), we have $a_n < a_1 < 1$ and $b_n < b_1 < 1$, for all $n \ge 2$, by Item (a) and the conditions given in (16).
Second, if $b_1 = 1 - a_1^p$, then $b_n = b_1 = 1 - a_1^p < 1$, for all $n \ge 2$. Moreover, if $b_1 < \frac{1+p}{2+p}$, then we have $a_n = a_1 < 1$, for all $n \ge 2$.   □

3.2. Main Result

We now give a semi-local convergence result for Newton’s method from a modification of the convergence conditions required in Theorem 1. Therefore, we consider Conditions (A1), (A2) and the modification of Condition (A3) given by:
  • (A3b) $\gamma \le \frac{1+p}{2+p}$, $b_1$ satisfies (16) and $B(x_0, R) \subset \Omega$, where $R = \frac{1+\delta}{1-a_1}\,\eta$.
Remember that Condition (3) follows from Condition (A2) for the starting point $x_0$.
Theorem 3. Let $F: \Omega \subseteq X \to Y$ be a continuously-differentiable operator defined on a nonempty open convex domain $\Omega$ of a Banach space $X$ with values in a Banach space $Y$. Suppose that Conditions (A1), (A2) and (A3b) are satisfied. Then, Newton’s sequence, given by (1), converges to a solution $x^*$ of the equation $F(x) = 0$, starting at $x_0$, and $x_n, x^* \in \overline{B(x_0, R)}$, for all $n = 0, 1, 2, \ldots$
Proof. We begin by proving the following four items for Sequences (1), (4) and (5) and $n \ge 3$:
(I)
There exists $\Gamma_{n-1} = [F'(x_{n-1})]^{-1}$ and $\|\Gamma_{n-1}\| \le \frac{\|\Gamma_{n-2}\|}{1-b_{n-2}}$,
(II)
$\|x_n - x_{n-1}\| \le a_{n-2}\|x_{n-1} - x_{n-2}\|$,
(III)
$K\|\Gamma_{n-1}\|\,\|x_n - x_{n-1}\|^p \le b_{n-1}$,
(IV)
$x_n \in \Omega$.
Observe that $x_1 \in \Omega$, since $\eta < R$. Moreover, from (6), (7), (8) and (9), it follows that $x_2 \in \Omega$. Furthermore, from (11), (12), (13) and (14), we have that Items (I), (II), (III) and (IV) hold for $n = 3$. If we now suppose that Items (I), (II) and (III) are true for $n - 1$, it follows, by analogy to the case $n = 3$ and induction, that Items (I), (II) and (III) also hold for $n$. Notice that $b_n < 1$ for all $n \ge 1$. Now, we prove (IV). Therefore,
$$\begin{aligned} \|x_n - x_0\| &\le \|x_n - x_{n-1}\| + \|x_{n-1} - x_{n-2}\| + \cdots + \|x_1 - x_0\| \\ &\overset{(\mathrm{II})}{\le} \left(1 + \sum_{i=1}^{n-2}\prod_{j=1}^{i} a_j\right)\|x_2 - x_1\| + \|x_1 - x_0\| < \left(1 + \sum_{i=1}^{n-2} a_1^i\right)\|x_2 - x_1\| + \|x_1 - x_0\| \\ &< \frac{1}{1-a_1}\,\|x_2 - x_1\| + \|x_1 - x_0\| \le \frac{1+\delta}{1-a_1}\,\|x_1 - x_0\| \le R \end{aligned}$$
and $x_n \in B(x_0, R)$. As $B(x_0, R) \subset \Omega$, then $x_n \in \Omega$ for all $n \ge 0$. Note that the conditions given in (15) are satisfied for all $x_n$ and $b_{n-1}$ with $n \ge 3$.
Next, we prove that $\{x_n\}$ is a Cauchy sequence. For this, we follow a procedure analogous to the previous one. Therefore, for $m \ge 2$ and $n \ge 2$, we have:
$$\begin{aligned} \|x_{n+m} - x_n\| &\le \|x_{n+m} - x_{n+m-1}\| + \|x_{n+m-1} - x_{n+m-2}\| + \cdots + \|x_{n+1} - x_n\| \\ &\overset{(\mathrm{II})}{\le} \sum_{i=n-1}^{n+m-2}\prod_{j=1}^{i} a_j\,\|x_2 - x_1\| \overset{\text{Lemma 2(a)}}{<} \sum_{i=n-1}^{n+m-2} a_1^i\,\|x_2 - x_1\| \\ &= \sum_{i=0}^{m-1} a_1^{\,n+i-1}\,\|x_2 - x_1\| = \frac{1 - a_1^m}{1 - a_1}\,a_1^{\,n-1}\,\|x_2 - x_1\|. \end{aligned}$$
Thus, { x n } is a Cauchy sequence.
After that, we prove that $x^*$ is a solution of the equation $F(x) = 0$. As $\|\Gamma_n F(x_n)\| \to 0$ when $n \to \infty$, if we take into account that:
$$\|F(x_n)\| \le \|F'(x_n)\|\,\|\Gamma_n F(x_n)\|$$
and $\{\|F'(x_n)\|\}$ is bounded, since:
$$\|F'(x_n)\| \le \|F'(x_0)\| + K_0\|x_n - x_0\|^p \le \|F'(x_0)\| + K_0 R^p,$$
it follows that $\|F(x_n)\| \to 0$ when $n \to \infty$. As a consequence, we obtain $F(x^*) = 0$ by the continuity of $F$ in $\overline{B(x_0, R)}$.   □

4. Accessibility of Newton’s Method

The accessibility of an iterative method is analyzed from the set of possible starting points that guarantee the convergence of the method when it starts at them. As we have indicated, this set of starting points is related to the domain of parameters associated with a semi-local convergence result for the iterative method.
Next, we study the domain of parameters associated with Theorem 1 and compare it with that associated with Theorem 3. To guarantee the convergence of Newton’s method from Theorem 3, the following three conditions must be satisfied:
$$\gamma \le \frac{1+p}{2+p}, \qquad b_1 < \frac{1+p}{2+p} \qquad \text{and} \qquad b_1 + \frac{b_1^p}{(1+p)^p(1-b_1)^p} < 1, \qquad (17)$$
where $p \in (0,1]$.
From the auxiliary function given in (2), the third condition of (17) can be written as:
$$\phi(b_1; p) > 0. \qquad (18)$$
Observe that $\phi(x;p)$ is a non-increasing and convex function with $\phi(0;p) = (1+p)^p > 0$ and $\phi\left(\tfrac{1}{2}; p\right) \le 0$, for all $p \in (0,1]$. Besides, if $p = 1$, the unique zero of $\phi(x;1)$ in the interval $\left(0, \tfrac{1}{2}\right]$ is $\tfrac{1}{2}$. If we now denote, for a fixed $p \in (0,1]$, the unique zero of $\phi(x;p)$ in $\left(0, \tfrac{1}{2}\right]$ by $\xi(p)$ and demand $b_1 < \xi(p)$, then Condition (18) holds. Moreover, since $\xi(p) \le \tfrac{1}{2}$, the second condition of (17) is satisfied. Now, as:
$$b_1 = \frac{\delta^p\,b_0}{1-\gamma} = \frac{\gamma^p\,b_0}{(1+p)^p(1-\gamma)^{1+p}},$$
the second and third conditions of (17) are satisfied, provided that:
$$b_0 < \frac{\xi(p)(1+p)^p(1-\gamma)^{1+p}}{\gamma^p}.$$
After that, we write the first condition of (17) as:
$$h \le \frac{1+p}{\varepsilon(2+p)} \qquad (19)$$
and take into account that the second and third conditions of (17) are satisfied if:
$$h < \frac{\xi(p)(1+p)^p(1-\varepsilon h)^{1+p}}{\varepsilon^p h^p}, \qquad (20)$$
since $b_0 = h$ and $\gamma = \varepsilon h$. In addition, Condition (20) is equivalent to:
$$\varpi(h) = \varepsilon^p h^{1+p} - \xi(p)(1+p)^p(1-\varepsilon h)^{1+p} < 0.$$
Furthermore, $\varpi'(h) \ge 0$, so that $\varpi(h)$ is a nondecreasing function for all $h \ge 0$, with $\varpi(0) \le 0$ and $\varpi\left(\frac{1+p}{\varepsilon(2+p)}\right) > 0$. Therefore, Condition (19) is satisfied if Condition (20) holds, and we can then give the following result.
Corollary 4. Let $F: \Omega \subseteq X \to Y$ be a continuously-differentiable operator defined on a nonempty open convex domain $\Omega$ of a Banach space $X$ with values in a Banach space $Y$. Suppose that (A1)–(A2) are satisfied. If Condition (20), where $\xi(p)$ is the unique zero of Function (2) in the interval $\left(0, \tfrac{1}{2}\right]$, is satisfied and $B(x_0, R) \subset \Omega$, where $R = \frac{1+\upsilon}{1-\nu}\,\eta$, $\upsilon = \frac{\varepsilon h}{(1+p)(1-\varepsilon h)}$, $\nu = \frac{\vartheta}{(1+p)(1-\vartheta)}$ and $\vartheta = \frac{h\,\upsilon^p}{1-\varepsilon h}$, then Newton’s sequence, given by (1), converges to a solution $x^*$ of the equation $F(x) = 0$, starting at $x_0$, and $x_n, x^* \in \overline{B(x_0, R)}$, for all $n = 0, 1, 2, \ldots$
From the last result, we can define the following domains of parameters:
$$D_C^{\varepsilon} = \left\{(x,y) \in \mathbb{R}^2 : xy \le \frac{\xi(p)(1+p)^p(1-\varepsilon xy)^{1+p}}{\varepsilon^p x^p y^p}\right\},$$
where $\varepsilon = \frac{K_0}{K} \in [0,1]$, $p \in (0,1]$ and $\xi(p)$ is the unique zero of Function (2) in the interval $\left(0, \tfrac{1}{2}\right]$.
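The membership test for $D_C^{\varepsilon}$ can be coded directly from Condition (20). A small self-contained sketch of our own (the helper names are illustrative):

```python
from scipy.optimize import brentq

def xi(p):
    # Unique zero of phi(x; p) = (1+p)^p (1-x)^(1+p) - x^p in (0, 1/2].
    return brentq(lambda x: (1.0 + p)**p * (1.0 - x)**(1.0 + p) - x**p, 1e-12, 0.5)

def in_DC(K, K0, beta, eta, p):
    """Condition (20): h < xi(p)*(1+p)^p*(1-eps*h)^(1+p) / (eps^p * h^p),
    with h = K*beta*eta^p and eps = K0/K, placing (K, beta*eta^p) in D_C^eps."""
    h = K * beta * eta**p
    eps = K0 / K
    return h < xi(p) * (1.0 + p)**p * (1.0 - eps * h)**(1.0 + p) / (eps**p * h**p)

# Data of the application in Section 5: the pair lies outside D_T but inside D_C^eps.
print(in_DC(0.2872, 0.2199, 1.4862, 0.5185, 1.0 / 3.0))   # True: 0.3429 < 0.3517
```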
Next, we compare the conditions required for the semi-local convergence of Newton’s method in Theorem 1 and Corollary 4. In Figure 1, we see that $D_T \subseteq D_C^{\varepsilon}$ for $p = \tfrac{1}{10}, \tfrac{1}{5}, \tfrac{2}{5}, \tfrac{4}{5}$, so that we can guess that the smaller the quantity $\varepsilon = \frac{K_0}{K} \in [0,1]$ is, the larger the domain of parameters is: orange for $\varepsilon = 0.1$, green for $\varepsilon = 0.2$, red for $\varepsilon = 0.4$ and yellow for $\varepsilon = 0.8$. Note that, if $\varepsilon \to 1$, the domain of parameters associated with Corollary 4, $D_C^{\varepsilon}$, tends to that obtained from Theorem 1 (blue region). As a consequence,
$$D_T = D_C^{1} = D_C^{\varepsilon_j} \subseteq D_C^{\varepsilon_{j-1}} \subseteq \cdots \subseteq D_C^{\varepsilon_0} \qquad \text{for} \qquad \varepsilon_0 < \cdots < \varepsilon_{j-1} < \varepsilon_j = 1.$$
Figure 1. Domains of parameters of Newton’s method associated with Theorem 1 (blue region) and Corollary 4 (orange for $\varepsilon = 0.1$, green for $\varepsilon = 0.2$, red for $\varepsilon = 0.4$ and yellow for $\varepsilon = 0.8$) for $p = \tfrac{1}{10}, \tfrac{1}{5}, \tfrac{2}{5}, \tfrac{4}{5}$.
On the other hand, in Figure 2, we observe the relationship between the domains of parameters associated with Theorem 1 (gray region) and Corollary 4 (magenta region) from the variability of $\varepsilon$ and for four different values of $p$: $p = \tfrac{1}{10}, \tfrac{1}{5}, \tfrac{2}{5}, \tfrac{4}{5}$. As we can see, the domain associated with Corollary 4 is always larger, for all $\varepsilon \in [0,1]$, than that associated with Theorem 1.
Figure 2. Domains of parameters of Newton’s method associated with Corollary 4 (magenta region) and Theorem 1 (gray region) for $p = \tfrac{1}{10}, \tfrac{1}{5}, \tfrac{2}{5}, \tfrac{4}{5}$.
In addition, we prove analytically in the following what we have just seen graphically. First, we prove that $D_T \subseteq D_C^{\varepsilon}$, for each $p \in (0,1]$ and $\varepsilon \in [0,1]$. For this, if $(K, \beta\eta^p) \in D_T$, then $h \le \xi(p) \le \frac{1}{2} \le \frac{1}{2\varepsilon} \le \frac{1+p}{\varepsilon(2+p)}$, since $p \in (0,1]$ and $\varepsilon \in [0,1]$. Moreover, $h \le \xi(p) \le \frac{\xi(p)(1+p)^p(1-\varepsilon h)^{1+p}}{\varepsilon^p h^p}$, since $\phi(\varepsilon h; p) \ge 0$ when $h \le \xi(p)$ and $\varepsilon h \le \xi(p)$. Therefore, $(K, \beta\eta^p) \in D_C^{\varepsilon}$ for each $p \in (0,1]$ and $\varepsilon \in [0,1]$.
Finally, we see that $D_C^{\varepsilon_2} \subseteq D_C^{\varepsilon_1}$ if $\varepsilon_1 < \varepsilon_2$ with $\varepsilon_1, \varepsilon_2 \in [0,1]$. Indeed, from $\psi(\varepsilon) = \frac{1+p}{(2+p)\varepsilon}$ and $\varphi(\varepsilon) = \frac{\xi(p)(1+p)^p(1-\varepsilon h)^{1+p}}{\varepsilon^p h^p}$, we have $\psi'(\varepsilon) = -\frac{1+p}{(2+p)\varepsilon^2} \le 0$ and $\varphi'(\varepsilon) = -\frac{(1+p)^p\,\xi(p)(1-\varepsilon h)^p(p+\varepsilon h)}{\varepsilon^{1+p}h^p} \le 0$, since $1 - \varepsilon h \ge \frac{1}{2+p} \ge 0$, so that $\psi(\varepsilon_2) \le \psi(\varepsilon_1)$ and $\varphi(\varepsilon_2) \le \varphi(\varepsilon_1)$, and therefore $D_C^{\varepsilon_2} \subseteq D_C^{\varepsilon_1}$.
In this way, we have proven the improvement obtained for the domain of parameters of Newton’s method with the help of conditions of type (3), as we have just shown in Figure 2.

5. Application

Now, we illustrate all of the above with the following mildly nonlinear elliptic equation:
$$u_{xx} + u_{yy} = u^{5/3}. \qquad (21)$$
This type of equation is of interest in the theory of gas dynamics [7]. An associated Dirichlet problem can be formulated as follows. Suppose that the equation is satisfied in the interior of the square $0 \le x, y \le 1$ in $\mathbb{R}^2$ and that $u(x,y) > 0$ is given and continuous on the boundary of the square ([8]):
$$u(x,0) = 2x^2 - x + 1, \quad u(x,1) = 2, \quad 0 \le x \le 1, \qquad u(0,y) = 2y^2 - y + 1, \quad u(1,y) = 2, \quad 0 \le y \le 1. \qquad (22)$$
Our discussion focuses on the formulation of the finite difference equation for the elliptic boundary value Problems (21)–(22). The method of finite differences applied to this problem yields a finite system of equations. For general use, iterative techniques often represent the best approach to the solution of such finite systems of equations.
Specifically, central difference approximations for (21) are used, so that Problems (21)–(22) are reduced to the problem of finding a real zero of a function $F: \Omega \subseteq \mathbb{R}^m \to \mathbb{R}^m$, namely a real solution $x^*$ of a nonlinear system $F(x) = 0$ with $m$ equations and $m$ unknowns. The common technique used to approximate $x^*$ is the application of iterative methods. In this case, Newton’s method continues to be the most used iterative method for approximating $x^*$, since it is one of the most efficient.
For Problems (21)–(22) in $\mathbb{R}^2$, Equation (21) can be approximated using central difference approximations for the spatial derivatives. Consider a grid with step sizes $h = \frac{1}{N+1}$ in $x$ and $k = \frac{1}{M+1}$ in $y$ defined over the square $D = [0,1] \times [0,1]$, so that $D$ is partitioned into $(N+1) \times (M+1)$ rectangles with sides $h$ and $k$. The mesh points $(x_i, y_j)$ are given by:
$$x_i = ih, \quad y_j = jk, \qquad i = 0, 1, \ldots, N+1, \quad j = 0, 1, \ldots, M+1.$$
Considering the following finite difference expressions to approximate the partial derivatives:
$$u_{xx}(x_i, y_j) = \frac{u(x_{i-1}, y_j) - 2u(x_i, y_j) + u(x_{i+1}, y_j)}{h^2} + O(h^2),$$
$$u_{yy}(x_i, y_j) = \frac{u(x_i, y_{j-1}) - 2u(x_i, y_j) + u(x_i, y_{j+1})}{k^2} + O(k^2),$$
Equation (21) is approximated at each interior grid point ( x i , y j ) by the difference equation:
$$\frac{u(x_{i+1}, y_j) - 2u(x_i, y_j) + u(x_{i-1}, y_j)}{h^2} + \frac{u(x_i, y_{j+1}) - 2u(x_i, y_j) + u(x_i, y_{j-1})}{k^2} = u(x_i, y_j)^{5/3},$$
for i = 1 , 2 , , N and j = 1 , 2 , , M . The boundary conditions are:
$$u(x_i, y_0) = 2x_i^2 - x_i + 1, \quad u(x_i, y_{M+1}) = 2, \quad i = 1, 2, \ldots, N, \qquad u(x_0, y_j) = 2y_j^2 - y_j + 1, \quad u(x_{N+1}, y_j) = 2, \quad j = 0, 1, \ldots, M+1.$$
If we now denote the approximate value of $u(x_i, y_j)$ by $u_{i,j}$, we obtain the difference equation:
$$2\left(\left(\frac{h}{k}\right)^2 + 1\right)u_{i,j} - \left(u_{i-1,j} + u_{i+1,j}\right) - \left(\frac{h}{k}\right)^2\left(u_{i,j-1} + u_{i,j+1}\right) = -h^2\,u_{i,j}^{5/3},$$
for i = 1 , 2 , , N and j = 1 , 2 , , M , with:
$$u_{i,0} = 2x_i^2 - x_i + 1, \quad u_{i,M+1} = 2, \quad i = 1, 2, \ldots, N, \qquad u_{0,j} = 2y_j^2 - y_j + 1, \quad u_{N+1,j} = 2, \quad j = 0, 1, \ldots, M+1.$$
Equation (21) with the boundary conditions given in (22) thus leads to a nonlinear system of $NM$ equations in $NM$ unknowns. To set up the nonlinear system, the $NM = m$ interior grid points are labeled row-by-row from $x_1$ to $x_m$, starting from the bottom-left corner point. The resulting system is:
$$Ax + h^2 q(x) = v,$$
where:
$$A = \begin{pmatrix} B & C & 0 & \cdots & 0 \\ C & B & C & \ddots & \vdots \\ 0 & C & B & \ddots & 0 \\ \vdots & \ddots & \ddots & \ddots & C \\ 0 & \cdots & 0 & C & B \end{pmatrix}_{M \times M}, \qquad B = \begin{pmatrix} 2(\lambda+1) & -1 & 0 & \cdots & 0 \\ -1 & 2(\lambda+1) & -1 & \ddots & \vdots \\ 0 & -1 & 2(\lambda+1) & \ddots & 0 \\ \vdots & \ddots & \ddots & \ddots & -1 \\ 0 & \cdots & 0 & -1 & 2(\lambda+1) \end{pmatrix}_{N \times N},$$
$C = -\lambda I$, $\lambda = \left(\frac{h}{k}\right)^2$, $I$ is the identity matrix in $\mathbb{R}^N$, $x = (x_1, x_2, \ldots, x_m)^t$, $q(x) = \left(x_1^{5/3}, x_2^{5/3}, \ldots, x_m^{5/3}\right)^t$ and $v$ is a vector formed from the boundary conditions (systems of this type are so-called mildly nonlinear systems).
If we denote the previous system by F ( x ) = 0 , where:
$$F(x) = Ax + h^2 q(x) - v \quad \text{and} \quad F: \mathbb{R}^m \to \mathbb{R}^m, \quad m = NM, \qquad (23)$$
then $F'(x)$ is a linear operator, which is given by the matrix:
$$F'(x) = A + h^2 Q(x), \qquad Q(x) = \frac{5}{3}\,\operatorname{diag}\left(x_1^{2/3}, x_2^{2/3}, \ldots, x_m^{2/3}\right).$$
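The system (23) and its Jacobian can be assembled programmatically for general $N$ and $M$. The following sketch is our own implementation: the routine name, the row-by-row numbering and the treatment of boundary values follow the description above, but all coding details are assumptions of this snippet.

```python
import numpy as np

def assemble(N, M, g_bottom, g_left, top=2.0, right=2.0):
    """Build A, v and the mesh size h for A x + h^2 q(x) = v, the discretization
    of (21)-(22); interior unknowns are numbered row by row from the bottom-left."""
    h, k = 1.0 / (N + 1), 1.0 / (M + 1)
    lam = (h / k)**2
    A = np.zeros((N * M, N * M))
    v = np.zeros(N * M)
    for j in range(1, M + 1):              # grid rows (y-direction)
        for i in range(1, N + 1):          # grid columns (x-direction)
            r = (j - 1) * N + (i - 1)      # index of the unknown u_{i,j}
            A[r, r] = 2.0 * (lam + 1.0)
            for di in (-1, 1):             # horizontal neighbours (coefficient 1)
                if 1 <= i + di <= N:
                    A[r, r + di] = -1.0
                else:                      # boundary value goes to the right side
                    v[r] += g_left(j * k) if i + di == 0 else right
            for dj in (-1, 1):             # vertical neighbours (coefficient lam)
                if 1 <= j + dj <= M:
                    A[r, r + dj * N] = -lam
                else:
                    v[r] += lam * (g_bottom(i * h) if j + dj == 0 else top)
    return A, v, h

g = lambda t: 2.0 * t**2 - t + 1.0         # boundary function from (22)
A, v, h = assemble(4, 4, g, g)             # N = M = 4 as in the text
F = lambda x: A @ x + h**2 * x**(5.0 / 3.0) - v                    # F(x) of (23)
dF = lambda x: A + h**2 * (5.0 / 3.0) * np.diag(x**(2.0 / 3.0))    # F'(x)
```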
We now choose $N = M = 4$ and the infinity norm. In addition, the number of equations is $m = 16$, $h = k = \frac{1}{5}$ and $\lambda = 1$. Besides, $q(x) = \left(x_1^{5/3}, x_2^{5/3}, \ldots, x_{16}^{5/3}\right)^t$ and:
$$v = \left(\tfrac{44}{25}, \tfrac{23}{25}, \tfrac{28}{25}, \tfrac{87}{25}, \tfrac{23}{25}, 0, 0, 2, \tfrac{28}{25}, 0, 0, 2, \tfrac{87}{25}, 2, 2, 4\right)^t.$$
In this case, we observe that a solution $x^*$ of the system $F(x) = 0$ with $F$ defined in (23) satisfies:
$$\|x^*\| \le \|A^{-1}\|\left(\|v\| + h^2\|q(x^*)\|\right) \quad\Longrightarrow\quad \|x^*\| - \frac{5}{3}\left(4 + \frac{1}{25}\|x^*\|^{5/3}\right) \le 0,$$
where $\|A^{-1}\| = \frac{5}{3}$ and $\|v\| = 4$, so that $\|x^*\| \in [0, \varrho_1] \cup [\varrho_2, +\infty)$, where $\varrho_1 = 9.5149$ and $\varrho_2 = 45.9125$ are the two positive real roots of the scalar equation $t - \frac{5}{3}\left(4 + \frac{t^{5/3}}{25}\right) = 0$. Then, we can consider:
$$F: \Omega \subset \mathbb{R}^{16} \to \mathbb{R}^{16} \quad \text{with} \quad \Omega = \left\{x \in \mathbb{R}^{16} : \|x\| < 10\right\},$$
since $\varrho_1 < 10 < \varrho_2$.
Moreover, $F'(x)$ is the linear operator given by the matrix:
$$A + \frac{1}{15}\,\operatorname{diag}\left(x_1^{2/3}, x_2^{2/3}, \ldots, x_{16}^{2/3}\right)$$
and:
$$F'(x) - F'(y) = \frac{1}{15}\,\operatorname{diag}\left(x_1^{2/3} - y_1^{2/3}, x_2^{2/3} - y_2^{2/3}, \ldots, x_{16}^{2/3} - y_{16}^{2/3}\right),$$
where $y = (y_1, y_2, \ldots, y_{16})^t$. In addition,
$$\|F'(x) - F'(y)\| \le \frac{1}{15}\left(\|x\|^{1/3} + \|y\|^{1/3}\right)\|x - y\|^{1/3} \le \frac{2}{15}\,\sqrt[3]{10}\,\|x - y\|^{1/3},$$
$$\|F'(x) - F'(x_0)\| \le \frac{1}{15}\left(\|x\|^{1/3} + \|x_0\|^{1/3}\right)\|x - x_0\|^{1/3} \le \frac{1}{15}\left(\sqrt[3]{10} + \|x_0\|^{1/3}\right)\|x - x_0\|^{1/3}.$$
Thus, $K = 0.2872$, $K_0 = 0.2199$ and $p = \frac{1}{3}$.
If we choose the starting point $x_0 = \left(\frac{3}{2}, \frac{3}{2}, \ldots, \frac{3}{2}\right)^t$, we obtain $\beta = 1.4862$ and $\eta = 0.5185$, so that the condition $h = K\beta\eta^p \le \xi(p)$ of Theorem 1 is not satisfied, since $h = 0.3429 > \xi(p) = \xi\left(\frac{1}{3}\right) = 0.3071$, where $\xi\left(\frac{1}{3}\right)$ is the unique zero of the corresponding auxiliary function given by (2), $\phi\left(t; \frac{1}{3}\right) = \frac{1}{3}\sqrt[3]{36}\,(1-t)^{4/3} - \sqrt[3]{t}$, in the interval $\left(0, \frac{1}{2}\right]$. As a consequence, we cannot use Theorem 1 to guarantee the convergence of Newton’s method for approximating a solution of the system $F(x) = 0$, where $F$ is defined in (23).
However, we can guarantee the convergence of Newton’s method from Corollary 4, since Condition (20) is satisfied: $h = 0.3429 < \frac{\xi(p)(1+p)^p(1-\varepsilon h)^{1+p}}{\varepsilon^p h^p} = 0.3517$ with $\varepsilon = 0.7656$. Therefore, we can then apply Newton’s method for approximating a solution of the system $F(x) = 0$ with $F$ defined in (23) and obtain the approximation given by the vector $x^* = (x_1^*, x_2^*, \ldots, x_{16}^*)^t$ that is shown in Table 1, reached after four iterations with a tolerance of $10^{-16}$. In Table 2, we show the errors $\|x_n - x^*\|$ using the stopping criterion $\|x_n - x_{n-1}\| < 10^{-16}$. Notice that the vector shown in Table 1 is a good approximation of the solution of the system, since $\|F(x^*)\| \le C \times 10^{-16}$. See the sequence $\{\|F(x_n)\|\}$ in Table 2.
Table 1. Approximation of the solution $x^*$ of $F(x) = 0$ with $F$ given in (23).

 i   x_i^*        i   x_i^*        i    x_i^*        i    x_i^*
 1   0.979069     5   1.097445     9    1.291651     13   1.587503
 2   1.097445     6   1.245767     10   1.422935     14   1.664776
 3   1.291651     7   1.422935     11   1.561551     15   1.742203
 4   1.587503     8   1.664776     12   1.742203     16   1.843388
Table 2. Absolute errors obtained by Newton’s method and $\{\|F(x_n)\|\}$.

 n   ‖x_n − x^*‖          ‖F(x_n)‖
 0   5.2093 × 10^{−1}     1.318622
 1   2.4028 × 10^{−3}     5.4477 × 10^{−3}
 2   8.4485 × 10^{−8}     1.2913 × 10^{−7}
 3   1.2409 × 10^{−16}    1.4741 × 10^{−16}
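For completeness, the experiment of this section can be reproduced approximately by combining the assembly sketch above with a plain Newton iteration in the spirit of the routine sketched in the Introduction; the starting point, tolerance and what is printed are our own choices, so the figures will only match the tables up to rounding:

```python
import numpy as np

# Assumes A, v, h, F and dF from the assembly sketch above.
x0 = np.full(16, 1.5)                                          # (3/2, ..., 3/2)^t
beta = np.linalg.norm(np.linalg.inv(dF(x0)), np.inf)           # about 1.4862
eta = np.linalg.norm(np.linalg.solve(dF(x0), F(x0)), np.inf)   # about 0.5185

x = x0.copy()
for n in range(1, 11):
    x_new = x - np.linalg.solve(dF(x), F(x))                   # Newton step (1)
    if np.linalg.norm(x_new - x, np.inf) < 1e-15:
        break
    x = x_new
print(n, np.linalg.norm(F(x_new), np.inf))   # a handful of iterations suffice (cf. Table 2)
```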
Finally, we note that if we interpolate the points of Table 1 and take into account that the solution satisfies the boundary conditions given in (22), we obtain the approximation of the numerical solution shown in Figure 3.
Figure 3. Approximated solution of Problems (21)–(22).

Acknowledgments

This work has been partially supported by the project MTM2014-52016-C2-1-P of the Spanish Ministry of Economy and Competitiveness.

Author Contributions

The contributions of the two authors have been similar. Both authors have worked together to develop the present manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Kantorovich, L.V.; Akilov, G.P. Functional Analysis; Pergamon Press: Oxford, UK, 1982.
  2. Fenyö, I. Über die Lösung der in Banachschen Raume definierten nichtlinearen Gleichungen. Acta Math. Acad. Sci. Hung. 1954, 5, 85–93.
  3. Argyros, I.K. Remarks on the convergence of Newton’s method under Hölder continuity conditions. Tamkang J. Math. 1992, 23, 269–277.
  4. Hernández, M.A. The Newton method for operators with Hölder continuous first derivative. J. Optim. Theory Appl. 2001, 109, 631–648.
  5. Rokne, J. Newton’s method under mild differentiability conditions with error analysis. Numer. Math. 1972, 18, 401–412.
  6. Argyros, I.K. On the Newton–Kantorovich hypothesis for solving equations. J. Comput. Appl. Math. 2004, 169, 315–332.
  7. Greenspan, D. Introductory Numerical Analysis of Elliptic Boundary Value Problems; Harper and Row: New York, NY, USA, 1965.
  8. Rall, L.B. Computational Solution of Nonlinear Operator Equations; Robert E. Krieger Publishing Company: Malabar, FL, USA, 1979.
