Article

The Modified Viscosity Approximation Method with Inertial Technique and Forward–Backward Algorithm for Convex Optimization Model

Adisak Hanjing, Limpapat Bussaban and Suthep Suantai
1 Department of Science and Mathematics, Rajamangala University of Technology Isan Surin Campus, Surin 32000, Thailand
2 Faculty of Science, Chiang Mai University, Chiang Mai 50200, Thailand
3 Data Science Research Center, Department of Mathematics, Faculty of Science, Chiang Mai University, Chiang Mai 50200, Thailand
4 Research Group in Mathematics and Applied Mathematics, Department of Mathematics, Faculty of Science, Chiang Mai University, Chiang Mai 50200, Thailand
* Author to whom correspondence should be addressed.
Submission received: 21 February 2022 / Revised: 15 March 2022 / Accepted: 22 March 2022 / Published: 24 March 2022
(This article belongs to the Special Issue Advanced Optimization Methods and Applications)

Abstract

In this paper, we propose a new accelerated algorithm for finding a common fixed point of nonexpansive operators, and then, a strong convergence result of the proposed method is discussed and analyzed in real Hilbert spaces. As an application, we create a new accelerated viscosity forward–backward method (AVFBM) for solving nonsmooth optimization problems of the sum of two objective functions in real Hilbert spaces, and the strong convergence of AVFBM to a minimizer of the sum of two convex functions is established. We also present the application and simulated results of AVFBM for image restoration and data classification problems.

1. Introduction

Image restoration is a fundamental problem in image processing. Image restoration (also called image deblurring or image deconvolution) is concerned with reconstructing or estimating an uncorrupted image from a blurred and noisy observation [1,2]. Thus, the main objective of image restoration algorithms is to reduce the blurring and noise that degrade the observed image and to produce an estimate that is as close as possible to the original image.
The image restoration problem can be modeled by a linear inverse problem, which is formulated by:
$u = Bv + e,$  (1)
where $B \in \mathbb{R}^{m \times n}$ is the blurring matrix, $v \in \mathbb{R}^n$ is the original image, $u \in \mathbb{R}^m$ is the observed image, and $e \in \mathbb{R}^m$ is the noise. One of the most popular models for solving Problem (1) is the least absolute shrinkage and selection operator (LASSO) [3], which can be considered in the following form:
$\min_{v} \; \|Bv - u\|_2^2 + \tau\|v\|_1,$  (2)
where $\tau > 0$ is a regularization parameter, $\|\cdot\|_1$ is the $\ell_1$-norm, and $\|\cdot\|_2$ is the $\ell_2$-norm. Moreover, Problem (2) arises in many areas of science and applied science, such as astronomical imaging [4], microscopy [5], and signal recovery problems [6].
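To make the model concrete, the splitting methods reviewed below handle (2) by combining a gradient step on the smooth term $\|Bv - u\|_2^2$ with the proximal map of $\tau\|\cdot\|_1$, which is the componentwise soft-thresholding operator. The following is a minimal MATLAB sketch of this idea on a small synthetic problem; the problem sizes, the noise level, the step size, and the value of tau are illustrative placeholders and are not the settings used in the experiments of Section 4.

```matlab
% Minimal proximal-gradient (forward-backward) sketch for the LASSO model (2).
% All problem data below are synthetic placeholders.
rng(1);
m = 50; n = 100;
B  = randn(m, n);                           % stands in for the blurring matrix
v0 = zeros(n, 1); v0(randperm(n, 5)) = 1;   % sparse "true" signal
u  = B*v0 + 1e-3*randn(m, 1);               % observed data with small noise
tau = 1e-2;                                 % regularization parameter (illustrative)

L = 2*max(eig(B'*B));                       % Lipschitz constant of the gradient of ||Bv - u||_2^2
c = 1/L;                                    % step size in (0, 2/L)
soft = @(x, t) sign(x).*max(abs(x) - t, 0); % prox of t*||.||_1 (soft thresholding)

v = zeros(n, 1);
for k = 1:500
    grad = 2*B'*(B*v - u);                  % forward (gradient) step on the smooth term
    v = soft(v - c*grad, c*tau);            % backward (proximal) step on the l1 term
end
fprintf('objective = %.4f\n', norm(B*v - u)^2 + tau*norm(v, 1));
```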
The nonsmooth convex optimization model which includes (2) as a particular case has the following form:
$\min_{x \in H} \; \phi_1(x) + \phi_2(x),$  (3)
where $H$ is a Hilbert space with norm $\|\cdot\|$ and inner product $\langle\cdot,\cdot\rangle$, $\phi_2 : H \to \mathbb{R} \cup \{\infty\}$ is a proper, convex, and lower semi-continuous function, and $\phi_1 : H \to \mathbb{R}$ is convex and differentiable with an $L$-Lipschitz continuous gradient, $L > 0$. The solution set of Problem (3) will be denoted by $\Omega := \mathrm{Argmin}(\phi_1 + \phi_2)$. Furthermore, $x$ is a solution of Problem (3) if and only if $x$ satisfies the fixed point equation:
$x = \mathrm{prox}_{c\phi_2}(I - c\nabla\phi_1)(x),$  (4)
where $c > 0$, $I$ is the identity operator, $\nabla f$ is the gradient of $f$, $\mathrm{prox}_{\phi_2} = (I + \partial\phi_2)^{-1}$, and $\partial\phi_2$ is the subdifferential of $\phi_2$ defined by:
$\partial\phi_2(a^*) := \{u \in H : \phi_2(a) \ge \langle u, a - a^*\rangle + \phi_2(a^*), \; \forall a \in H\},$  (5)
see [7,8,9] for more details. For solving (3), the forward–backward splitting (FBS) algorithm [10] takes the following form:
$x_{k+1} = \underbrace{\mathrm{prox}_{c_k\phi_2}}_{\text{backward step}}\underbrace{(I - c_k\nabla\phi_1)}_{\text{forward step}}(x_k), \quad k \in \mathbb{N},$  (6)
where $x_1 \in H$ and $0 < c_k < 2/L$. To accelerate the proximal gradient algorithm, the inertial (extrapolation) technique was proposed by Nesterov in 1983 [11] for solving the class of convex optimization problems (3) in which $F := \phi_1 + \phi_2$ is a smooth convex function. A typical algorithm takes the following form:
$y_k = x_k + \theta_k(x_k - x_{k-1}), \qquad x_{k+1} = y_k - c\nabla F(y_k), \quad k \in \mathbb{N},$
where $c > 0$ is a step size depending on the Lipschitz continuity modulus of $\nabla F$, and the inertial parameter satisfies $\theta_k \in (0,1)$ for all $k$. He also showed that, by choosing $\{\theta_k\}$ such that $\sup_k \theta_k = 1$, this algorithm has a faster convergence rate than the standard gradient algorithm; see [11]. In 2009, Beck and Teboulle [12] improved FBS by using the inertial technique; the resulting algorithm is known as the fast iterative shrinkage-thresholding algorithm (FISTA) and is defined as follows:
$y_k = \mathrm{prox}_{\frac{1}{L}\phi_2}\big(x_k - \tfrac{1}{L}\nabla\phi_1(x_k)\big), \quad t_{k+1} = \dfrac{1 + \sqrt{1 + 4t_k^2}}{2}, \quad \theta_k = \dfrac{t_k - 1}{t_{k+1}}, \quad x_{k+1} = y_k + \theta_k(y_k - y_{k-1}), \quad k \in \mathbb{N},$  (7)
where $x_1 = y_0 \in H$, $t_1 = 1$, and $\theta_k$ is the inertial parameter. FISTA has been recognized as a fast method. Note that the inertial parameter $\{\theta_k\}$ in (7) satisfies $\sup_k \theta_k = 1$, so the sequence generated by FISTA has a rate of convergence that is significantly better, both theoretically and practically. Recently, Liang and Schonlieb [13] modified FISTA ("FISTA-Mod" for short) and proved a weak convergence theorem for FISTA-Mod; moreover, they proved that $\|x_k - x_{k-1}\| = O(1/k)$. However, FBS and FISTA provide only weak convergence in Hilbert spaces. For strong convergence, the viscosity approximation method (VAM) for the fixed point of a nonexpansive operator $T$ was proposed by Moudafi [14], who proved the strong convergence of the method (8) in real Hilbert spaces:
$x_{k+1} = \gamma_k g(x_k) + (1 - \gamma_k)Tx_k, \quad k \in \mathbb{N},$  (8)
where $x_1 \in H$, $\{\gamma_k\}$ is a sequence in $(0,1)$, and $g$ is a contraction operator. In 2008, Takahashi [15] modified the viscosity approximation method of Moudafi [14] for finding a common fixed point of a countable family of nonexpansive operators $\{T_k\}$. His algorithm takes the following form:
$x_{k+1} = \gamma_k g(x_k) + (1 - \gamma_k)T_k x_k, \quad k \in \mathbb{N},$  (9)
where $x_1 \in H$, $\{\gamma_k\} \subset (0,1)$, and $g$ is a contraction operator. He proved a strong convergence theorem for (9) under some conditions on $\{T_k\}$ and $\{\gamma_k\}$.
In 2012, He and Guo [16] introduced the following modified viscosity approximation method for a countable family of nonexpansive operators:
$x_{k+1} = \gamma_k g(x_k) + (1 - \gamma_k)L_k x_k, \quad k \in \mathbb{N},$  (10)
where $\{\gamma_k\} \subset (0,1)$, $L_k = \sum_{i=1}^{k}\frac{w_i}{s_k}T_i$, $s_k = \sum_{i=1}^{k}w_i$, and $w_i > 0$ with $\sum_{i=1}^{\infty}w_i = 1$. They proved strong convergence of (10) under a condition on $\{\gamma_k\}$ without any additional condition on $\{T_k\}$. However, this algorithm requires more computational work than (9). Subsequently, several algorithms for common fixed points of a countable family of nonexpansive operators were introduced and discussed; see [16,17,18,19,20].
Inspired by [10,12,15], in this paper we propose a simple method with an inertial technique for solving the common fixed point problem of a countable family of nonexpansive operators in a real Hilbert space. We then prove strong convergence of the proposed method under suitable conditions. Finally, we apply our proposed method to image restoration and classification problems.
The rest of this paper is organized as follows: In Section 2, we present some notation and useful lemmas that will be used throughout the paper. The strong convergence of the accelerated viscosity fixed point method and the accelerated viscosity forward–backward method is analyzed in Section 3. Applications and simulated results for image restoration and data classification problems are given in Section 4. Finally, we give concluding remarks and directions for further study in Section 5.

2. Preliminaries

In this section, we present some definitions and useful lemmas for proving our main results in the next section. Throughout this paper, we adopt the following notations:
  • $H$ denotes a real Hilbert space with norm $\|\cdot\|$ and inner product $\langle\cdot,\cdot\rangle$;
  • $C$ denotes a nonempty closed convex subset of $H$;
  • $\mathrm{Fix}(T)$ denotes the set of all fixed points of $T$;
  • ⇀ and → denote weak convergence and strong convergence, respectively;
  • $\mathrm{prox}_{c\phi_2}(I - c\nabla\phi_1)$ denotes the forward–backward operator of $\phi_1$ and $\phi_2$ with respect to $c$.
A mapping $T : C \to C$ is said to be an $L$-Lipschitz operator if there exists $L > 0$ such that $\|Ta - Tb\| \le L\|a - b\|$ for all $a, b \in C$. An $L$-Lipschitz operator is called a nonexpansive operator and a contraction operator if $L = 1$ and $L \in (0,1)$, respectively. If $T : C \to C$ is a nonexpansive operator with $\mathrm{Fix}(T) \ne \emptyset$, then $\mathrm{Fix}(T)$ is closed and convex, and the mapping $I - T$ is demiclosed at zero; that is, for any sequence $\{x_k\} \subset C$, $x_k \rightharpoonup a$ and $\|x_k - Tx_k\| \to 0$ imply $a \in \mathrm{Fix}(T)$. A mapping $P_C$ is said to be the metric projection of $H$ onto $C$ if, for every $a \in H$, there exists a unique nearest point in $C$, denoted by $P_C a$, such that:
$\|a - P_C a\| \le \|a - b\|, \quad \forall b \in C.$
Moreover, $P_C$ is a firmly nonexpansive mapping, and $P_C$ satisfies $\langle a - P_C a, b - P_C a\rangle \le 0$ for all $a \in H$ and $b \in C$. Let $\{T_k\}$ and $\Lambda$ be families of nonexpansive operators of $C$ into itself such that $\mathrm{Fix}(\Lambda) \subset \Gamma := \bigcap_{k=1}^{\infty}\mathrm{Fix}(T_k)$, where $\mathrm{Fix}(\Lambda)$ is the set of all common fixed points of $\Lambda$. A sequence $\{T_k\}$ is said to satisfy the NST-condition (I) with $\Lambda$ [21] if, for every bounded sequence $\{x_k\}$ in $C$,
$\lim_{k\to\infty}\|x_k - T_k x_k\| = 0$ implies $\lim_{k\to\infty}\|x_k - Tx_k\| = 0$ for all $T \in \Lambda$.
If $\Lambda$ is a singleton, i.e., $\Lambda = \{T\}$, then $\{T_k\}$ is said to satisfy the NST-condition (I) with $T$. Later, Aoyama, Kohsaka, and Takahashi [22] introduced the condition (Z), which is more general than the NST-condition (I). A sequence $\{T_k\}$ is said to satisfy the condition (Z) if, whenever $\{x_k\}$ is a bounded sequence in $C$ such that $\lim_{k\to\infty}\|x_k - T_k x_k\| = 0$, every weak cluster point of $\{x_k\}$ belongs to $\Gamma$.
It is also known that $\mathrm{prox}_{c\phi_2}(I - c\nabla\phi_1)$ is a nonexpansive mapping when $0 < c < 2/L$. The following lemmas are useful for proving our main results.
Lemma 1
([23]). Let $\phi_1 : H \to \mathbb{R}$ be a convex and differentiable function with an $L$-Lipschitz continuous gradient, and let $\phi_2 : H \to \mathbb{R} \cup \{\infty\}$ be a proper, lower semi-continuous, and convex function. Let $T_k := \mathrm{prox}_{c_k\phi_2}(I - c_k\nabla\phi_1)$ and $T := \mathrm{prox}_{c\phi_2}(I - c\nabla\phi_1)$, where $c_k, c \in (0, 2/L)$ with $c_k \to c$ as $k \to \infty$. Then $\{T_k\}$ satisfies the NST-condition (I) with $T$.
Lemma 2
([24]). For all $a, b \in H$ and $t \in [0,1]$, the following hold:
(i)
$\|ta + (1-t)b\|^2 = t\|a\|^2 + (1-t)\|b\|^2 - t(1-t)\|a - b\|^2$;
(ii)
$\|a \pm b\|^2 = \|a\|^2 \pm 2\langle a, b\rangle + \|b\|^2$;
(iii)
$\|a + b\|^2 \le \|a\|^2 + 2\langle b, a + b\rangle$.
Lemma 3
([25]). Let $\{a_i : i = 1, 2, \ldots, k\} \subset H$ and let $b_i \in (0,1)$, $i = 1, 2, \ldots, k$, be such that $\sum_{i=1}^{k} b_i = 1$. Then, the following identity holds:
$\Big\|\sum_{i=1}^{k} b_i a_i\Big\|^2 = \sum_{i=1}^{k} b_i\|a_i\|^2 - \sum_{1 \le i < j \le k} b_i b_j\|a_i - a_j\|^2.$
Lemma 4
([26]). Let $\{a_k\}$ be a sequence of non-negative real numbers, $\{b_k\}$ be a sequence of real numbers, and $\{t_k\}$ be a sequence of real numbers in $(0,1)$ such that $\sum_{k=1}^{\infty} t_k = \infty$. Assume that:
$a_{k+1} \le (1 - t_k)a_k + t_k b_k, \quad k \in \mathbb{N}.$
If $\limsup_{i\to\infty} b_{k_i} \le 0$ for every subsequence $\{a_{k_i}\}$ of $\{a_k\}$ satisfying the condition:
$\liminf_{i\to\infty}(a_{k_i+1} - a_{k_i}) \ge 0,$
then $\lim_{k\to\infty} a_k = 0$.

3. Main Results

In this section, we propose a new accelerated viscosity fixed point method, called "AVFPM", for finding a common fixed point of a countable family of nonexpansive operators in a real Hilbert space. In order to introduce AVFPM, we assume the following:
  • $g : H \to H$ is a contraction with constant $\eta \in (0,1)$;
  • $\{T_k : H \to H\}$ is a family of nonexpansive operators;
  • $\{T_k\}$ satisfies the condition (Z);
  • $\Gamma := \bigcap_{k=1}^{\infty}\mathrm{Fix}(T_k) \ne \emptyset$.
Algorithm 1: An accelerated viscosity fixed point method (AVFPM).
Initialization: Take $x_0, x_1 \in H$ arbitrarily and positive sequences $\{\lambda_k\}$, $\{\sigma_k\}$, $\{\gamma_k\}$, $\{\beta_k\}$, and $\{\alpha_k\}$ satisfying the following conditions:
$\{\alpha_k\} \subset (0,1)$, $\lim_{k\to\infty}\alpha_k = 0$ and $\sum_{k=1}^{\infty}\alpha_k = \infty$; $\{\beta_k\} \subset (0,1)$; $0 < a_5 \le \gamma_k < 1$; $\alpha_k + \beta_k + \gamma_k = 1$; $0 < a_1 \le \lambda_k \le a_2 < 1$; $0 < a_3 \le \sigma_k \le a_4 < 1$,
for some positive real numbers $a_1$, $a_2$, $a_3$, $a_4$, and $a_5$.
Iterative steps: Calculate $x_{k+1}$ as follows:
Step 1. Choose a bounded sequence of non-negative real numbers $\{\mu_k\}$. For $k \ge 1$, set
$\theta_k = \min\{\mu_k, \tau_k/\|x_k - x_{k-1}\|\}$ if $x_k \ne x_{k-1}$, and $\theta_k = \mu_k$ otherwise,
where $\{\tau_k\}$ is a sequence of positive real numbers such that $\lim_{k\to\infty}\tau_k/\alpha_k = 0$.
Step 2. Compute
$w_k = x_k + \theta_k(x_k - x_{k-1})$,
$z_k = (1 - \lambda_k)w_k + \lambda_k T_k w_k$,
$y_k = (1 - \sigma_k)T_k w_k + \sigma_k T_k z_k$,
$x_{k+1} = \alpha_k g(w_k) + \beta_k T_k z_k + \gamma_k T_k y_k$.
Update $k := k + 1$ and return to Step 1.
Now, we prove the strong convergence of Algorithm 1 (AVFPM).
Theorem 5. Let $\{x_k\}$ be a sequence generated by Algorithm 1 (AVFPM). Then, $\{x_k\}$ converges strongly to an element $a^* \in \Gamma$, where $a^* = P_\Gamma g(a^*)$.
Proof. 
By the Banach contraction principle, there exists a unique $a^* \in \Gamma$ such that $a^* = P_\Gamma g(a^*)$. From the definition of $w_k$, we have:
$\|w_k - a^*\| \le \|x_k - a^*\| + \theta_k\|x_k - x_{k-1}\|,$  (11)
and:
$\|z_k - a^*\| \le (1 - \lambda_k)\|w_k - a^*\| + \lambda_k\|T_k w_k - a^*\| \le \|w_k - a^*\|.$  (12)
From (12), we get:
$\|y_k - a^*\| \le (1 - \sigma_k)\|T_k w_k - a^*\| + \sigma_k\|T_k z_k - a^*\| \le \|w_k - a^*\|.$  (13)
From (11)–(13), we obtain:
$\|x_{k+1} - a^*\| \le \alpha_k\|g(w_k) - g(a^*)\| + \alpha_k\|g(a^*) - a^*\| + \beta_k\|T_k z_k - a^*\| + \gamma_k\|T_k y_k - a^*\|$
$\le \alpha_k\eta\|w_k - a^*\| + \alpha_k\|g(a^*) - a^*\| + \beta_k\|z_k - a^*\| + \gamma_k\|y_k - a^*\|$
$\le (1 - \alpha_k(1 - \eta))\|w_k - a^*\| + \alpha_k\|g(a^*) - a^*\|$
$\le (1 - \alpha_k(1 - \eta))\|x_k - a^*\| + \alpha_k\Big[\frac{\theta_k}{\alpha_k}\|x_k - x_{k-1}\| + \|g(a^*) - a^*\|\Big].$
By the condition on $\theta_k$, we have $\lim_{k\to\infty}\frac{\theta_k}{\alpha_k}\|x_k - x_{k-1}\| = 0$, and so there exists a constant $M_1 > 0$ such that $\frac{\theta_k}{\alpha_k}\|x_k - x_{k-1}\| \le M_1$ for all $k \ge 1$. Thus:
$\|x_{k+1} - a^*\| \le (1 - \alpha_k(1 - \eta))\|x_k - a^*\| + \alpha_k(M_1 + \|g(a^*) - a^*\|).$
By mathematical induction, we get:
$\|x_{k+1} - a^*\| \le \max\Big\{\|x_1 - a^*\|, \dfrac{M_1 + \|g(a^*) - a^*\|}{1 - \eta}\Big\}, \quad \forall k \ge 1.$
This implies that $\{x_k\}$ is bounded, and hence $\{w_k\}$, $\{z_k\}$, $\{y_k\}$, $\{T_k w_k\}$, $\{T_k z_k\}$, $\{T_k y_k\}$, and $\{g(w_k)\}$ are also bounded. By Lemma 2, we obtain:
$\|w_k - a^*\|^2 \le \|x_k - a^*\|^2 + \theta_k^2\|x_k - x_{k-1}\|^2 + 2\theta_k\|x_k - a^*\|\|x_k - x_{k-1}\|,$  (14)
and:
$\|z_k - a^*\|^2 \le \|w_k - a^*\|^2 - \lambda_k(1 - \lambda_k)\|w_k - T_k w_k\|^2.$  (15)
By Lemma 2(i) and (15), we obtain:
$\|y_k - a^*\|^2 \le (1 - \sigma_k)\|T_k w_k - a^*\|^2 + \sigma_k\|T_k z_k - a^*\|^2 - \sigma_k(1 - \sigma_k)\|T_k w_k - T_k z_k\|^2$
$\le \|w_k - a^*\|^2 - \sigma_k\lambda_k(1 - \lambda_k)\|w_k - T_k w_k\|^2 - \sigma_k(1 - \sigma_k)\|T_k w_k - T_k z_k\|^2.$  (16)
From (12), (14), (16), Lemmas 2(iii) and 3, we have:
$\|x_{k+1} - a^*\|^2 \le \|\alpha_k(g(w_k) - g(a^*)) + \beta_k(T_k z_k - a^*) + \gamma_k(T_k y_k - a^*)\|^2 + 2\alpha_k\langle g(a^*) - a^*, x_{k+1} - a^*\rangle$
$\le \alpha_k\|g(w_k) - g(a^*)\|^2 + \beta_k\|T_k z_k - a^*\|^2 + \gamma_k\|T_k y_k - a^*\|^2 + 2\alpha_k\langle g(a^*) - a^*, x_{k+1} - a^*\rangle$
$\le \alpha_k\eta\|w_k - a^*\|^2 + \beta_k\|w_k - a^*\|^2 + \gamma_k\|w_k - a^*\|^2 - \gamma_k\sigma_k\lambda_k(1 - \lambda_k)\|w_k - T_k w_k\|^2 - \gamma_k\sigma_k(1 - \sigma_k)\|T_k w_k - T_k z_k\|^2 + 2\alpha_k\langle g(a^*) - a^*, x_{k+1} - a^*\rangle$
$\le (1 - \alpha_k(1 - \eta))\|x_k - a^*\|^2 + \alpha_k\Big[2\|x_k - a^*\|\frac{\theta_k}{\alpha_k}\|x_k - x_{k-1}\| + \frac{\theta_k}{\alpha_k}\|x_k - x_{k-1}\|\,\theta_k\|x_k - x_{k-1}\| + 2\langle g(a^*) - a^*, x_{k+1} - a^*\rangle\Big] - \gamma_k\sigma_k\lambda_k(1 - \lambda_k)\|w_k - T_k w_k\|^2 - \gamma_k\sigma_k(1 - \sigma_k)\|T_k w_k - T_k z_k\|^2.$  (17)
So, we get:
$\gamma_k\sigma_k\lambda_k(1 - \lambda_k)\|w_k - T_k w_k\|^2 \le \|x_k - a^*\|^2 - \|x_{k+1} - a^*\|^2 + \alpha_k M_2,$  (18)
and:
$\gamma_k\sigma_k(1 - \sigma_k)\|T_k w_k - T_k z_k\|^2 \le \|x_k - a^*\|^2 - \|x_{k+1} - a^*\|^2 + \alpha_k M_2,$  (19)
where:
$M_2 = \sup_{k \ge 1}\Big\{2\|x_k - a^*\|\frac{\theta_k}{\alpha_k}\|x_k - x_{k-1}\| + \frac{\theta_k}{\alpha_k}\|x_k - x_{k-1}\|\,\theta_k\|x_k - x_{k-1}\| + 2\langle g(a^*) - a^*, x_{k+1} - a^*\rangle\Big\}.$
Now, we show that $\{x_k\}$ converges strongly to $a^*$. Let $a_k = \|x_k - a^*\|^2$, and suppose that $\{a_{k_i}\}$ is a subsequence of $\{a_k\}$ such that $\liminf_{i\to\infty}(a_{k_i+1} - a_{k_i}) \ge 0$. By (19) and the conditions on $\{\lambda_k\}$, $\{\sigma_k\}$, $\{\gamma_k\}$, $\{\beta_k\}$, and $\{\alpha_k\}$, we have:
$\limsup_{i\to\infty}\gamma_{k_i}\sigma_{k_i}(1 - \sigma_{k_i})\|T_{k_i}w_{k_i} - T_{k_i}z_{k_i}\|^2 \le \limsup_{i\to\infty}(a_{k_i} - a_{k_i+1} + \alpha_{k_i}M_2) \le \limsup_{i\to\infty}(a_{k_i} - a_{k_i+1}) + \limsup_{i\to\infty}\alpha_{k_i}M_2 = -\liminf_{i\to\infty}(a_{k_i+1} - a_{k_i}) \le 0.$
This implies that:
$\lim_{i\to\infty}\|T_{k_i}w_{k_i} - T_{k_i}z_{k_i}\| = 0.$
Similarly, we have $\lim_{i\to\infty}\|w_{k_i} - T_{k_i}w_{k_i}\| = 0$. By the definitions of $\theta_k$ and $w_k$, we get:
$\lim_{i\to\infty}\|w_{k_i} - x_{k_i}\| = 0.$
So, we obtain:
$\|x_{k_i} - T_{k_i}x_{k_i}\| \le \|x_{k_i} - T_{k_i}w_{k_i}\| + \|T_{k_i}w_{k_i} - T_{k_i}x_{k_i}\| \le 2\|w_{k_i} - x_{k_i}\| + \|w_{k_i} - T_{k_i}w_{k_i}\| \to 0.$
From the definitions of $z_k$ and $y_k$ in Algorithm 1, we have:
$\|z_{k_i} - x_{k_i}\| \le \|w_{k_i} - x_{k_i}\| + \lambda_{k_i}\|w_{k_i} - T_{k_i}w_{k_i}\|, \qquad \|y_{k_i} - T_{k_i}w_{k_i}\| = \sigma_{k_i}\|T_{k_i}w_{k_i} - T_{k_i}z_{k_i}\|,$
and:
$\|y_{k_i} - x_{k_i}\| \le \|y_{k_i} - T_{k_i}w_{k_i}\| + \|T_{k_i}w_{k_i} - w_{k_i}\| + \|w_{k_i} - x_{k_i}\|.$
This implies:
$\lim_{i\to\infty}\|z_{k_i} - x_{k_i}\| = \lim_{i\to\infty}\|y_{k_i} - x_{k_i}\| = 0.$
Moreover,
$\|x_{k_i+1} - x_{k_i}\| \le \|x_{k_i+1} - T_{k_i}x_{k_i}\| + \|T_{k_i}x_{k_i} - x_{k_i}\| \le \alpha_{k_i}\|g(w_{k_i}) - T_{k_i}x_{k_i}\| + \beta_{k_i}\|z_{k_i} - x_{k_i}\| + \gamma_{k_i}\|y_{k_i} - x_{k_i}\| + \|T_{k_i}x_{k_i} - x_{k_i}\|,$
which implies $\lim_{i\to\infty}\|x_{k_i+1} - x_{k_i}\| = 0$. Now, we claim that:
$\limsup_{i\to\infty}\langle g(a^*) - a^*, x_{k_i+1} - a^*\rangle \le 0.$
Indeed, choose a subsequence $\{x_{k_{i_j}}\}$ of $\{x_{k_i}\}$ such that:
$\limsup_{i\to\infty}\langle g(a^*) - a^*, x_{k_i} - a^*\rangle = \lim_{j\to\infty}\langle g(a^*) - a^*, x_{k_{i_j}} - a^*\rangle.$
Since $\{x_{k_{i_j}}\}$ is bounded, there exists a subsequence $\{x_{k_{i_{j_p}}}\}$ of $\{x_{k_{i_j}}\}$ such that $x_{k_{i_{j_p}}} \rightharpoonup u \in H$. Without loss of generality, we may assume that $x_{k_{i_j}} \rightharpoonup u \in H$. Since $\{T_k\}$ satisfies the condition (Z), we have $u \in \Gamma$. Since $\lim_{i\to\infty}\|x_{k_i+1} - x_{k_i}\| = 0$ and $a^* = P_\Gamma g(a^*)$, we obtain:
$\limsup_{i\to\infty}\langle g(a^*) - a^*, x_{k_i+1} - a^*\rangle = \langle g(a^*) - a^*, u - a^*\rangle \le 0.$
By (17), the inequality $\limsup_{i\to\infty}\langle g(a^*) - a^*, x_{k_i+1} - a^*\rangle \le 0$, and $\lim_{k\to\infty}\frac{\theta_k}{\alpha_k}\|x_k - x_{k-1}\| = 0$, we can apply Lemma 4 to obtain $\lim_{k\to\infty}\|x_k - a^*\| = 0$; that is, $\{x_k\}$ converges strongly to $a^* = P_\Gamma g(a^*)$. This completes the proof.    □
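To illustrate the iteration of Algorithm 1, the following MATLAB sketch runs AVFPM with a single nonexpansive operator, chosen here as the metric projection onto the closed unit ball of $\mathbb{R}^2$, and the contraction $g(a) = 0.5a$. All parameter sequences are illustrative choices that satisfy the stated conditions and are not the settings of Section 4; in this toy setting $\Gamma$ is the unit ball, and the limit $a^* = P_\Gamma g(a^*)$ is the origin.

```matlab
% Sketch of Algorithm 1 (AVFPM) with T_k = T = projection onto the unit ball
% and g(a) = 0.5*a (a contraction with eta = 0.5). Illustrative parameters only.
T = @(a) a / max(1, norm(a));          % nonexpansive: metric projection onto the unit ball
g = @(a) 0.5*a;                        % contraction

x_prev = [5; -3];  x = [4; 2];         % x_0 and x_1
for k = 1:10000
    alpha  = 1/(k+1);                  % alpha_k -> 0, sum alpha_k = inf
    beta   = 0.3;                      % beta_k in (0,1)
    gamma  = 1 - alpha - beta;         % alpha_k + beta_k + gamma_k = 1
    lambda = 0.5;  sigma = 0.5;        % lambda_k, sigma_k bounded away from 0 and 1
    tau_k  = 1/k^2;  mu_k = 0.9;       % tau_k/alpha_k -> 0, {mu_k} bounded

    % Step 1: inertial parameter theta_k
    if norm(x - x_prev) > 0
        theta = min(mu_k, tau_k/norm(x - x_prev));
    else
        theta = mu_k;
    end

    % Step 2: AVFPM update
    w = x + theta*(x - x_prev);
    z = (1 - lambda)*w + lambda*T(w);
    y = (1 - sigma)*T(w) + sigma*T(z);
    x_new = alpha*g(w) + beta*T(z) + gamma*T(y);

    x_prev = x;  x = x_new;
end
disp(x)   % slowly approaches the origin, the unique a* = P_Gamma g(a*) in this example
```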
Finally, we apply Algorithm 1 (AVFPM) to the nonsmooth convex optimization problem (3) with the sum of two objective functions $\phi_1$ and $\phi_2$ under the following assumptions:
  • $g : H \to H$ is a contraction with constant $\eta \in (0,1)$;
  • $\phi_1 : H \to \mathbb{R}$ is convex and differentiable with an $L$-Lipschitz continuous gradient, $L > 0$;
  • $\phi_2 : H \to \mathbb{R} \cup \{\infty\}$ is a proper, convex, and lower semi-continuous function;
  • $\Omega := \mathrm{Argmin}(\phi_1 + \phi_2) \ne \emptyset$.
By setting $T_k = \mathrm{prox}_{c_k\phi_2}(I - c_k\nabla\phi_1)$, the forward–backward operator of $\phi_1$ and $\phi_2$ with respect to $c_k \in (0, 2/L)$ with $c_k \to c$, we obtain an accelerated viscosity forward–backward method for solving Problem (3) as follows:
Algorithm 2: An accelerated viscosity forward–backward method (AVFBM).
Initialization: Take $x_0, x_1 \in H$ arbitrarily and positive sequences $\{\lambda_k\}$, $\{\sigma_k\}$, $\{\gamma_k\}$, $\{\beta_k\}$, and $\{\alpha_k\}$ satisfying the following conditions:
$\{\alpha_k\} \subset (0,1)$, $\lim_{k\to\infty}\alpha_k = 0$ and $\sum_{k=1}^{\infty}\alpha_k = \infty$; $\{\beta_k\} \subset (0,1)$; $0 < a_5 \le \gamma_k < 1$; $\alpha_k + \beta_k + \gamma_k = 1$; $0 < a_1 \le \lambda_k \le a_2 < 1$; $0 < a_3 \le \sigma_k \le a_4 < 1$,
for some positive real numbers $a_1$, $a_2$, $a_3$, $a_4$, and $a_5$.
Iterative steps: Calculate $x_{k+1}$ as follows:
Step 1. Choose a bounded sequence of non-negative real numbers $\{\mu_k\}$. For $k \ge 1$, define $\theta_k$ as in Algorithm 1.
Step 2. Compute
$w_k = x_k + \theta_k(x_k - x_{k-1})$,
$z_k = (1 - \lambda_k)w_k + \lambda_k\,\mathrm{prox}_{c_k\phi_2}(I - c_k\nabla\phi_1)w_k$,
$y_k = (1 - \sigma_k)\,\mathrm{prox}_{c_k\phi_2}(I - c_k\nabla\phi_1)w_k + \sigma_k\,\mathrm{prox}_{c_k\phi_2}(I - c_k\nabla\phi_1)z_k$,
$x_{k+1} = \alpha_k g(w_k) + \beta_k\,\mathrm{prox}_{c_k\phi_2}(I - c_k\nabla\phi_1)z_k + \gamma_k\,\mathrm{prox}_{c_k\phi_2}(I - c_k\nabla\phi_1)y_k$.
Update $k := k + 1$ and return to Step 1.
Next, we prove the strong convergence of Algorithm 2 (AVFBM) by using Theorem 5.
Theorem 6. Let $\{x_k\}$ be a sequence generated by Algorithm 2 (AVFBM). Then, $\{x_k\}$ converges strongly to an element $a^* \in \Omega$, where $a^* = P_\Omega g(a^*)$.
Proof. 
Let $T := \mathrm{prox}_{c\phi_2}(I - c\nabla\phi_1)$ and $T_k := \mathrm{prox}_{c_k\phi_2}(I - c_k\nabla\phi_1)$. Then $T$ and $T_k$ are nonexpansive operators for all $k$, and $\mathrm{Fix}(T) = \bigcap_{k=1}^{\infty}\mathrm{Fix}(T_k) = \mathrm{Argmin}(\phi_1 + \phi_2)$. By Lemma 1, $\{T_k\}$ satisfies the NST-condition (I) with $T$, and hence the condition (Z). Therefore, we obtain the result directly from Theorem 5. □
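As an illustration of Algorithm 2, the following MATLAB sketch applies AVFBM to a small synthetic instance of the LASSO problem (2), with the forward–backward operator $\mathrm{prox}_{c_k\phi_2}(I - c_k\nabla\phi_1)$ realized by a gradient step followed by soft thresholding. The parameter sequences follow the pattern used later in Section 4.1, but the problem sizes, the noise, and the value of tau are placeholders, so this is a sketch rather than the authors' experimental code.

```matlab
% Sketch of Algorithm 2 (AVFBM) for the LASSO problem (2):
% phi1(x) = ||Bx - u||_2^2, phi2(x) = tau*||x||_1. Synthetic placeholder data.
rng(2);
m = 60; n = 120;
B  = randn(m, n);
x0 = zeros(n, 1); x0(randperm(n, 6)) = randn(6, 1);   % sparse "true" signal
u  = B*x0 + 1e-3*randn(m, 1);
tau = 1e-2;

Lip  = 2*max(eig(B'*B));                              % Lipschitz constant of grad phi1
soft = @(x, t) sign(x).*max(abs(x) - t, 0);           % prox of t*||.||_1
FB   = @(x, c) soft(x - c*(2*B'*(B*x - u)), c*tau);   % forward-backward operator
g    = @(x) 0.95*x;                                   % contraction, as in Section 4

t = 1;                                                % FISTA-type sequence defining mu_k
x_prev = zeros(n, 1);  x = zeros(n, 1);
for k = 1:500
    alpha  = 1/(50*k);  beta = 1/(300*k + 1);  gamma = 1 - alpha - beta;
    lambda = 0.5*k/(k+1);  sigma = 0.99*k/(k+1);
    c_k    = k/(Lip*(k+1));                           % step sizes c_k -> 1/Lip
    t_next = (1 + sqrt(1 + 4*t^2))/2;  mu = (t - 1)/t_next;  t = t_next;
    tau_k  = 1e15/k^2;                                % tau_k/alpha_k -> 0

    if norm(x - x_prev) > 0
        theta = min(mu, tau_k/norm(x - x_prev));
    else
        theta = mu;
    end

    w = x + theta*(x - x_prev);
    z = (1 - lambda)*w + lambda*FB(w, c_k);
    y = (1 - sigma)*FB(w, c_k) + sigma*FB(z, c_k);
    x_new = alpha*g(w) + beta*FB(z, c_k) + gamma*FB(y, c_k);

    x_prev = x;  x = x_new;
end
fprintf('objective = %.4f\n', norm(B*x - u)^2 + tau*norm(x, 1));
```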

4. Application and Simulated Results

4.1. Image Restoration

In this example, we apply Algorithm 2 (AVFBM) to the image restoration problem (2) and compare the deblurring efficiency of AVFBM, FBS [10], and FISTA [12]. Our programs are written in MATLAB and run on a laptop with an Intel Core i5 processor and 4.00 GB of RAM under Windows 8 (64-bit). All algorithms are applied to the $\ell_1$-regularization problem (2); that is, $\phi_1(x) = \|Bx - b\|_2^2$ and $\phi_2(x) = \tau\|x\|_1$, where $B$ is the blurring operator, $b$ is the observed image, and $\tau$ is the regularization parameter. The maximum number of iterations for all methods was fixed at 500.
In these experiments, we consider four gray-scale images (Cameraman, Lenna, Woman, and Boy) of size $256 \times 256$ as the original images and apply a Gaussian blur of filter size $9 \times 9$ with standard deviation $\sigma = 4$ and noise level $10^{-4}$. We measure the performance of AVFBM, FBS, and FISTA by means of the signal-to-noise ratio (SNR) [27] and the peak signal-to-noise ratio (PSNR) [28]. The SNR and PSNR at $x_k$ of the restored images are defined as:
$\mathrm{SNR}(x, x_k) = 10\log_{10}\dfrac{\|x - \bar{x}\|^2}{\|x_k - x\|^2},$
$\mathrm{PSNR}(x_k) = 10\log_{10}\dfrac{255^2}{\mathrm{MSE}},$
where $\mathrm{MSE} = \frac{1}{256^2}\|x_k - x\|_2^2$, $x$ is the original image, and $\bar{x}$ is the mean of the original image. The regularization parameter was chosen as $\tau = 10^{-4}$, and the initial image was the blurred image. The Lipschitz constant of the gradient $\nabla\phi_1$ is $L = 2\lambda_{\max}(B^T B)$ [12]. The parameters of the algorithms are chosen as follows: $\lambda_k = \frac{0.5k}{k+1}$, $\sigma_k = \frac{0.99k}{k+1}$, $\alpha_k = \frac{1}{50k}$, $\beta_k = \frac{1}{300k+1}$, $\gamma_k = 1 - \alpha_k - \beta_k$, $c_k = \frac{k}{L(k+1)}$, $c = \frac{1}{L}$, $\tau_k = \frac{10^{15}}{k^2}$, and $\mu_k = \frac{t_k - 1}{t_{k+1}}$, where $t_k$ is the sequence defined by $t_1 = 1$ and $t_{k+1} = \frac{1 + \sqrt{1 + 4t_k^2}}{2}$. The contraction mapping is defined by $g(a) = 0.95a$ for all $a \in \mathbb{R}^n$. The comparison of the performance of AVFBM, FISTA, and FBS in terms of SNR and PSNR is shown in Figure 1, and the plots of SNR and PSNR at $x_k$ of the restored images are shown in Figure 2. We see from Figure 1 and Figure 2 that AVFBM attains higher SNR and PSNR than the other methods. The deblurring results of the three methods on the four images are shown in Figure 3.
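The two quality measures are simple to evaluate; a short MATLAB sketch is given below, where the original and restored images are random placeholders stored as double arrays in the range [0, 255].

```matlab
% SNR and PSNR of a restored image x_k with respect to the original image x.
% The images here are random placeholders in the range [0, 255].
x  = 255*rand(256, 256);          % original image (placeholder)
xk = x + 5*randn(256, 256);       % "restored" image (placeholder)

snr_val  = 10*log10( norm(x(:) - mean(x(:)))^2 / norm(xk(:) - x(:))^2 );
mse      = norm(xk(:) - x(:))^2 / 256^2;
psnr_val = 10*log10( 255^2 / mse );
fprintf('SNR = %.2f dB, PSNR = %.2f dB\n', snr_val, psnr_val);
```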

4.2. Data Classification

In this section, a learning algorithm named the extreme learning machine (ELM) [29] will be investigated. ELM is a learning algorithm for single-hidden-layer feedforward neural networks (SLFNs). Let $D = \{(x_i, t_i) : x_i \in \mathbb{R}^n, t_i \in \mathbb{R}^m, i = 1, 2, \ldots, N\}$ be a training dataset with $N$ distinct training samples $x_i$ and labels $t_i$. For a given number $M$ of nodes in the hidden layer, the SLFN output for the $j$th pattern, $o_j \in \mathbb{R}^m$, is given by:
$o_j = \sum_{i=1}^{M} w_i f(\langle h_i, x_j\rangle + b_i), \quad j = 1, 2, \ldots, N,$
where $f$ is the activation function, $h_i \in \mathbb{R}^n$ and $b_i \in \mathbb{R}$, $i = 1, 2, \ldots, M$, are the weight vector and bias connecting the input layer to the $i$th hidden node, respectively, and $w_i \in \mathbb{R}^m$, $i = 1, 2, \ldots, M$, is the weight vector connecting the $i$th hidden node to the output layer. The target of the SLFN is to approximate the parameters $w_i, h_i, b_i$ for all $i = 1, 2, \ldots, M$ such that:
$t_j = \sum_{i=1}^{M} w_i f(\langle h_i, x_j\rangle + b_i), \quad j = 1, 2, \ldots, N,$  (30)
which means that the error $\sum_{i=1}^{N}\|o_i - t_i\|$ is close to 0, while ELM is used to find only the parameters $w_i$, with $h_i$ and $b_i$ chosen randomly. With the above $N$ equations, Equation (30) can be rewritten as:
$\mathbf{H}\mathbf{w} = \mathbf{T},$  (31)
where:
$\mathbf{H} = \begin{bmatrix} f(\langle h_1, x_1\rangle + b_1) & \cdots & f(\langle h_M, x_1\rangle + b_M) \\ \vdots & \ddots & \vdots \\ f(\langle h_1, x_N\rangle + b_1) & \cdots & f(\langle h_M, x_N\rangle + b_M) \end{bmatrix}_{N \times M},$
$\mathbf{w} = [w_1, \ldots, w_M]^T \in \mathbb{R}^{M \times m}$ and $\mathbf{T} = [t_1, \ldots, t_N]^T \in \mathbb{R}^{N \times m}$. From Equation (31), the ELM learning algorithm estimates the weight as $\mathbf{w} = \mathbf{H}^{\dagger}\mathbf{T}$, where $\mathbf{H}^{\dagger} = (\mathbf{H}^T\mathbf{H})^{-1}\mathbf{H}^T$ is the pseudo-inverse matrix of $\mathbf{H}$. Note that the linear system (31) can be solved by a least squares method. As shown in [29], ELM has an extremely fast training speed and good generalization performance. Nevertheless, its solutions also have some drawbacks [30]. To overcome these drawbacks, the regularized extreme learning machine (RegELM) [30], which replaces the least squares method by a regularization method (i.e., ridge regression) for the training model, was proposed; the mathematical model of the RegELM algorithm can be described as:
$\min_{\mathbf{w} \in \mathbb{R}^{M \times m}} \; \frac{1}{2}\|\mathbf{H}\mathbf{w} - \mathbf{T}\|_2^2 + \frac{\lambda}{2}\|\mathbf{w}\|_2^2,$  (32)
where $\lambda > 0$ is called the regularization parameter. The RegELM output weight can be calculated by $\mathbf{w} = (\lambda I + \mathbf{H}^T\mathbf{H})^{-1}\mathbf{H}^T\mathbf{T}$, where $I$ is the identity matrix. Although RegELM can be expected to provide better generalization ability than ELM and its running time is extremely fast, similarly to ELM, we can define a further generalization of RegELM, as in [31], by replacing Equation (32) with the following:
$\min_{\mathbf{w} \in \mathbb{R}^{M \times m}} \; \frac{1}{2}\|\mathbf{H}\mathbf{w} - \mathbf{T}\|_2^2 + \lambda\Big[(1 - \alpha)\frac{1}{2}\|\mathbf{w}\|_2^2 + \alpha\|\mathbf{w}\|_1\Big],$  (33)
where $0 \le \alpha \le 1$. Equation (33), called the elastic net, trades off between ridge regression ($\alpha = 0$) and the LASSO ($\alpha = 1$). In this paper, we present a new algorithm for RegELM and employ our results for data classification problems with benchmark datasets. For this case, we set $\alpha = 1$, so Problem (33) becomes a LASSO problem. By our result (Theorem 6) in Section 3, we can apply AVFBM (Algorithm 2) to solve this LASSO problem and define a learning algorithm for RegELM as follows (a MATLAB sketch is given after the steps below):
RegELM-AVFBM: Given a training set $D = \{(x_i, t_i) : x_i \in \mathbb{R}^n, t_i \in \mathbb{R}^m, i = 1, 2, \ldots, N\}$ and an activation function $f$:
  • Step 1: Select the regularization parameter $\lambda$ and the number of hidden nodes $M$.
  • Step 2: Randomly generate $h_i$ and $b_i$, $i = 1, \ldots, M$.
  • Step 3: Calculate the hidden layer output matrix $\mathbf{H}$.
  • Step 4: Obtain the output weight $\mathbf{w}$ by using AVFBM (Algorithm 2).
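The following MATLAB sketch walks through Steps 1–4 on placeholder data: the hidden layer is drawn randomly, the output matrix $\mathbf{H}$ is formed with the sigmoid activation, and the output weight $\mathbf{w}$ is obtained from the LASSO model (33) with $\alpha = 1$. For brevity, Step 4 uses a plain forward–backward loop as a stand-in for the full Algorithm 2 (sketched after Theorem 6); the dataset, the number of hidden nodes, and the regularization parameter are illustrative assumptions.

```matlab
% Sketch of the RegELM-AVFBM steps on placeholder data.
% Steps 1-3 follow the list above; Step 4 is shown with a plain
% forward-backward loop as a stand-in for the full Algorithm 2.
rng(3);
N = 150; n = 4; m = 3;                 % samples, input dimension, number of classes
X = randn(N, n);                       % training inputs (placeholder)
[~, lab] = max(X(:, 1:m), [], 2);      % synthetic class labels
T = full(sparse(1:N, lab, 1, N, m));   % one-hot targets, N x m

M      = 50;                           % Step 1: number of hidden nodes
lambda = 1e-5;                         %         regularization parameter
h = randn(M, n);  b = randn(1, M);     % Step 2: random hidden weights and biases
H = 1./(1 + exp(-(X*h' + b)));         % Step 3: hidden layer output matrix, N x M

% Step 4: output weight w from min (1/2)||Hw - T||^2 + lambda*||w||_1
Lip  = max(eig(H'*H));                 % Lipschitz constant of the smooth part
c    = 1/Lip;
soft = @(W, t) sign(W).*max(abs(W) - t, 0);
w = zeros(M, m);
for k = 1:2000
    w = soft(w - c*(H'*(H*w - T)), c*lambda);
end

[~, pred] = max(H*w, [], 2);           % predicted class = largest network output
fprintf('training accuracy = %.4f\n', mean(pred == lab));
```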
Several benchmark problems were chosen for the experiments. All datasets were downloaded from https://archive.ics.uci.edu/ (accessed on 6 April 2020). The information on each dataset, namely the name, the number of attributes (input nodes), the number of classes (output nodes), and the number of samples, is summarized in Table 1. Each dataset was normalized to zero mean and unit variance; 70% of the data were sampled for training, and the remaining 30% were used for testing. For each method, we tested different numbers of hidden nodes $M$ in order to see which architecture provided the best results; the number of nodes in the hidden layer varied from 1 to 200 for the Abalone dataset and from 1 to 100 for the other five datasets. For each method, we used the sigmoid function as the activation function $f$ and the regularization parameter $\lambda = 1 \times 10^{-5}$ for the regularized methods (RegELM and RegELM-AVFBM). For the approximation methods (AVFBM, FISTA), we used the relative error criterion $\frac{\|x_k - x_{k-1}\|}{\|x_k\|} < \epsilon$ as the stopping rule and set all control sequences ($\lambda_k$, $\sigma_k$, $\alpha_k$, $\beta_k$, $\gamma_k$, $c_k$, $\tau_k$, $\mu_k$) as in Section 4.1. The performance of each method is evaluated by the accuracy, i.e., the total rate of classifying each case correctly; it is described as follows:
$\mathrm{Accuracy} = \dfrac{TP + TN}{TP + FP + TN + FN},$
where $TP$, $TN$, $FP$, and $FN$ denote the numbers of true positives, true negatives, false positives, and false negatives, respectively.
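For a binary task, the quantities $TP$, $TN$, $FP$, and $FN$ can be counted directly from the label vectors; a small MATLAB sketch with placeholder labels follows.

```matlab
% Accuracy from true/false positive and negative counts (placeholder labels).
y_true = [1 1 0 1 0 0 1 0 1 0];   % ground-truth binary labels
y_pred = [1 0 0 1 0 1 1 0 1 0];   % predicted binary labels

TP = sum(y_pred == 1 & y_true == 1);
TN = sum(y_pred == 0 & y_true == 0);
FP = sum(y_pred == 1 & y_true == 0);
FN = sum(y_pred == 0 & y_true == 1);
accuracy = (TP + TN) / (TP + FP + TN + FN);
fprintf('accuracy = %.2f\n', accuracy);
```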
In the experimental results, the training and testing accuracies (in percentage) and the suitable numbers of hidden nodes of our method, compared with the direct methods, namely the standard ELM [29] and RegELM [30], are reported in Table 2. RegELM-AVFBM behaves well in terms of prediction accuracy and fit on the testing datasets compared with the two direct methods. However, it is hard to compare the computational time directly, since the approximation methods take time to iterate toward the solution. Thus, for a fair comparison, we use two approximation methods (FISTA and AVFBM) for training RegELM and train the model with six different stopping errors $\epsilon$ under a maximum of 100,000 iterations. Table 3 shows the training and testing accuracies (in percentage), computational time (in seconds), number of iterations, and number of suitable nodes in the hidden layer.

5. Conclusions

In this work, by using the inertial technique together with the viscosity approximation method, we propose a new accelerated algorithm for finding a common fixed point of a countable family of nonexpansive operators in a real Hilbert space. The strong convergence of the proposed method is established under suitable conditions. As a special case, we obtain a new accelerated algorithm, called the accelerated viscosity forward–backward method (AVFBM), for solving nonsmooth convex optimization problems. We also apply AVFBM to image restoration and classification problems. Our experiments on the image restoration problem show that AVFBM achieves better SNR and PSNR than FBS and FISTA, which are among the most popular methods for solving such problems. Moreover, for the classification problems on six datasets, namely Zoo, Iris, Wine, Parkinsons, Heart Disease UCI, and Abalone (https://archive.ics.uci.edu/, accessed on 6 April 2020), we use AVFBM as a learning algorithm for finding the optimal output weight $\mathbf{w}$ in the mathematical model (32) of the classification problems. We compare the efficiency of our method with ELM, RegELM, and RegELM-FISTA using the training and testing accuracies and find that our algorithm outperforms the other methods, as seen in Table 2 and Table 3.

Author Contributions

Formal analysis, writing—original draft preparation, A.H.; methodology, writing—review and editing, software, L.B.; Conceptualization, revised the manuscript, S.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Chiang Mai University and NSRF [grant number B05F640183].

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

This research has received funding support from the NSRF via the Program Management Unit for Human Resources & Institutional Development, Research and Innovation [grant number B05F640183] and Chiang Mai University. The first author would like to thank Rajamangala University of Technology Isan for partial financial support, and L. Bussaban was supported by the Thailand Research Fund through the Royal Golden Jubilee (RGJ) PhD Programme (Grant No. PHD/0184/2560).

Conflicts of Interest

The authors declare that they have no competing interests.

References

  1. Maurya, A.; Tiwari, R. A Novel Method of Image Restoration by using Different Types of Filtering Techniques. Int. J. Eng. Sci. Innov. Technol. 2014, 3, 124–129. [Google Scholar]
  2. Suseela, G.; Basha, S.A.; Babu, K.P. Image Restoration Using Lucy Richardson Algorithm For X-Ray Images. IJISET Int. J. Innov. Sci. Eng. Technol. 2016, 3, 280–285. [Google Scholar]
  3. Tibshirani, R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. B Methodol. 1996, 58, 267–288. [Google Scholar]
  4. Vogel, C. Computational Methods for Inverse Problems; SIAM: Philadelphia, PA, USA, 2002. [Google Scholar]
  5. Sluder, G.; Wolf, D.E. Digital Microscopy, 3rd ed.; Elsevier: New York, NY, USA, 2007. [Google Scholar]
  6. Suantai, S.; Kankam, K.; Cholamjiak, P. A Novel Forward-Backward Algorithm for Solving Convex Minimization Problem in Hilbert Spaces. Mathematics 2020, 8, 42. [Google Scholar] [CrossRef] [Green Version]
  7. Bauschke, H.H.; Combettes, P.L. Convex Analysis and Monotone Operator Theory in Hilbert Spaces; Springer: New York, NY, USA, 2011. [Google Scholar]
  8. Burachik, R.S.; Iusem, A.N. Set-Valued Mappings and Enlargements of Monotone Operators; Springer Science Business Media: New York, NY, USA, 2007. [Google Scholar]
  9. Moreau, J.J. Proximité et dualité dans un espace hilbertien. B. Soc. Math. Fr. 1965, 93, 273–299. [Google Scholar]
  10. Lions, P.L.; Mercier, B. Splitting algorithms for the sum of two nonlinear operators. SIAM J. Numer. Anal. 1979, 16, 964–979. [Google Scholar]
  11. Nesterov, Y.E. A method for solving the convex programming problem with convergence rate O(1/k2). Sov. Math. Dokl. 1983, 27, 372–376. [Google Scholar]
  12. Beck, A.; Teboulle, M. A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM J. Imaging Sci. 2009, 2, 183–202. [Google Scholar]
  13. Liang, J.; Schonlieb, C.B. Improving fista: Faster, smarter and greedier. arXiv 2018, arXiv:1811.01430. [Google Scholar]
  14. Moudafi, A. Viscosity approximation method for fixed-points problems. J. Math. Anal. Appl. 2000, 241, 46–55. [Google Scholar]
  15. Takahashi, W. Viscosity approximation methods for countable families of nonexpansive mappings in Banach spaces. Nonlinear Anal. 2009, 70, 719–734. [Google Scholar]
  16. He, S.; Guo, J. Iterative algorithm for common fixed points of infinite family of nonexpansive mappings in Banach spaces. J. Appl. Math. 2012, 2012, 787419. [Google Scholar]
  17. Shimoji, K.; Takahashi, W. Strong convergence to common fixed points of infinite nonexpansive mappings and applications. Taiwan. J. Math. 2001, 5, 387–404. [Google Scholar]
  18. Aoyama, K.; Kimura, Y.; Takahashi, W.; Toyoda, M. Finding common fixed points of a countable family of nonexpansive mappings in a Banach space. Sci. Math. Jpn. 2007, 66, 325–335. [Google Scholar]
  19. Takahashi, W.; Takeuchi, Y.; Kubota, R. Strong convergence theorems by hybrid methods for families of nonexpansive mappings in Hilbert spaces. J. Math. Anal. Appl. 2008, 341, 276–286. [Google Scholar]
  20. Takahashi, W.; Yao, J.-C. Strong convergence theorems by hybrid methods for countable families of nonlinear operators in Banach spaces. J. Fixed Point Theory Appl. 2012, 11, 333–353. [Google Scholar]
  21. Nakajo, K.; Shimoji, K.; Takahashi, W. Strong convergence to common fixed points of families of nonexpansive mappings in Banach spaces. J. Nonlinear Convex Anal. 2007, 8, 11–34. [Google Scholar]
  22. Aoyama, K.; Kohsaka, F.; Takahashi, W. Strong convergence theorems by shrinking and hybrid projection methods for relatively nonexpansive mappings in Banach spaces. In Proceedings of the 5th International Conference on Nonlinear Analysis and Convex Analysis, Hakodate, Japan, 26–31 August 2009; pp. 7–26. [Google Scholar]
  23. Bussaban, L.; Suantai, S.; Kaewkhao, A. A parallel inertial S-iteration forward-backward algorithm for regression and classification problems. Carpathian J. Math. 2020, 36, 35–44. [Google Scholar]
  24. Takahashi, W. Introduction to Nonlinear and Convex Analysis; Yokohama Publishers: Yokohama, Japan, 2009. [Google Scholar]
  25. Chidume, C.E.; Ezeora, J.N. Krasnoselskii-type algorithm for family of multi-valued strictly pseudo-contractive mappings. Fixed Point Theory Appl. 2014, 2014, 111. [Google Scholar] [CrossRef] [Green Version]
  26. Saejung, S.; Yotkaew, P. Approximation of zeros of inverse strongly monotone operators in Banach spaces. Nonlinear Anal. 2012, 75, 724–750. [Google Scholar]
  27. Chen, D.Q.; Zhang, H.; Cheng, L.Z. A fast fixed point algorithm for total variation deblurring and segmentation. J. Math. Imaging Vis. 2012, 43, 167–179. [Google Scholar]
  28. Thung, K.; Raveendran, P. A survey of image quality measures. In Proceedings of the 2009 International Conference for Technical Postgraduates (TECHPOS), Kuala Lumpur, Malaysia, 14–15 December 2009; pp. 1–4. [Google Scholar]
  29. Huang, G.B.; Zhu, Q.Y.; Siew, C.K. Extreme learning machine. Neurocomputing 2006, 70, 489–501. [Google Scholar]
  30. Deng, W.; Zheng, Q.; Chen, L. Regularized extreme learning machine. In Proceedings of the 2009 IEEE Symposium on Computational Intelligence and Data Mining, Nashville, TN, USA, 30 March–2 April 2009; pp. 389–395. [Google Scholar]
  31. Martínez-Martínez, J.M.; Escandell-Montero, P.; Soria-Olivas, E.; Martín-Guerrero, J.D.; Magdalena-Benedito, R.; Gómez-Sanchis, J. Regularized extreme learning machine for regression problems. Neurocomputing 2011, 74, 3716–3721. [Google Scholar]
Figure 1. Comparison of SNR and PSNR by FBS, FISTA, AVFBM.
Figure 2. Plot of SNR and PSNR for the images.
Figure 3. Original images, blurred images, and deblurred images by FBS, FISTA, AVFBM.
Table 1. Information of benchmark datasets.
Datasets | # Attributes | # Classes | # Train (≈70%) | # Test (≈30%)
Zoo | 16 | 7 | 70 | 31
Iris | 4 | 3 | 105 | 45
Wine | 13 | 3 | 128 | 50
Parkinsons | 23 | 2 | 135 | 60
Heart Disease UCI | 14 | 2 | 213 | 90
Abalone | 8 | 3 | 2924 | 1253
Note that the cardinal number of set A is denoted by #A. For example, # Attributes is the number of attributes of the data.
Table 2. Comparison of the accuracy of training and testing as well as the number of hidden nodes for ELM, RegELM, and RegELM-AVFBM.
Datasets | ELM: Training (%) / Testing (%) / # Nodes | RegELM: Training (%) / Testing (%) / # Nodes | RegELM-AVFBM: Training (%) / Testing (%) / # Nodes
Zoo | 97.1429 / 93.5484 / 13 | 97.1429 / 93.5484 / 13 | 100 / 96.7742 / 93
Iris | 99.0476 / 100 / 60 | 98.0952 / 100 / 42 | 98.0952 / 100 / 54
Wine | 98.4375 / 100 / 36 | 98.4375 / 100 / 36 | 100 / 100 / 40
Parkinsons | 94.8148 / 75 / 31 | 94.8148 / 75 / 31 | 96.2963 / 81.6667 / 78
Heart Disease UCI | 86.385 / 84.4444 / 25 | 86.385 / 84.4444 / 25 | 88.7324 / 85.5556 / 33
Abalone | 69.0492 / 67.4381 / 89 | 69.0492 / 67.4381 / 89 | 68.4337 / 67.518 / 111
Table 3. Comparison of the accuracy of training and testing, computation time, number of iterations, and number of hidden nodes for RegELM-FISTA and RegELM-AVFBM. The sign "–" in the column # Iters means that the model exceeded the maximum number of iterations (100,000 iterations in this case).
Datasets | ϵ | RegELM-FISTA: Training (%) / Testing (%) / Time (s) / # Iters / # Nodes | RegELM-AVFBM: Training (%) / Testing (%) / Time (s) / # Iters / # Nodes
Zoo0.19090.32260.0005994113398.571493.54840.0008407982
0.0198.571493.54840.0021261407210096.77420.00792088193
0.00110093.548390.00854054554298.5714393.548390.00520315713
0.000197.14285793.5483870.015265613301397.1428693.548390.014222538413
0.0000197.14285793.5483870.035352826091397.1428693.548390.019383669413
0.00000197.14285793.5483870.056133741931397.14285793.5483870.0317928135413
Iris0.179.047691.11110.000650611398091.11110.0003463720
0.018091.11110.000854427996.190481000.003307511056
0.00192.3809597.777780.01413146582298.095241000.021739180454
0.000196.1904761000.05171542323898.0952381000.1359273489153
0.0000198.09523811000.624277841,5845698.09523811000.998174547,69542
0.000001--------
Wine0.197.6563960.0005292113199.2188980.0008743865
0.0199.2188980.001597930641001000.00471119840
0.00199.218751000.01067583644598.43751000.00629827136
0.000198.43751000.053602543743698.43751000.0234622114636
0.0000198.43751000.215090418,4063698.43751000.0710794313536
0.00000198.43751000.473309439,1083698.43751000.1426151734236
Parkinsons0.180.7407750.000536211580.740773.33330.000784345
0.0180.7407750.00031611596.296381.666670.003492711178
0.00196.296378.333330.01433036498395.5555676.666670.009272225231
0.000198.518519850.107251247029510076.6666670.0401135153360
0.0000199.259259378.33333330.369355131,4886095.555556750.0395779226631
0.00000195.5555556750.306722732,1853194.814815750.0887627542131
Heart Disease UCI0.182.629184.44440.000561115286.854585.55560.0006593872
0.0184.976584.44440.0008177315785.915584.44440.00276267325
0.00187.7934386.666670.01157456006188.7323985.555560.005146624033
0.000190.14084585.5555560.130025452315886.3849884.444440.013131764425
0.0000186.38497784.4444440.062938565072586.38497784.4444440.042063170725
0.00000186.384976584.44444440.124522212,5052586.38497784.4444440.054982336625
Abalone0.157.284556.34480.000833211957.010956.6640.0007203716
0.0159.1313357.861130.01164774714766.7236766.00160.019906711196
0.00164.7400864.086190.0777244511168.6388567.118910.2978755817175
0.000166.79206666.4006380.820114755609668.43365367.5179570.95156344480111
0.0000168.536251767.198723111.926980351,90014968.741450167.67757383.482610421,87789
0.000001----68.946648467.677573813.056653181,39289

Hanjing, A.; Bussaban, L.; Suantai, S. The Modified Viscosity Approximation Method with Inertial Technique and Forward–Backward Algorithm for Convex Optimization Model. Mathematics 2022, 10, 1036. https://0-doi-org.brum.beds.ac.uk/10.3390/math10071036
