Article

Hybrid ELM and MARS-Based Prediction Model for Bearing Capacity of Shallow Foundation

by Manish Kumar, Vinay Kumar, Rahul Biswas, Pijush Samui, Mosbeh R. Kaloop, Majed Alzara and Ahmed M. Yosri

1 Department of Civil Engineering, SRM Institute of Science and Technology (SRMIST), Tiruchirappalli 621105, India
2 Department of Civil Engineering, NIT, Patna 800005, India
3 Department of Civil Engineering, NIT, Ravangla 737139, India
4 Department of Public Works Engineering, Mansoura University, Mansoura 35516, Egypt
5 Department of Civil and Environmental Engineering, Incheon National University, Incheon 22012, Korea
6 Department of Civil Engineering, College of Engineering, Jouf University, Sakakah 72388, Saudi Arabia
7 Civil Engineering Department, Faculty of Engineering, Delta University for Science and Technology, Belkas 11152, Egypt
* Author to whom correspondence should be addressed.
Submission received: 15 April 2022 / Revised: 11 May 2022 / Accepted: 16 May 2022 / Published: 19 May 2022

Abstract
The nature of soil varies horizontally as well as vertically, owing to the process of soil formation. Ensuring the safe design of geotechnical structures has thus been a major challenge. For shallow foundations, conducting field tests is expensive and time-consuming, and tests are often conducted on significantly scaled-down models; empirical models, too, have been found to be the least reliable in the literature. This study proposes AI-based techniques to predict the bearing capacity of a shallow foundation, simulated using datasets obtained from experiments conducted in different laboratories reported in the literature. The results of the ELM-EO and ELM-PSO hybrid models are compared with those of the ELM and MARS models. The performance of the models is analyzed and compared using various performance parameters; the models are ranked against each other using rank analysis, and visual interpretations are provided using error matrices and REC curves. ELM-EO is concluded to be the best performing model (R2 and RMSE equal to 0.995 and 0.01, respectively, in the testing phase), closely followed by ELM-PSO, MARS, and ELM. The performance of MARS is better than that of ELM (R2 equals 0.995 and 0.94, respectively, in the testing phase); however, hybridization greatly enhances the performance of ELM, and the hybrid models perform better than MARS. The paper concludes that AI-based models are robust and that the hybridization of regression models with optimization techniques should be encouraged in further research. Sensitivity analysis suggests that all the input parameters have a significant influence on the output, with the friction angle having the greatest influence.

1. Introduction

Foundations are the most important part of a structure, as they transfer the load of the superstructure to the bearing strata. A shallow foundation is generally defined as a foundation whose depth is less than or equal to its breadth. Shallow foundations have a large base and small thickness, apart from the shallow embedment; resistance is developed only from the base, and failure occurs within a shallow depth extending to the surface. The bearing capacity of soil, also known as its load-carrying capacity, is one of the most significant topics in soil mechanics and foundation engineering. The greatest value of the applied load for which no point of the subsoil reaches the failure point is the bearing capacity of a shallow foundation [1]; frictional resistance along the sides makes a negligible contribution to the bearing capacity of shallow foundations. Construction of shallow foundations requires less cost and less time and causes the least surface disturbance. The ultimate bearing capacity of a foundation is the load per unit area of the foundation at which shear failure occurs. It can be evaluated either through field tests, such as plate load tests, or through empirical relations developed by various researchers in the literature. Experimental studies are typically conducted on highly scaled-down models of real footings. For evaluating the ultimate bearing capacity, the size of the foundation is a significant factor for both square and rectangular footings; as a result, laboratory-built micro footing models differ from real-world footings in behavior and stress distribution. This is known as the scale effect, and it has been investigated for many years by different researchers [2,3]. Although testing a full-size footing is crucial to understanding genuine soil–foundation behavior, it is an expensive, time-consuming, and analytically challenging process. The plate bearing test, standard penetration test, and pressuremeter test are all field tests that can be used to measure a soil's bearing capacity; these field tests, however, are time-consuming, costly, and difficult to manage and operate. The expense of testing can be so high that it exceeds the cost of the structure itself. As a result, contractors frequently supply estimated footing sizes based on arbitrary assumptions of soil bearing capacity extrapolated from previous site experience, which is cost-effective but erroneous. Various researchers in the literature have proposed empirical relations to predict the bearing capacity of a shallow foundation [4,5,6,7]; however, all of these conventional formulations involve limitations and assumptions and do not always produce realistic results when compared with experimental data [8,9,10,11]. As per the study of Rybak and Król [12], the limit state is rarely achieved and it is often impossible to estimate the ultimate load.
Soil is naturally heterogeneous, and thus uncertainties and variability are involved in its index and material properties. Uncertainty refers to the assessor's lack of awareness (degree of ignorance) of the elements that define the physical system being modeled; variability refers to the numerous values a property takes at different positions, times, or instances. Due to the heterogeneous nature of soil and the various sources of uncertainty and degrees of variability involved, there is a tendency to search for reliable machine learning (ML)-based soft computing models to predict the bearing capacity of shallow foundations. The goal of artificial intelligence is to create machines that study human thought patterns and reproduce them in practice, and it has found a wide range of applications in civil engineering in recent years [13,14,15,16,17]. Regression analysis is a statistical method of curve fitting that analyzes the relationship between the dependent variable(s) and the predictor variables. The artificial neural network (ANN) has been successfully applied in the regression analysis of shallow foundations [18,19,20,21]; however, its success is subject to various shortcomings, including its 'black-box' nature, overfitting, low generalization capability, and entrapment in local minima. Kalinli et al. [8] and Padmini et al. [22] applied ANN, FIS, and ANFIS to predict the bearing capacity of shallow foundations; ANN performed better than FIS but proved less robust than ANFIS. Bagińska and Srokosz [21] applied a deep neural network (DNN) to improve the performance of neural networks, with the best results achieved for an optimal number of five to seven layers. Gaussian process regression (GPR) was tested as a simulation model for the bearing capacity of a shallow foundation, and the developed model was concluded to be robust [23]. To improve the sluggish learning speed of traditional feedforward neural networks, Huang et al. [24] proposed the extreme learning machine (ELM), which significantly reduces the training time and improves the generalization performance of the single-layer feedforward neural network (SLFN). Though the application of ELM in shallow foundation analysis has been very limited, the model has been found to be robust in various disciplines of civil engineering [25,26,27,28,29]. Khaleel et al. [30] concluded that hybrid ELM models fare better than hybrid multiple linear regression (MLR) models. Traditional ML algorithms, despite yielding better results than statistical techniques, are prone to becoming trapped in local minima rather than finding the actual global minimum, which has unfavorable consequences. As a result, researchers are applying optimization methods to improve the classical ML parameters and address this challenge [31,32,33]. Gör [34] applied hybrid ANN models for the prediction of the bearing capacity of shallow foundations and observed a significant improvement in prediction accuracy after optimization. Moayedi et al. [35] developed a multi-layer perceptron (MLP) combined with an imperialist competitive algorithm (ICA) and improved R2 from 0.83 (for ANN alone) to 0.983. ELM-PSO has not yet been applied to shallow foundation problems, although its application in other domains has been very encouraging [36,37,38]. PSO, however, suffers from drawbacks such as high computational complexity and premature convergence [39,40,41].
EO is a fast and powerful metaheuristic optimization technique with high population-based performance [42]. ELM-EO has not been applied in many domains so far; however, it has proven robust in its limited applications [43]. The present paper makes a comparative study of the performance of ELM-PSO and ELM-EO against traditional ELM and multivariate adaptive regression splines (MARS) models. MARS has proven to be a strong performing model for the analysis of civil engineering problems [44,45,46,47,48,49], including shallow foundations [47,50].

2. Details of AI-Based Models Used

2.1. MARS

Friedman developed the multivariate adaptive regression spline (MARS) technique [44,51,52]. MARS is based on the methodology of non-parametric, nonlinear regression; it relates a large number of independent parameters to a continuous output variable and integrates recursive partitioning regression, additive regression, and spline regression. MARS selects its basis functions using forward and backward stepwise algorithms. Its prediction accuracy is relatively high with respect to other methods, and it is also highly adaptive. In general, the equation of non-parametric, nonlinear regression is as follows:
$$y_i = f(x_{i1}, x_{i2}, \ldots, x_{ik}) + \varepsilon_i \quad (1)$$
where $f(x_{i1}, x_{i2}, \ldots, x_{ik})$ is the regression function, which should be a smooth, continuous function, and $\varepsilon_i$ is the estimate of the error involved.
The model developed using MARS to predict the output $y$ from the given inputs is as follows:
$$y = C_0 + \sum_{m=1}^{M} C_m B_m(x) \quad (2)$$
where $C_0$ is a constant, $B_m(x)$ is a basis function, $x$ is the input variable, and $C_m$ is the coefficient of $B_m(x)$. The spline function consists of two parts, i.e., the left-sided truncated function in Equation (3a) and the right-sided truncated function in Equation (3b):
$$b_q^-(x - t) = [-(x - t)]_+^q = \begin{cases} (t - x)^q & \text{if } x < t \\ 0 & \text{otherwise} \end{cases} \quad (3a)$$
$$b_q^+(x - t) = [+(x - t)]_+^q = \begin{cases} (x - t)^q & \text{if } x > t \\ 0 & \text{otherwise} \end{cases} \quad (3b)$$
where $b_q^+(x - t)$ and $b_q^-(x - t)$ are the spline functions and $t$ is the knot location. In general, any model based on multivariate adaptive regression splines (MARS) follows three basic steps:
(a) Constructive phase;
(b) Pruning phase;
(c) Selection of the optimum MARS model.
At the beginning of the constructive phase, the basis functions play an important role in the formation of Equation (2), and their selection depends on the generalized cross-validation (GCV) criterion. The GCV is based on the residual sum of squares and applies a penalty for model complexity to prevent the use of a large number of spline functions. The GCV value is calculated from Equation (4):
$$GCV(M) = \frac{\frac{1}{n}\sum_{i=1}^{n}\left(y_i - \hat{y}_i\right)^2}{\left[1 - \frac{C(M)}{n}\right]^2} \quad (4)$$
The value of $C(M)$ is calculated from the following equation:
$$C(M) = M + dM \quad (5)$$
where $y_i$ is the response value for object $i$, $\hat{y}_i$ is the predicted response value for object $i$, $C(M)$ is the penalty factor, $d$ is the penalty per basis function, and $n$ is the total number of data objects.
Equation (5) imposes a cost penalty for each basis function added to the model. With many basis functions, overfitting of the data is possible; to circumvent this problem, redundant basis functions are deleted during the pruning stage. After completing all the required processes, the best MARS model is selected.
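To make the construction concrete, the short Python sketch below (an illustration, not the authors' MATLAB implementation) evaluates the first-order ($q = 1$) truncated spline functions of Equations (3a) and (3b) and combines them in the additive form of Equation (2); the knot location and coefficients are hypothetical.

```python
import numpy as np

def hinge_left(x, t):
    # Left-sided truncated function, Equation (3a) with q = 1: max(0, t - x)
    return np.maximum(0.0, t - x)

def hinge_right(x, t):
    # Right-sided truncated function, Equation (3b) with q = 1: max(0, x - t)
    return np.maximum(0.0, x - t)

# Hypothetical one-dimensional MARS model of the form of Equation (2):
# y = C0 + C1 * max(0, x - t) + C2 * max(0, t - x), with knot t = 0.4
x = np.linspace(0.0, 1.0, 5)
y = 0.2 + 0.8 * hinge_right(x, 0.4) - 0.3 * hinge_left(x, 0.4)
print(y)
```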

2.2. ELM

Huang developed the extreme learning machine in 2004 and published it in 2006 [24,53]. The ELM uses the sigmoid activation principle and consists of a three-layer neural network. The advantages of the extreme learning machine over conventional gradient-based learning methods are as follows:
  • It avoids a number of issues that are difficult to deal with in traditional methods, such as halting criteria, learning rate, learning epochs, and local minimums.
  • In most circumstances, it can provide better generalized performance than backpropagation (BP) since ELM is a one-pass learning technique that does not require re-iteration.
  • It can employ practically any nonlinear activation function.
The mathematical model of the ELM is described as follows:
Let us consider $N$ samples of data $(x_i, t_i)$, where $x_i = [x_{i1}, \ldots, x_{im}]^T \in \mathbb{R}^m$ and $t_i = [t_{i1}, \ldots, t_{iq}]^T \in \mathbb{R}^q$. The extreme learning machine (ELM) algorithm consists of a single hidden layer feedforward neural network with $\bar{N}$ hidden nodes and activation function $g(x)$:
$$\sum_{i=1}^{\bar{N}} \beta_i g_i(x_j) = \sum_{i=1}^{\bar{N}} \beta_i g(w_i \cdot x_j + b_i) = o_j, \quad j = 1, \ldots, N \quad (6)$$
where $w_i = [w_{i1}, w_{i2}, \ldots, w_{im}]^T$ is the weight vector connecting the input nodes to the $i$th hidden node, $\beta_i = [\beta_{i1}, \beta_{i2}, \ldots, \beta_{iq}]^T$ is the weight vector connecting the $i$th hidden node to the output nodes, and $b_i$ is the threshold (bias) of the $i$th hidden node.
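As a concrete illustration of this one-pass training scheme, the following minimal NumPy sketch assigns the input weights $w_i$ and biases $b_i$ at random and solves for the output weights $\beta$ with the Moore–Penrose pseudo-inverse of the hidden-layer output matrix. The synthetic data and the 25 hidden neurons are illustrative; only the overall procedure follows [24].

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Synthetic stand-in data: N samples, m = 5 inputs, 1 output (illustrative only)
N, m, n_hidden = 120, 5, 25
X = rng.uniform(0.0, 1.0, size=(N, m))
t = X @ rng.uniform(-1.0, 1.0, size=(m, 1)) + 0.05 * rng.standard_normal((N, 1))

# Step 1: randomly assign input weights w_i and hidden biases b_i (kept fixed)
W = rng.uniform(-1.0, 1.0, size=(m, n_hidden))
b = rng.uniform(-1.0, 1.0, size=(1, n_hidden))

# Step 2: compute the hidden-layer output matrix H
H = sigmoid(X @ W + b)

# Step 3: solve for the output weights beta in one pass (no re-iteration)
beta = np.linalg.pinv(H) @ t

# Prediction and training RMSE
y_hat = H @ beta
rmse = np.sqrt(np.mean((t - y_hat) ** 2))
print(f"training RMSE: {rmse:.4f}")
```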

2.3. PSO

Kennedy and Eberhart were the first to suggest particle swarm optimization [36,37,38]. A set of iterative techniques capable of leading the search operation to a quality result is referred to as a meta-heuristic. PSO models the social and cooperative behavior of populations (e.g., fish schooling and the flocking of birds) during food search. In PSO, each individual in the population is referred to as a particle, and the entire population is referred to as a swarm. The swarm is defined as a set
$$S = \{P_1, P_2, P_3, \ldots, P_N\} \quad (7)$$
of $N$ particles (candidate solutions), defined as:
$$P_i = (p_{i1}, p_{i2}, p_{i3}, \ldots, p_{im})^T \in A, \quad i = 1, 2, 3, 4, \ldots, N \quad (8)$$
where $P_i$ represents an individual particle in the swarm $S$ inside the search space $A$. Each $P_i$ contains the required number of dimension/control variables, designated by $(p_{i1}, p_{i2}, \ldots, p_{im})^T$:
$$P_i = (x_{il}, x_{ir}, \alpha_{il}, \alpha_{ir})^T \in A, \quad i = 1, 2, 3, 4, \ldots, N \quad (9)$$
The position ($X_i^k$) and velocity ($V_i^k$) are constantly updated as the particles move inside the search space $A$; here, $k$ represents the iteration step of the PSO algorithm. Therefore,
$$X_i^k = (x_{i1}^k, x_{i2}^k, x_{i3}^k, x_{i4}^k)^T \in A, \quad i = 1, 2, 3, 4, \ldots, N \quad (10a)$$
$$V_i^k = (v_{i1}^k, v_{i2}^k, v_{i3}^k, v_{i4}^k)^T, \quad i = 1, 2, 3, 4, \ldots, N \quad (10b)$$
In the early form of PSO, each particle employs Equation (11) to update its velocity.
$$V_i^{k+1} = V_i^k + c_1 \times rand_1 \times (X_{pbest}^k - X_i^k) + c_2 \times rand_2 \times (X_{sbest}^k - X_i^k) \quad (11)$$
The problem of premature convergence was observed to plague this early PSO variant [54,55]. To alleviate the problem, another parameter, the inertia weight coefficient ($\omega$), was introduced into the original equation, resulting in the velocity update of Equation (12):
$$V_i^{k+1} = \omega^k \times V_i^k + c_1 \times rand_1 \times (X_{pbest}^k - X_i^k) + c_2 \times rand_2 \times (X_{sbest}^k - X_i^k) \quad (12)$$
The inertia weight ($\omega$) can be assumed to follow a linearly decreasing pattern between maximum ($\omega_{max}$) and minimum ($\omega_{min}$) values:
$$\omega^k = \omega_{max} - (\omega_{max} - \omega_{min}) \frac{k}{k_{max}} \quad (13)$$
Later, a contemporary standard PSO (CS-PSO) version was developed by Clerc and Kennedy (2002), in which the velocity of the particles is updated as follows:
$$V_i^{k+1} = \eta \left[ V_i^k + c_1 \times rand_1 \times (X_{pbest}^k - X_i^k) + c_2 \times rand_2 \times (X_{sbest}^k - X_i^k) \right] \quad (14)$$
For all the above cases mentioned, the particle’s positions are updated as follows:
$$X_i^{k+1} = X_i^k + V_i^{k+1} \quad (15)$$
The constriction coefficient ($\eta$) is determined as suggested by Clerc and Kennedy (2002):
$$\eta = \frac{2}{\left| 2 - \phi - \sqrt{\phi^2 - 4\phi} \right|} \quad (16)$$
where $\phi = c_1 + c_2$.
In the CS-PSO variant, the cognitive ($c_1$) and social ($c_2$) parameters in Equation (14) are both set equal to 2.05. In the present work, contemporary standard PSO with velocity clamping is used; clamping is required so that the updated velocity does not move a particle away from the domain of interest.
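The sketch below is a minimal NumPy implementation of the inertia-weight velocity update of Equation (12), the linearly decreasing inertia weight of Equation (13), the position update of Equation (15), and velocity clamping, applied to a toy sphere objective; the swarm size, iteration count, and bounds are illustrative choices, not the settings of this paper.

```python
import numpy as np

rng = np.random.default_rng(1)

def sphere(x):
    # Toy objective to minimize (stand-in for the real cost function)
    return np.sum(x ** 2, axis=1)

n_particles, dim, k_max = 30, 5, 100
c1 = c2 = 2.05
w_max, w_min = 0.9, 0.4
lb, ub = -1.0, 1.0

X = rng.uniform(lb, ub, size=(n_particles, dim))     # positions
V = np.zeros_like(X)                                 # velocities
pbest, pbest_val = X.copy(), sphere(X)               # personal bests
g = pbest[np.argmin(pbest_val)]                      # swarm (global) best

for k in range(k_max):
    w = w_max - (w_max - w_min) * k / k_max          # Equation (13)
    r1, r2 = rng.random(X.shape), rng.random(X.shape)
    V = w * V + c1 * r1 * (pbest - X) + c2 * r2 * (g - X)  # Equation (12)
    V = np.clip(V, lb - ub, ub - lb)                 # velocity clamping
    X = np.clip(X + V, lb, ub)                       # Equation (15), kept in bounds
    val = sphere(X)
    improved = val < pbest_val
    pbest[improved], pbest_val[improved] = X[improved], val[improved]
    g = pbest[np.argmin(pbest_val)]

print("best value:", pbest_val.min())
```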

2.4. EO

The equilibrium optimizer (EO) is a meta-heuristic optimization technique recently developed by Faramarzi et al. [42] by mathematically modeling the theory of dynamic mass balance. The EO functions similarly to other meta-heuristics in that it searches through a set of potential solutions, which are called concentration vectors. These vectors are initialized as follows:
$$C_i = C_{min} + rand \times (C_{max} - C_{min}), \quad i = 1, 2, \ldots, N \quad (17)$$
where $C_i$ signifies the $i$th concentration vector; $rand$ is a uniformly distributed random number lying between 0 and 1; and $C_{min}$ and $C_{max}$ are the lower and upper bound vectors of the concentration vectors $C_i$. After each concentration vector has been initialized, it is iteratively updated to approach an optimal solution. Equation (18) gives the formula for updating the concentration vectors:
$$C_i^{t+1} = C_{EQ}^t + (C_i^t - C_{EQ}^t) \times F_i^t + (1 - F_i^t) \times \frac{G_i^t}{\lambda_i^t V_i^t} \quad (18)$$
where $t$ and $t+1$ denote successive iterations of the concentration vector ($C_i^t$ and $C_i^{t+1}$, respectively). The first and second terms represent the equilibrium concentration and the global search in the exploration phase, respectively. The third and final term aids the exploitation mechanism by extracting important information from the examined areas of the search space. $C_{EQ}^t$ is a concentration vector drawn at random from the equilibrium pool:
$$C_{EQ}^t = \{C_{EQ1}^t, C_{EQ2}^t, C_{EQ3}^t, C_{EQ4}^t\} \quad (19)$$
In terms of fitness, the vectors $C_{EQ1}^t$, $C_{EQ2}^t$, $C_{EQ3}^t$, and $C_{EQ4}^t$ represent the four best concentration vectors found so far. When the optimization search begins, there is no prior information about the algorithm's equilibrium state or final point of convergence, so these vectors are treated as approximations of the equilibrium state. The term $F_i^t$ is used to balance exploration and exploitation (E&E) and is given mathematically by:
$$F_i^t = \exp\left(-\lambda_i^t (T - T_0)\right) \quad (20)$$
$$T = \left(1 - \frac{t}{t_{max}}\right)^{a_2 \times \frac{t}{t_{max}}} \quad (21)$$
$$T_0 = T + \frac{1}{\lambda_i^t} \ln\left(-a_1\, sign(r - 0.5)\left(1 - e^{-\lambda_i^t T}\right)\right) \quad (22)$$
where $t$ and $t_{max}$ indicate the current iteration and the maximum number of iterations, respectively. The strength of the exploration search is set by the value of $a_1$: a higher value of $a_1$ stimulates more exploration, and vice versa. The factor $sign(r - 0.5)$ controls the direction of E&E, where $r$ is a random number distributed uniformly between 0 and 1. The parameters $a_1$ and $a_2$ are assigned the fixed values 2 and 1, respectively, for the optimal model. Substituting Equations (21) and (22) into Equation (20), the final expression of $F_i^t$ is obtained:
$$F_i^t = a_1\, sign(r - 0.5)\left(e^{-\lambda_i^t T} - 1\right) \quad (23)$$
The generation rate $G_i^t$ aids the exploitation of the search space and is calculated using Equation (24):
$$G_i^t = G_0^t \times \exp\left(-\lambda_i^t (T - T_0)\right) \quad (24)$$
$$G_0^t = P_i^t \times \left(C_{EQ}^t - \lambda_i^t C_i^t\right) \quad (25)$$
$$P_i^t = \begin{cases} 0.5\, r_1 & \text{if } r_2 \geq g_p^t \\ 0 & \text{otherwise} \end{cases} \quad (26)$$
where $r_1$ and $r_2$ are two random numbers in the range (0, 1). $P_i^t$ symbolizes the generation-rate control parameter: it determines whether the generation term contributes to the update of a particular particle, with the probability of this contribution specified by the generation probability ($g_p^t$).
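A compact NumPy sketch of the EO loop follows, implementing Equations (17)–(26) with a four-member equilibrium pool as described above; the toy objective, population size, generation probability $g_p = 0.5$, unit volume $V = 1$, and the greedy survivor selection are simplifying assumptions for illustration, not the authors' exact implementation.

```python
import numpy as np

rng = np.random.default_rng(2)

def sphere(x):
    # Toy objective to minimize
    return np.sum(x ** 2, axis=1)

n, dim, t_max = 30, 5, 100
a1, a2, gp, V = 2.0, 1.0, 0.5, 1.0
c_min, c_max = -1.0, 1.0

C = c_min + rng.random((n, dim)) * (c_max - c_min)      # Equation (17)
fit = sphere(C)

for t in range(t_max):
    pool = C[np.argsort(fit)[:4]]                       # equilibrium pool, Equation (19)

    T = (1 - t / t_max) ** (a2 * t / t_max)             # Equation (21)
    lam = rng.random((n, dim))
    r = rng.random((n, dim))
    F = a1 * np.sign(r - 0.5) * (np.exp(-lam * T) - 1)  # Equation (23)

    r1, r2 = rng.random((n, 1)), rng.random((n, 1))
    P = np.where(r2 >= gp, 0.5 * r1, 0.0)               # Equation (26)

    Ceq = pool[rng.integers(0, 4, size=n)]              # random draw from the pool
    G0 = P * (Ceq - lam * C)                            # Equation (25)
    G = G0 * F                                          # generation rate, cf. Equation (24)

    # Concentration update, Equation (18), with V = 1
    C_new = np.clip(Ceq + (C - Ceq) * F + (1 - F) * G / (lam * V), c_min, c_max)

    new_fit = sphere(C_new)
    better = new_fit < fit                              # greedy selection (simplification)
    C[better], fit[better] = C_new[better], new_fit[better]

print("best value:", fit.min())
```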

2.5. Regression Optimization

Regression optimization is a method of integrating regression techniques and optimization approaches in a single framework. To obtain accurate results, regression models have various tunable parameters, such as weights, biases, the number of neurons in the intermediate hidden layer, and other linear and nonlinear parameters, all of which must be tuned using a robust and reliable optimization procedure. The parameters to be optimized in ELM models are the input weights, output weights, and hidden biases. In standard ELM, the input weights and hidden biases are set by random initialization and the output weights follow from the pseudo-inverse matrix; however, optimization techniques such as PSO and EO can improve performance further. It is worth mentioning that the initial weights and biases may take non-optimal values, resulting in poor performance, and that ELM requires a large number of hidden layer nodes to achieve the expected result, which might lead to overfitting. The ELM parameters are therefore optimized using PSO and EO in this study. In both hybrid models, the fitness function is the root mean square error (RMSE), and the candidate solution encodes the weights and biases of the hidden layer. Before the learning parameters are optimized, the optimization algorithm is set up: the population size, maximum iteration count, lower and upper bounds, and the number of hidden layer neurons of the ELM are all variables to consider.
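As an illustration of this coupling, the sketch below decodes an optimizer's candidate vector into ELM input weights and hidden biases, computes the output weights by the pseudo-inverse as usual, and returns the training RMSE as the fitness; such a function could be handed to the PSO or EO sketches above. The encoding covers the input-side parameters only (150 values for $m$ = 5 and $N_h$ = 25); the paper's count of 176 optimized parameters in Section 5.1 additionally includes the output weights and bias.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def elm_fitness(theta, X, t, n_hidden):
    """RMSE of an ELM whose input weights and hidden biases are encoded in theta.

    theta holds m * n_hidden input weights followed by n_hidden hidden biases.
    X is the (normalized) training input matrix, t the training targets.
    """
    m = X.shape[1]
    W = theta[: m * n_hidden].reshape(m, n_hidden)   # input weights
    b = theta[m * n_hidden :].reshape(1, n_hidden)   # hidden biases
    H = sigmoid(X @ W + b)
    beta = np.linalg.pinv(H) @ t                     # output weights, one pass
    y_hat = H @ beta
    return np.sqrt(np.mean((t - y_hat) ** 2))        # cost function = RMSE

# Usage: minimize elm_fitness over theta in [-1, 1]^(m*n_hidden + n_hidden)
# with an optimizer such as the PSO or EO loops sketched in Sections 2.3-2.4.
```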

3. Details of Dataset

This study’s dataset was compiled from experiments conducted in several laboratories in the literature [21]. These data have five input parameters and one output parameter. The input parameters are the width of the foundation (B), depth of the foundation (D), length to width ratio (L/B), density (γ), and angle of friction (φ). The output parameter is the bearing capacity of the foundation (Qu). The descriptive statistics of the data are given in Table 1. The histogram with Pearson correlation matrix is shown in Figure 1. As can be seen, the sample variances are scattered in the range of 0, 0.25 to 8860.48, which indicates that the present dataset has a wide range of input and output parameters. In addition, the values of standard error (scattered in the range of 0.01 to 13.18) confirm that the present database consists of a wide range of variables and is hence useful for soft computing modeling.

4. Research Methodology

The first step in the research methodology is data normalization, i.e., scaling the data to the range 0 to 1 using the min-max method to bring all the predictor variables into the same range and reduce errors. The goal of data normalization is to achieve stable convergence of the weights and biases in the ML models. In the next step, the normalized data are divided into a training subset (70% of the data) used to train the models and a testing subset (30% of the data) used to validate the trained models [56]. The model learns from the correlation between the input and output variables. The performance of each model was checked using statistical performance parameters; based on the cost function, several iterations were carried out, and the best performing model was selected using rank analysis. The following 14 performance parameters were used in the present paper to evaluate the simulation models: weighted mean absolute percentage error (WMAPE), root mean square error (RMSE), variance account factor (VAF), coefficient of determination (R2), adjusted coefficient of determination (Adj. R2), Nash–Sutcliffe efficiency (NS), performance index (PI), ratio of the root mean square error to the standard deviation of the observations (RSR), normalized mean bias error (NMBE), bias factor, Willmott's index of agreement (WI), mean absolute error (MAE), mean bias error (MBE), and Legates and McCabe's index (LMI) [57,58,59,60,61,62]:
$$WMAPE = \frac{\sum_{i=1}^{N}\left|\frac{d_i - y_i}{d_i}\right| \times d_i}{\sum_{i=1}^{N} d_i} \quad (27)$$
$$RMSE = \sqrt{\frac{1}{N}\sum_{i=1}^{N}\left(d_i - y_i\right)^2} \quad (28)$$
$$VAF = \left(1 - \frac{\mathrm{var}(d_i - y_i)}{\mathrm{var}(d_i)}\right) \times 100 \quad (29)$$
$$R^2 = \frac{\sum_{i=1}^{N}(d_i - d_{mean})^2 - \sum_{i=1}^{N}(d_i - y_i)^2}{\sum_{i=1}^{N}(d_i - d_{mean})^2} \quad (30)$$
$$Adj.\,R^2 = 1 - \frac{n - 1}{n - p - 1}\left(1 - R^2\right) \quad (31)$$
$$PI = Adj.\,R^2 + 0.01 \times VAF - RMSE \quad (32)$$
$$NS = 1 - \frac{\sum_{i=1}^{N}(d_i - y_i)^2}{\sum_{i=1}^{N}(d_i - d_{mean})^2} \quad (33)$$
$$RSR = \frac{RMSE}{\sqrt{\frac{1}{N}\sum_{i=1}^{N}(d_i - d_{mean})^2}} \quad (34)$$
$$Bias\ factor = \frac{1}{N}\sum_{i=1}^{N}\frac{y_i}{d_i} \quad (35)$$
$$NMBE\,(\%) = \frac{\frac{1}{N}\sum_{i=1}^{N}(y_i - d_i)}{\frac{1}{N}\sum_{i=1}^{N} d_i} \times 100 \quad (36)$$
$$WI = 1 - \frac{\sum_{i=1}^{N}(d_i - y_i)^2}{\sum_{i=1}^{N}\left(\left|y_i - d_{mean}\right| + \left|d_i - d_{mean}\right|\right)^2} \quad (37)$$
$$MAE = \frac{1}{N}\sum_{i=1}^{N}\left|y_i - d_i\right| \quad (38)$$
$$MBE = \frac{1}{N}\sum_{i=1}^{N}\left(y_i - d_i\right) \quad (39)$$
$$LMI = 1 - \frac{\sum_{i=1}^{N}\left|d_i - y_i\right|}{\sum_{i=1}^{N}\left|d_i - d_{mean}\right|}, \quad 0 < LMI \leq 1 \quad (40)$$
where $d_i$ is the observed $i$th value, $y_i$ is the predicted $i$th value, $d_{mean}$ is the mean of the observed values, $N$ (or $n$) is the number of data samples, and $p$ in Equation (31) is the number of input variables. Note that, for an ideal model, the values of these indices should equal their ideal values, the details of which are presented in Table 5.
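For illustration, a few of these indices can be computed directly from the definitions above; the minimal NumPy sketch below uses the same notation ($d$ = observed, $y$ = predicted) with toy values.

```python
import numpy as np

def metrics(d, y):
    """A few of the performance indices of Equations (27)-(40)."""
    d, y = np.asarray(d, float), np.asarray(y, float)
    rmse = np.sqrt(np.mean((d - y) ** 2))                       # Equation (28)
    ss_tot = np.sum((d - d.mean()) ** 2)
    r2 = (ss_tot - np.sum((d - y) ** 2)) / ss_tot               # Equation (30)
    ns = 1.0 - np.sum((d - y) ** 2) / ss_tot                    # Equation (33)
    wmape = np.sum(np.abs((d - y) / d) * d) / np.sum(d)         # Equation (27)
    bias = np.mean(y / d)                                       # Equation (35)
    return {"RMSE": rmse, "R2": r2, "NS": ns, "WMAPE": wmape, "Bias": bias}

# Toy observed/predicted values
print(metrics([0.2, 0.5, 0.9], [0.22, 0.48, 0.88]))
```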

5. Results and Discussion

5.1. Configuration of the Models

The MARS model was developed in MATLAB. The model started with 10 basis functions, and the final best performing MARS model retained nine basis functions after the pruning phase. The equations of the basis functions are shown in Table 3; the model is piecewise-cubic, but the basis functions are presented in piecewise-linear form. The final output equation is provided in Equation (41):
$$Q = 0.169 + 0.628\,BF1 - 0.233\,BF2 + 0.344\,BF3 - 0.612\,BF4 + 0.073\,BF5 - 0.116\,BF6 + 0.535\,BF7 + 0.149\,BF8 - 0.108\,BF9 \quad (41)$$
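For readers who wish to apply the MARS expression directly, the sketch below transcribes the basis functions of Table 3 and the coefficients of Equation (41) into Python. Note that the inputs and the output are in min-max-normalized (0–1) form, so predictions must be rescaled against the ranges of Table 1; L/B does not appear in the basis functions of Table 3 but is kept in the signature for a uniform interface.

```python
import numpy as np

def mars_q(B, D, LB, gamma, phi):
    """Normalized bearing capacity from Equation (41), using the
    basis functions of Table 3 (all arguments normalized to 0-1).
    LB (L/B) is unused by the final pruned model."""
    h = lambda v: np.maximum(0.0, v)   # hinge function: max(0, .)
    bf1 = h(phi - 0.352)
    bf2 = h(0.352 - phi)
    bf3 = bf1 * h(D - 0.380)
    bf4 = bf1 * h(0.380 - D)
    bf5 = h(B - 0.379)
    bf6 = h(0.379 - B)
    bf7 = bf5 * h(gamma - 0.57)
    bf8 = h(D - 0.53)
    bf9 = h(0.53 - D)
    return (0.169 + 0.628 * bf1 - 0.233 * bf2 + 0.344 * bf3 - 0.612 * bf4
            + 0.073 * bf5 - 0.116 * bf6 + 0.535 * bf7 + 0.149 * bf8 - 0.108 * bf9)

# Example with arbitrary normalized inputs
print(mars_q(B=0.5, D=0.4, LB=0.2, gamma=0.6, phi=0.7))
```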
The ELM model is configured with 25 hidden neurons, five input neurons, and one output neuron, and RMSE is used as the cost function. To optimize the learning parameters, the PSO and EO optimization techniques are used in the hybrid ELM models. The optimization algorithms (OAs) are initialized with the population size ($n_s$), the maximum number of iterations ($k$), the lower bound ($l_b$), the upper bound ($u_b$), and other parameters, besides the number of hidden neurons ($N_h$) of the ELMs. Then, using the training dataset, the OAs optimize the weights and biases of the ELMs, with RMSE serving as the cost function for estimating the optimal values. It is worth noting that, while the values of $n_s$, $k$, $l_b$, and $u_b$ were kept constant throughout the optimization process, the optimal values of the learning parameters differed in each case.
The optimum number of hidden neurons was found by trial and error, varying $N_h$ over the range of 10 to 30 on the same training dataset and taking the configuration with the best performance for the simulation; the most suitable value obtained was 25 for ELM, ELM-PSO, and ELM-EO. The values of the other parameters were set as $n_s$ = 50, $k$ = 200, $l_b$ = −1, and $u_b$ = +1; the total number of optimized weights and biases is therefore 176 (5 × 25 + 25 + 25 + 1). The detailed configuration of the developed ELM and optimized ELM models is presented in Table 4.
The actual vs. predicted graph is presented in Figure 2. The straight line inclined at 45 degrees represents the perfect-fit model, for which the actual value equals the predicted value. As is evident from Figure 2, ELM-EO and ELM-PSO are the best-fit models, as their data points lie closest to the 'actual = predicted' line. The simulations of MARS and ELM are also satisfactory, with data points scattered around the straight line, though less closely than in the hybrid ELM models.

5.2. Performance Parameters

R2 is a statistical performance parameter for curve-fitting problems. It measures how well the predictions reproduce the observed data; the ideal value is 1, which represents a perfect fit, and a coefficient of 0.80 means that the model explains 80 percent of the variance in the observed data. Adj. R2 is an improvement over R2 that adjusts for the number of independent parameters. RMSE is the cost function used to analyze the accuracy of the model; it measures the magnitude of the error over all the data points, and the lower the RMSE, the better the fit, with an ideal value of 0. WMAPE is a statistical measure of the accuracy of the simulation model; it improves on the mean absolute percentage error by weighting the errors. NS is one minus the ratio of the residual error variance (noise) to the measured variance in the observed data; its ideal value is 1, and values well below 1 (in particular, negative values) indicate unacceptable performance. VAF is the variance accounted for between the original and predicted values of the regression models; perfect models have 100% VAF. NMBE normalizes the MBE by dividing it by the mean of the observed values. A bias factor of 1 means balanced prediction, a value greater than 1 means overprediction, and a value less than 1 means underprediction. The ideal values of the performance parameters are given in Table 5.
It can be inferred from Table 6 that the output performance parameter values for all the models are close to the ideal values for both the training and testing phases. The R2 and RMSE for ELM-EO are 0.999 and 0 for training and 0.995 and 0.01 for testing, respectively. Further, the comparison between the performances of the models is made using rank analysis, which is elaborated on in the next section. The values of performance parameters for ELM are comparatively less satisfactory (R2 = 0.94 and RMSE = 0.0558 in testing).

5.3. Rank Analysis

Rank analysis is the most straightforward and widely used method for determining the effectiveness of developed models and comparing their robustness. In this study, scores are assigned on the basis of the statistical parameters, with their ideal values serving as the benchmark. The maximum attainable score equals the number of models compared: the best performing model for a given parameter receives the greatest score, and vice versa, and two models with identical outcomes may receive the same score. The overall score of a model is calculated by adding the score values of the training phase and the testing phase. The equation used to calculate the total score is given as
$$Total\ score = \sum_{i=1}^{m} X_i + \sum_{j=1}^{n} X_j \quad (42)$$
where Xi and Xj are the scores of the performance indicators for the training and testing phase, respectively. The number of performance indicators in the training and testing phase is represented by m and n, respectively.
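As an illustration of the scoring scheme, the sketch below ranks the models on two of the testing-phase parameters of Table 6 by their distance from the ideal values of Table 5 (ties are ignored for brevity) and accumulates the scores.

```python
import numpy as np

models = ["ELM", "ELM-EO", "ELM-PSO", "MARS"]
# (parameter, ideal value, per-model testing values from Table 6)
params = [
    ("RMSE", 0.0, [0.0558, 0.0170, 0.0186, 0.0199]),
    ("R2",   1.0, [0.9425, 0.9945, 0.9932, 0.9954]),
]

scores = np.zeros(len(models), dtype=int)
for name, ideal, values in params:
    dist = np.abs(np.array(values) - ideal)       # distance from the ideal value
    order = np.argsort(-dist)                     # worst model first
    for score, idx in enumerate(order, start=1):  # best model gets the highest score
        scores[idx] += score

for model, score in zip(models, scores):
    print(f"{model}: {score}")
```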
As presented in Table 7, the score attained by ELM-EO is the highest in the training phase (52), followed by ELM-PSO (42) and MARS (30). ELM attains the lowest score in both the testing and training phases (18 each). In the testing phase, too, ELM-EO outperforms all the models (49), closely followed by ELM-PSO and MARS (38 and 37, respectively). The total rank value attained by ELM-EO over both phases is 101 (52 + 49), far ahead of ELM-PSO with 80 (42 + 38) and MARS with 67 (30 + 37). It can be inferred from the rank values that hybridization has a significant impact and greatly enhances the efficiency of ELM: while ELM lags behind MARS in both phases, the hybrid ELM models perform better than MARS. EO is more effective than PSO in enhancing the performance of ELM, and ELM-EO is the clear winner among the applied models.

5.4. Error Matrix

The error matrix is a tool for displaying the accuracy of a model. Figure 3 depicts the amount of error associated with the applied models for the various performance parameters [31]. In this study, the error values for the indices R2, Adj. R2, and RMSE lie in the ranges of 0% to 1%, 0% to 6%, and 0% to 6%, respectively, which is very satisfactory. Similarly, the error values for the remaining indices, including WMAPE, NS, VAF, PI, Bias, WI, MAE, MBE, and LMI, are obtained in the ranges of 0% to 8%, 0% to 6%, 0% to 6%, 0% to 9%, 3% to 14%, 3% to 14%, 0% to 2%, 0% to 4%, 0% to 1%, and 0% to 22% across the training and testing datasets. It is evident from Figure 3 that the ELM-EO model achieves the least error of all the models for all the performance parameters in both the training and testing phases; the error for all the parameters is close to 0 in both phases, except NMBE in testing. Thus, ELM-EO can be concluded from the error matrix to be the most accurate and robust model. The error percentage for ELM is the highest for all the parameters (6% for R2 and RMSE, compared with 0% and 2%, respectively, for MARS and 1% and 2%, respectively, for both ELM-EO and ELM-PSO) in both phases, and ELM is thus concluded to possess the least accuracy among the applied models.

5.5. Sensitivity Analysis

In general, sensitivity analysis (SA) is a technique used to determine how changes in the input parameters affect the response of the proposed models. This helps identify the input parameters with the greatest influence on the result. The cosine amplitude method [63] is used in this work to calculate the strength of the influence of the inputs on the response, i.e., the bearing capacity of the shallow foundation. The data pairings in this study are represented in a data array $X$ as follows:
$$X = \{x_1, x_2, x_3, \ldots, x_i, \ldots, x_n\} \quad (43)$$
where each variable $x_i$ in $X$ is a vector of length $m$:
$$x_i = \{x_{i1}, x_{i2}, x_{i3}, \ldots, x_{im}\} \quad (44)$$
The strength of the relation ($R_{ij}$) between the datasets $x_i$ and $x_j$ is given by:
$$R_{ij} = \frac{\sum_{k=1}^{m} x_{ik} x_{jk}}{\sqrt{\sum_{k=1}^{m} x_{ik}^2 \sum_{k=1}^{m} x_{jk}^2}} \quad (45)$$
The pie chart of $R_{ij}$ in Figure 4 shows the relation between the bearing capacity of the soil and the input parameters. The SA reveals that φ has the greatest influence on the bearing capacity, with a strength value of 0.93, followed by B with a strength value of 0.92. The parameters γ and D have a strength of 0.91, whereas L/B has the least effect on the capacity, i.e., 0.72. It can be concluded that all five parameters have a strong influence on the bearing capacity and hence are justifiably considered in predicting the output.
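A direct NumPy transcription of Equation (45) is given below, computing the strength of relation between each input column and the output vector; the random data stand in for the normalized dataset.

```python
import numpy as np

def cosine_amplitude(x_i, x_j):
    # Strength of relation R_ij, Equation (45)
    num = np.sum(x_i * x_j)
    den = np.sqrt(np.sum(x_i ** 2) * np.sum(x_j ** 2))
    return num / den

# Illustrative normalized dataset: columns B, D, L/B, gamma, phi; output Qu
rng = np.random.default_rng(3)
X = rng.random((50, 5))
qu = rng.random(50)

names = ["B", "D", "L/B", "gamma", "phi"]
for k, name in enumerate(names):
    print(f"R({name}, Qu) = {cosine_amplitude(X[:, k], qu):.3f}")
```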

5.6. REC Curves

The regression error characteristic (REC) curve plots the error tolerance against the percentage of points predicted within that tolerance. The error tolerance and the accuracy of the regression function are represented by the x and y axes, respectively. The prediction error is approximated by the area over the REC curve (AOC); the lower the AOC, the better the model's performance. As a result, REC curves allow a quick and accurate visual assessment of model performance.
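A minimal sketch of the REC construction follows: the absolute errors are sorted, each is treated as a tolerance, the fraction of points within it gives the accuracy, and the AOC is approximated by numerical integration; the data are illustrative.

```python
import numpy as np

def rec_curve(d, y):
    """Tolerances and accuracies for a REC curve; AOC via the trapezoidal rule."""
    errors = np.sort(np.abs(np.asarray(d) - np.asarray(y)))
    tol = np.concatenate(([0.0], errors))            # tolerance grid (x axis)
    acc = np.arange(len(tol)) / len(errors)          # fraction within tolerance (y axis)
    aoc = tol[-1] - np.trapz(acc, tol)               # area over the curve
    return tol, acc, aoc

d = np.array([0.10, 0.30, 0.55, 0.80, 0.95])         # observed (toy values)
y = np.array([0.12, 0.28, 0.60, 0.78, 0.90])         # predicted (toy values)
tol, acc, aoc = rec_curve(d, y)
print(f"AOC = {aoc:.4f}")
```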
The REC curves for the models in both phases are plotted in Figure 5. It can be concluded from visual interpretation alone that ELM is the least accurate model in terms of prediction. The other models are very close to one another, and the AOC values, also given in Figure 5, must be examined to compare their performance. In the training phase, the lines for ELM-PSO and ELM-EO almost overlap (green and yellow lines), and their AOC values are close to each other (0.0045 and 0.0057, respectively); thus, the performances of ELM-PSO and ELM-EO are nearly identical in the training phase. In the testing phase, the AOC value of ELM-EO is clearly better than that of ELM-PSO (0.0122 versus 0.0142) and, of course, far better than those of MARS (0.0166) and ELM (0.0235).

6. Conclusions

This paper presents AI-based models for predicting the bearing capacity of shallow foundations as an alternative to traditional methods, which suffer from several practical and performance-based drawbacks. ELM, MARS, ELM-PSO, and ELM-EO models were trained and validated on the experimental data, and all the models were found to perform well against the various performance parameters used. In the rank analysis, error matrix, and REC curves, ELM-EO is concluded to outperform the other models (R2 = 1, RMSE = 0.004, and AOC = 0.0057 in the training phase; R2 = 0.9945, RMSE = 0.017, and AOC = 0.0122 in the testing phase). ELM-EO and ELM-PSO have nearly identical performance in the training phase (R2 close to 1); however, ELM-EO is the best model in the testing phase (R2 = 0.995 for ELM-EO versus 0.993 for ELM-PSO), and it is noteworthy that testing performance is the most important indicator of the robustness of a model. The final rank values of ELM, ELM-PSO, and ELM-EO are 36, 80, and 101, respectively. R2 for MARS is 0.995 in both the training and testing phases; hybridization is thus concluded to enhance the performance of ELM substantially, and further hybridization with other optimization techniques should be encouraged in future research. The unique advantages of the proposed ELM-EO model are its higher prediction accuracy, ease of implementation with existing datasets, and high generalization capability. On the other hand, the predictive expression of MARS can be used as a user-friendly equation to determine the bearing capacity of the foundation. The hybrid ELM models can be extended to other engineering applications once the corresponding databases are created. Sensitivity analysis was conducted to assess the impact of the input parameters on the output: all the input parameters were found to have a significant impact, with the friction angle and the L/B ratio having the highest and lowest impact, respectively.

Author Contributions

Conceptualization, M.K., V.K., R.B. and A.M.Y.; data curation, M.K., V.K. and A.M.Y.; funding acquisition, M.R.K.; methodology, M.K., V.K., R.B., P.S. and A.M.Y.; supervision, P.S.; visualization, M.R.K., M.A. and P.S.; writing—original draft, M.K., V.K., R.B. and A.M.Y.; writing—review and editing, M.K., V.K., R.B., M.R.K., M.A. and A.M.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF), funded by the Ministry of Science, ICT & Future Planning, Republic of Korea (2019R1I1A1A01062202).

Data Availability Statement

All the data used in the study are properly reported within the text.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  1. Bowles, J.E. Foundation Analysis and Design, 4th ed.; McGraw-Hill Education: New York, NY, USA, 1988; p. 1004. [Google Scholar]
  2. Cerato, A.B.; Lutenegger, A.J. Scale Effects of Shallow Foundation Bearing Capacity on Granular Material. J. Geotech. Geoenviron. Eng. 2007, 133, 1192–1202. [Google Scholar] [CrossRef]
  3. Fukushima, H.I.; Nishimoto, S.; Tomisawa, K. Scale Effect of Spread Foundation Loading Tests Using Various Size Plates; Independent Administrative Institution Civil Engineering Research Institute for Cold Region: Hokkaido, Japan, 2005. [Google Scholar]
  4. Terzaghi, K. Theoretical Soil Mechanics; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 1943. [Google Scholar]
  5. Meyerhof, G.G. Some Recent Research on the Bearing Capacity of Foundations. Can. Geotech. J. 1963, 1, 16–26. [Google Scholar] [CrossRef]
  6. Vesic, A.S. Analysis of Ultimate Loads of Shallow Foundations. ASCE J. Soil Mech. Found. Div. 1973, 99, 45–73. [Google Scholar] [CrossRef]
  7. Aksoy, H.S.; Gör, M.; İnal, E. A New Design Chart for Estimating Friction Angle between Soil and Pile Materials. Geomech. Eng. 2016, 10, 315. [Google Scholar] [CrossRef]
  8. Kalinli, A.; Acar, M.C.; Gündüz, Z. New Approaches to Determine the Ultimate Bearing Capacity of Shallow Foundations Based on Artificial Neural Networks and Ant Colony Optimization. Eng. Geol. 2011, 117, 29–38. [Google Scholar] [CrossRef]
  9. Momeni, E.; Nazir, R.; Jahed Armaghani, D.; Maizir, H. Prediction of Pile Bearing Capacity Using a Hybrid Genetic Algorithm-Based ANN. Meas. J. Int. Meas. Confed. 2014, 57, 122–131. [Google Scholar] [CrossRef]
  10. Kutter, B.L.; Abghari, A.; Cheney, J.A. Strength Parameters for Bearing Capacity of Sand. J. Geotech. Eng. 1988, 114, 491–498. [Google Scholar] [CrossRef]
  11. Van Baars, S. Numerical Check of the Meyerhof Bearing Capacity Equation for Shallow Foundations. Innov. Infrastruct. Solut. 2017, 31, 9. [Google Scholar] [CrossRef]
  12. Rybak, J.; Król, M. Limitations and Risk Related to Static Capacity Testing of Piles-“unfortunate Case” Studies. In Proceedings of the MATEC Web of Conferences, Online, 23 February 2018; Juhásová Šenitková, I., Ed.; EDP Sciences: Les Ulis, France, 2018; Volume 146, p. 02006. [Google Scholar]
  13. Farooq, F.; Ahmed, W.; Akbar, A.; Aslam, F.; Alyousef, R. Predictive Modeling for Sustainable High-Performance Concrete from Industrial Wastes: A Comparison and Optimization of Models Using Ensemble Learners. J. Clean. Prod. 2021, 292, 126032. [Google Scholar] [CrossRef]
  14. Farooq, F.; Czarnecki, S.; Niewiadomski, P.; Aslam, F.; Alabduljabbar, H.; Ostrowski, K.A.; Śliwa-Wieczorek, K.; Nowobilski, T.; Malazdrewicz, S. A Comparative Study for the Prediction of the Compressive Strength of Self-Compacting Concrete Modified with Fly Ash. Materials 2021, 14, 4934. [Google Scholar] [CrossRef]
  15. Javed, M.F.; Amin, M.N.; Shah, M.I.; Khan, K.; Iftikhar, B.; Farooq, F.; Aslam, F.; Alyousef, R.; Alabduljabbar, H. Applications of Gene Expression Programming and Regression Techniques for Estimating Compressive Strength of Bagasse Ash Based Concrete. Crystals 2020, 10, 737. [Google Scholar] [CrossRef]
  16. Khan, M.A.; Farooq, F.; Javed, M.F.; Zafar, A.; Ostrowski, K.A.; Aslam, F.; Malazdrewicz, S.; Maślak, M. Simulation of Depth of Wear of Eco-Friendly Concrete Using Machine Learning Based Computational Approaches. Materials 2021, 15, 58. [Google Scholar] [CrossRef] [PubMed]
  17. Song, H.; Ahmad, A.; Farooq, F.; Ostrowski, K.A.; Maślak, M.; Czarnecki, S.; Aslam, F. Predicting the Compressive Strength of Concrete with Fly Ash Admixture Using Machine Learning Algorithms. Constr. Build. Mater. 2021, 308, 125021. [Google Scholar] [CrossRef]
  18. Ray, R.; Kumar, D.; Samui, P.; Roy, L.B.; Goh, A.T.C.; Zhang, W. Application of Soft Computing Techniques for Shallow Foundation Reliability in Geotechnical Engineering. Geosci. Front. 2021, 12, 375–383. [Google Scholar] [CrossRef]
  19. Debnath, S.; Sultana, P. Prediction of Settlement of Shallow Foundation on Cohesionless Soil Using Artificial Neural Network. In Proceedings of the 7th Indian Young Geotechnical Engineers Conference; Springer: Singapore, 2022; pp. 477–486. [Google Scholar] [CrossRef]
  20. Samui, P. Application of Statistical Learning Algorithms to Ultimate Bearing Capacity of Shallow Foundation on Cohesionless Soil. Int. J. Numer. Anal. Methods Geomech. 2012, 36, 100–110. [Google Scholar] [CrossRef]
  21. Bagińska, M.; Srokosz, P.E. The Optimal ANN Model for Predicting Bearing Capacity of Shallow Foundations Trained on Scarce Data. KSCE J. Civ. Eng. 2018, 23, 130–137. [Google Scholar] [CrossRef] [Green Version]
  22. Padmini, D.; Ilamparuthi, K.; Sudheer, K.P. Ultimate Bearing Capacity Prediction of Shallow Foundations on Cohesionless Soils Using Neurofuzzy Models. Comput. Geotech. 2008, 35, 33–46. [Google Scholar] [CrossRef]
  23. Ahmad, M.; Ahmad, F.; Wróblewski, P.; Al-Mansob, R.A.; Olczak, P.; Kamiński, P.; Safdar, M.; Rai, P. Prediction of Ultimate Bearing Capacity of Shallow Foundations on Cohesionless Soils: A Gaussian Process Regression Approach. Appl. Sci. 2021, 11, 10317. [Google Scholar] [CrossRef]
  24. Huang, G.-B.; Zhu, Q.-Y.; Siew, C.-K. Extreme Learning Machine: A New Learning Scheme of Feedforward Neural Networks. In Proceedings of the 2004 IEEE International Joint Conference on Neural Networks, Budapest, Hungary, 25–29 July 2004. [Google Scholar] [CrossRef]
  25. Liu, Z.; Shao, J.; Xu, W.; Chen, H.; Zhang, Y. An Extreme Learning Machine Approach for Slope Stability Evaluation and Prediction. Nat. Hazards 2014, 73, 787–804. [Google Scholar] [CrossRef]
  26. Samui, P.; Kim, D.; Jagan, J.; Roy, S.S. Determination of Uplift Capacity of Suction Caisson Using Gaussian Process Regression, Minimax Probability Machine Regression and Extreme Learning Machine. Iran. J. Sci. Technol. Trans. Civ. Eng. 2019, 43, 651–657. [Google Scholar] [CrossRef]
  27. Samui, P. Application of Artificial Intelligence in Geo-Engineering. In Springer Series in Geomechanics and Geoengineering; Springer: Amsterdam, The Netherlands, 2019; pp. 30–44. [Google Scholar] [CrossRef]
  28. Ghani, S.; Kumari, S.; Choudhary, A.K.; Jha, J.N. Experimental and Computational Response of Strip Footing Resting on Prestressed Geotextile-Reinforced Industrial Waste. Innov. Infrastruct. Solut. 2021, 62, 98. [Google Scholar] [CrossRef]
  29. Kang, F.; Li, J.-S.; Wang, Y.; Li, J. Extreme Learning Machine-Based Surrogate Model for Analyzing System Reliability of Soil Slopes. Eur. J. Environ. Civ. Eng. 2017, 21, 1341–1362. [Google Scholar] [CrossRef]
  30. Khaleel, F.; Hameed, M.M.; Khaleel, D.; AlOmar, M.K. Applying an Efficient AI Approach for the Prediction of Bearing Capacity of Shallow Foundations; Springer: Cham, Switzerland, 2022; pp. 310–323. [Google Scholar] [CrossRef]
  31. Kardani, N.; Bardhan, A.; Samui, P.; Nazem, M.; Zhou, A.; Armaghani, D.J. A Novel Technique Based on the Improved Firefly Algorithm Coupled with Extreme Learning Machine (ELM-IFF) for Predicting the Thermal Conductivity of Soil. Eng. Comput. 2021, 1–20. [Google Scholar] [CrossRef]
  32. Bardhan, A.; GuhaRay, A.; Gupta, S.; Pradhan, B.; Gokceoglu, C. A Novel Integrated Approach of ELM and Modified Equilibrium Optimizer for Predicting Soil Compression Index of Subgrade Layer of Dedicated Freight Corridor. Transp. Geotech. 2022, 32, 100678. [Google Scholar] [CrossRef]
  33. Kardani, N.; Bardhan, A.; Roy, B.; Samui, P.; Nazem, M.; Armaghani, D.J.; Zhou, A. A Novel Improved Harris Hawks Optimization Algorithm Coupled with ELM for Predicting Permeability of Tight Carbonates. Eng. Comput. 2021, 1–24. [Google Scholar] [CrossRef]
  34. Gör, M. Analyzing the Bearing Capacity of Shallow Foundations on Two-Layered Soil Using Two Novel Cosmology-Based Optimization Techniques. Smart Struct. Syst. 2022, 29, 513. [Google Scholar] [CrossRef]
  35. Moayedi, H.; Gör, M.; Kok Foong, L.; Bahiraei, M. Imperialist Competitive Algorithm Hybridized with Multilayer Perceptron to Predict the Load-Settlement of Square Footing on Layered Soils. Measurement 2021, 172, 108837. [Google Scholar] [CrossRef]
  36. Jing, Z. Study on Deformation Law of Foundation Pit by Multifractal Detrended Fluctuation Analysis and Extreme Learning Machine Improved by Particle Swarm Optimization. J. Yangtze River Sci. Res. Inst. 2019, 36, 53. [Google Scholar] [CrossRef]
  37. Li, W.; Li, B.; Guo, H.; Fang, Y.; Qiao, F.; Zhou, S. The Ecg Signal Classification Based on Ensemble Learning of Pso-Elm Algorithm. Neural Netw. World 2020, 30, 265–279. [Google Scholar] [CrossRef]
  38. Zeng, J.; Roy, B.; Kumar, D.; Mohammed, A.S.; Armaghani, D.J.; Zhou, J.; Mohamad, E.T. Proposing Several Hybrid PSO-Extreme Learning Machine Techniques to Predict TBM Performance. Eng. Comput. 2021, 1, 1–17. [Google Scholar] [CrossRef]
  39. Chen, F.; Sun, X.; Wei, D.; Tang, Y. Tradeoff Strategy between Exploration and Exploitation for PSO. Proc. 2011 7th Int. Conf. Nat. Comput. ICNC 2011, 3, 1216–1222. [Google Scholar] [CrossRef]
  40. Grimaldi, E.A.; Grimaccia, F.; Mussetta, M.; Zich, R.E. PSO as an Effective Learning Algorithm for Neural Network Applications. In Proceedings of the ICCEA 2004. 2004 3rd International Conference on Computational Electromagnetics and its Applications, Beijing, China, 1–4 November 2004; pp. 557–560. [Google Scholar] [CrossRef]
  41. Askarzadeh, A.; Rezazadeh, A. Artificial Bee Swarm Optimization Algorithm for Parameters Identification of Solar Cell Models. Appl. Energy 2013, 102, 943–949. [Google Scholar] [CrossRef]
  42. Faramarzi, A.; Heidarinejad, M.; Stephens, B.; Mirjalili, S. Equilibrium Optimizer: A Novel Optimization Algorithm. Knowl.-Based Syst. 2020, 191, 105190. [Google Scholar] [CrossRef]
  43. Kardani, N.; Bardhan, A.; Gupta, S.; Samui, P.; Nazem, M.; Zhang, Y.; Zhou, A. Predicting Permeability of Tight Carbonates Using a Hybrid Machine Learning Approach of Modified Equilibrium Optimizer and Extreme Learning Machine. Acta Geotech. 2021, 17, 1239–1255. [Google Scholar] [CrossRef]
  44. Samui, P. Determination of Ultimate Capacity of Driven Piles in Cohesionless Soil: A Multivariate Adaptive Regression Spline Approach. Int. J. Numer. Anal. Methods Geomech. 2012, 36, 1434–1439. [Google Scholar] [CrossRef]
  45. Zhang, W.; Wu, C. Machine Learning Predictive Models for Pile Drivability: An Evaluation of Random Forest Regression and Multivariate Adaptive Regression Splines. In Springer Series in Geomechanics and Geoengineering; Springer: Cham, Switzerland, 2020. [Google Scholar]
  46. Samui, P.; Kim, D. Least Square Support Vector Machine and Multivariate Adaptive Regression Spline for Modeling Lateral Load Capacity of Piles. Neural Comput. Appl. 2013, 23, 1123–1127. [Google Scholar] [CrossRef]
  47. Luat, N.V.; Nguyen, V.Q.; Lee, S.; Woo, S.; Lee, K. An Evolutionary Hybrid Optimization of MARS Model in Predicting Settlement of Shallow Foundations on Sandy Soils. Geomech. Eng. 2020, 21, 583–598. [Google Scholar] [CrossRef]
  48. Dong, J.; Zhu, Y.; Jia, X.; Shao, M.; Han, X.; Qiao, J.; Bai, C.; Tang, X. Nation-Scale Reference Evapotranspiration Estimation by Using Deep Learning and Classical Machine Learning Models in China. J. Hydrol. 2022, 604, 127207. [Google Scholar] [CrossRef]
  49. Rahgoshay, M.; Feiznia, S.; Arian, M.; Hashemi, S.A.A. Simulation of Daily Suspended Sediment Load Using an Improved Model of Support Vector Machine and Genetic Algorithms and Particle Swarm. Arab. J. Geosci. 2019, 12, 227. [Google Scholar] [CrossRef]
  50. Zheng, G.; Zhang, W.; Zhou, H.; Yang, P. Multivariate Adaptive Regression Splines Model for Prediction of the Liquefaction-Induced Settlement of Shallow Foundations. Soil Dyn. Earthq. Eng. 2020, 132, 106097. [Google Scholar] [CrossRef]
  51. Friedman, J.H. Multivariate Adaptive Regression Splines. Ann. Stat. 1991, 19, 1–67. [Google Scholar] [CrossRef]
  52. Kumar, V.; Himanshu, N.; Burman, A. Rock Slope Analysis with Nonlinear Hoek–Brown Criterion Incorporating Equivalent Mohr–Coulomb Parameters. Geotech. Geol. Eng. 2019, 37, 4741–4757. [Google Scholar] [CrossRef]
  53. Seifi, A.; Ehteram, M.; Singh, V.P.; Mosavi, A. Modeling and Uncertainty Analysis of Groundwater Level Using Six Evolutionary Optimization Algorithms Hybridized with ANFIS, SVM, and ANN. Sustainability 2020, 12, 4023. [Google Scholar] [CrossRef]
  54. Eberhart, R.C.; Shi, Y. Comparison between Genetic Algorithms and Particle Swarm Optimization. In Evolutionary Programming VII; Poroto Saravanam, W.N., Waagen, D., Eiben, A.E., Eds.; Springer: Berlin/Heidelberg, Germany, 1998; pp. 611–616. [Google Scholar]
  55. Shi, Y.; Eberhart, R.C. Empirical Study of Particle Swarm Optimization. In Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406), Washington, DC, USA, 6–9 July 1999; pp. 1945–1950. [Google Scholar]
  56. Samui, P.; Sitharam, T.G. Site Characterization Model Using Artificial Neural Network and Kriging. Int. J. Geomech. 2010, 10, 171–180. [Google Scholar] [CrossRef]
  57. Legates, D.R.; Mccabe, G.J. A Refined Index of Model Performance: A Rejoinder. Int. J. Climatol. 2013, 33, 1053–1056. [Google Scholar] [CrossRef]
  58. Moriasi, D.N.; Arnold, J.G.; Van Liew, M.W.; Bingner, R.L.; Harmel, R.D.; Veith, T.L. Model Evaluation Guidelines for Systematic Quantification of Accuracy in Watershed Simulations. Trans. ASABE 2007, 50, 885–900. [Google Scholar] [CrossRef]
  59. Willmott, C.J. On the Evaluation of Model Performance in Physical Geography. In Spatial Statistics and Models; Springer: Cham, Switzerland, 1984. [Google Scholar]
  60. Kumar, S.; Rai, B.; Biswas, R.; Samui, P.; Kim, D. Prediction of Rapid Chloride Permeability of Self-Compacting Concrete Using Multivariate Adaptive Regression Spline and Minimax Probability Machine Regression. J. Build. Eng. 2020, 32, 101490. [Google Scholar] [CrossRef]
  61. Biswas, R.; Samui, P.; Rai, B. Determination of Compressive Strength Using Relevance Vector Machine and Emotional Neural Network. Asian J. Civ. Eng. 2019, 20, 1109–1118. [Google Scholar] [CrossRef]
  62. Biswas, R.; Rai, B.; Samui, P.; Roy, S.S. Estimating Concrete Compressive Strength Using MARS, LSSVM and GP. Eng. J. 2020, 24, 41–52. [Google Scholar] [CrossRef]
  63. Biswas, R.; Bardhan, A.; Samui, P.; Rai, B.; Nayak, S.; Armaghani, D.J. Efficient Soft Computing Techniques for the Prediction of Compressive Strength of Geopolymer Concrete. Comput. Concr. 2021, 28, 221–232. [Google Scholar] [CrossRef]
Figure 1. Histogram of the dataset.
Figure 2. Scatter plot of the applied models.
Figure 3. Error matrices for the applied models for (a) training dataset and (b) testing dataset.
Figure 4. Sensitivity analysis using a pie chart.
Figure 5. REC curve plot of the simulated models and respective AOC values.
Table 1. Descriptive statistics of the field data.

| Statistic | B (m) | D (m) | L/B (-) | γ (kN/m³) | φ (°) | Qu (kPa) |
|---|---|---|---|---|---|---|
| Mean | 0.11 | 0.08 | 3.92 | 16.45 | 38.95 | 192.84 |
| Minimum | 0.06 | 0.03 | 1.00 | 15.70 | 34.00 | 58.50 |
| Maximum | 0.15 | 0.15 | 6.00 | 17.10 | 42.50 | 423.60 |
| Standard Error | 0.01 | 0.01 | 0.35 | 0.07 | 0.44 | 13.18 |
| Standard Deviation | 0.04 | 0.04 | 2.47 | 0.50 | 3.11 | 94.13 |
| Sample Variance | 0.00 | 0.00 | 6.09 | 0.25 | 9.67 | 8860.48 |
| Kurtosis | −1.55 | −0.82 | −1.94 | −1.28 | −1.21 | −0.38 |
| Skewness | −0.03 | 0.61 | −0.37 | −0.22 | −0.45 | 0.65 |
| Range | 0.09 | 0.12 | 5.00 | 1.40 | 8.50 | 365.10 |
Table 2. Optimal values of effective parameters of the MARS model.

| Parameters | MARS-L |
|---|---|
| GCV penalty per knot | 0 |
| Cubic modelling | 0 (No) |
| Self-interactions | 1 (No) |
| Maximum interactions | 2 |
| Prune | 1 (true) |
| No. of $F_b$ in the final model | 15 |
Table 3. Equations of the basis functions in the MARS model.

| Sl. No. | Basis Function | Equation |
|---|---|---|
| 1 | BF1 | max(0, φ − 0.352) |
| 2 | BF2 | max(0, 0.352 − φ) |
| 3 | BF3 | BF1 × max(0, D − 0.380) |
| 4 | BF4 | BF1 × max(0, 0.380 − D) |
| 5 | BF5 | max(0, B − 0.379) |
| 6 | BF6 | max(0, 0.379 − B) |
| 7 | BF7 | BF5 × max(0, γ − 0.57) |
| 8 | BF8 | max(0, D − 0.53) |
| 9 | BF9 | max(0, 0.53 − D) |
Table 4. Configuration of the optimum hybrid ELM models.

| Model | $n_s$ | $N_h$ | $k$ | $l_b$ | $u_b$ |
|---|---|---|---|---|---|
| ELM | – | 25 | – | – | – |
| ELM-PSO | 50 | 25 | 100 | −1 | +1 |
| ELM-EO | 50 | 25 | 100 | −1 | +1 |
Table 5. Ideal values of the statistical parameters.

| Parameter | Ideal Value | Parameter | Ideal Value |
|---|---|---|---|
| VAF | 100 | RMSE | 0 |
| R2 | 1 | WMAPE | 0 |
| PI | 2 | MAE | 0 |
| WI | 1 | MBE | 0 |
| Adj. R2 | 1 | NMBE | 0 |
| NS | 1 | LMI | 1 |
| RSR | 0 | Bias | 1 |
Table 6. Values of the performance parameters.

| Statistical Parameter | ELM (Testing) | ELM-EO (Testing) | ELM-PSO (Testing) | MARS (Testing) | ELM (Training) | ELM-EO (Training) | ELM-PSO (Training) | MARS (Training) |
|---|---|---|---|---|---|---|---|---|
| WMAPE | 0.0797 | 0.0306 | 0.0441 | 0.0498 | 0.0543 | 0.0030 | 0.0127 | 0.0396 |
| RMSE | 0.0558 | 0.0170 | 0.0186 | 0.0199 | 0.0248 | 0.0014 | 0.0060 | 0.0180 |
| VAF | 93.921 | 99.3963 | 99.3155 | 99.3155 | 99.1566 | 99.9973 | 99.951 | 99.5517 |
| R2 | 0.9425 | 0.9945 | 0.9932 | 0.9954 | 0.9915 | 0.9999 | 0.9995 | 0.9955 |
| Adj. R2 | 0.9413 | 0.9872 | 0.9840 | 0.9946 | 0.9910 | 0.9999 | 0.9993 | 0.9952 |
| NS | 0.9386 | 0.9938 | 0.9926 | 0.9916 | 0.9915 | 0.9999 | 0.9995 | 0.9955 |
| PI | 1.8247 | 1.9641 | 1.9586 | 1.9673 | 1.9578 | 1.9985 | 1.9930 | 1.9727 |
| RSR | 0.2477 | 0.0785 | 0.0858 | 0.0916 | 0.0919 | 0.0052 | 0.02200 | 0.0669 |
| Bias | 1.0237 | 1.1431 | 0.9178 | 1.0876 | 0.9723 | 0.9731 | 0.9799 | 0.9383 |
| NMBE | −1.0848 | 0.6977 | 1.205 | 1.8913 | 0.1616 | 0.0087 | 0.04750 | 0.1163 |
| WI | 0.9830 | 0.9984 | 0.9982 | 0.9978 | 0.9979 | 0.9999 | 0.9998 | 0.9988 |
| MAE | 0.0398 | 0.01088 | 0.0157 | 0.0177 | 0.0202 | 0.0012 | 0.0048 | 0.0150 |
| MBE | −0.0054 | 0.00248 | −0.0049 | 0.0067 | 0.0006 | 3.24 × 10⁻⁵ | 0.00018 | 0.00044 |
| LMI | 0.7823 | 0.93410 | 0.9052 | 0.8929 | 0.9114 | 0.9950 | 0.9791 | 0.9343 |
Table 7. Rank analysis of the simulated model outcomes for the testing and training datasets.

| Statistical Parameter | | ELM (Testing) | ELM-EO (Testing) | ELM-PSO (Testing) | MARS (Testing) | ELM (Training) | ELM-EO (Training) | ELM-PSO (Training) | MARS (Training) |
|---|---|---|---|---|---|---|---|---|---|
| WMAPE | Value | 0.0797 | 0.0306 | 0.0441 | 0.0498 | 0.0543 | 0.0030 | 0.0127 | 0.0396 |
| | Score | 1 | 4 | 3 | 2 | 1 | 4 | 3 | 2 |
| RMSE | Value | 0.0558 | 0.0170 | 0.0186 | 0.0199 | 0.0248 | 0.0014 | 0.0060 | 0.0180 |
| | Score | 1 | 4 | 3 | 3 | 1 | 4 | 3 | 3 |
| VAF | Value | 93.921 | 99.3963 | 99.3155 | 99.3155 | 99.1566 | 99.9973 | 99.951 | 99.5517 |
| | Score | 1 | 4 | 3 | 3 | 1 | 4 | 3 | 3 |
| R2 | Value | 0.9425 | 0.9945 | 0.9932 | 0.9954 | 0.9915 | 0.9999 | 0.9995 | 0.9955 |
| | Score | 1 | 3 | 2 | 4 | 1 | 4 | 3 | 2 |
| Adj. R2 | Value | 0.9413 | 0.9872 | 0.9840 | 0.9946 | 0.9910 | 0.9999 | 0.9993 | 0.9952 |
| | Score | 1 | 3 | 2 | 4 | 1 | 4 | 3 | 2 |
| NS | Value | 0.9386 | 0.9938 | 0.9926 | 0.9916 | 0.9915 | 0.9999 | 0.9995 | 0.9955 |
| | Score | 1 | 4 | 3 | 2 | 1 | 4 | 3 | 2 |
| PI | Value | 1.8247 | 1.9641 | 1.9586 | 1.9673 | 1.9578 | 1.9985 | 1.9930 | 1.9727 |
| | Score | 1 | 3 | 2 | 4 | 1 | 4 | 3 | 2 |
| RSR | Value | 0.2477 | 0.0785 | 0.0858 | 0.0916 | 0.0919 | 0.0052 | 0.02200 | 0.0669 |
| | Score | 1 | 4 | 3 | 2 | 1 | 4 | 3 | 2 |
| Bias | Value | 1.0237 | 1.1431 | 0.9178 | 1.0876 | 0.9723 | 0.9731 | 0.9799 | 0.9383 |
| | Score | 2 | 3 | 1 | 3 | 2 | 3 | 4 | 1 |
| NMBE | Value | −1.0848 | 0.6977 | 12.0509 | 1.8913 | 0.1616 | 0.0087 | 0.04750 | 0.1163 |
| | Score | 1 | 2 | 4 | 3 | 4 | 1 | 2 | 3 |
| WI | Value | 0.9830 | 0.9984 | 0.9982 | 0.9978 | 0.9979 | 0.9999 | 0.9998 | 0.9988 |
| | Score | 1 | 4 | 3 | 2 | 1 | 4 | 3 | 2 |
| MAE | Value | 0.0398 | 0.01088 | 0.0157 | 0.0177 | 0.0202 | 0.0012 | 0.0048 | 0.0150 |
| | Score | 1 | 4 | 3 | 2 | 1 | 4 | 3 | 2 |
| MBE | Value | −0.0054 | 0.00248 | −0.0049 | 0.0067 | 0.0006 | 3.24 × 10⁻⁵ | 0.00018 | 0.00044 |
| | Score | 4 | 2 | 3 | 1 | 1 | 4 | 3 | 2 |
| LMI | Value | 0.7823 | 0.93410 | 0.9052 | 0.8929 | 0.9114 | 0.9950 | 0.9791 | 0.9343 |
| | Score | 1 | 4 | 3 | 2 | 1 | 4 | 3 | 2 |
| Total | | 18 | 49 | 38 | 37 | 18 | 52 | 42 | 30 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

