Next Article in Journal
Development of a Framework for Wind Turbine Design and Optimization
Previous Article in Journal
A Comparative Study on the Efficiency of Reliability Methods for the Probabilistic Analysis of Local Scour at a Bridge Pier in Clay-Sand-Mixed Sediments
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On the Accuracy of the Sine Power Lomax Model for Data Fitting

by
Vasili B. V. Nagarjuna
1,
R. Vishnu Vardhan
1 and
Christophe Chesneau
2,*
1
Department of Statistics, Pondicherry University, Pondicherry 605 014, India
2
Department of Mathematics, LMNO, Université de Caen-Normandie, Campus II, Science 3, 14032 Caen, France
*
Author to whom correspondence should be addressed.
Submission received: 25 January 2021 / Revised: 6 February 2021 / Accepted: 9 February 2021 / Published: 13 February 2021

Abstract

:
Every day, new data must be analysed as well as possible in all areas of applied science, which requires the development of attractive statistical models, that is to say adapted to the context, easy to use and efficient. In this article, we innovate in this direction by proposing a new statistical model based on the functionalities of the sinusoidal transformation and power Lomax distribution. We thus introduce a new three-parameter survival distribution called sine power Lomax distribution. In a first approach, we present it theoretically and provide some of its significant properties. Then the practicality, utility and flexibility of the sine power Lomax model are demonstrated through a comprehensive simulation study, and the analysis of nine real datasets mainly from medicine and engineering. Based on relevant goodness of fit criteria, it is shown that the sine power Lomax model has a better fit to some of the existing Lomax-like distributions.

1. Introduction

A large part of applied mathematics consists of defining one or more models of a mathematical nature, allowing a sufficiently general consideration of a given phenomenon. In a somewhat schematic way, we can distinguish two kinds of modelling: the deterministic modelling where random variations are not taken into account and stochastic modelling which takes into account these random variations (roughly speaking, ‘stochastic’ means to be or have a random variable). In the context of stochastic modelling, the random variations are often associated with an underlying probability distribution. Stochastic modelling can be divided into two sub-categories: probabilistic modelling and statistical modelling. The main objective of the probabilistic modelling is to provide a formal framework making it possible to describe the random variations discussed above, and to study the general properties of the phenomena which govern them. More applied, the statistical modelling essentially consists of defining suitable tools aiming to model the observed data taking into account their random nature. This theme is fully developed in [1,2], among others.
Recent developments in stochastic modelling have been driven by the rapid progress and accessibility of computing power. In particular, these have allowed direct applications of existing continuous distributions with some functional complexity for various statistical purposes. Also, these have accelerated the creation of new families of distributions presenting original and practical characteristics. In this regard, we may refer to [3] for a complete overview. Among the latest developments, the families defined by ‘trigonometric transformations’ of a given distribution have attracted much attention due to their applicability and working capacity in many practical situations. The pioneering works of [4,5,6,7] have focused on the sinusoidal transformation leading to the so-called sine generated (S-G or Sin-G) family. The following equations are the generic definitions of the associated cumulative distribution function (cdf) and probability density function (pdf), respectively:
F S ( x ; ζ ) = sin π 2 G ( x ; ζ ) , x R
and
f S ( x ; ζ ) = π 2 g ( x ; ζ ) cos π 2 G ( x ; ζ ) , x R .
In these equations, G ( x ; ζ ) and g ( x ; ζ ) are the cdf and pdf of a certain continuous distribution with parameter(s) vector denoted by ζ , respectively. They are related to a reference distribution chosen a priori by the practitioner, depending on the context of the study. It is now established that the S-G family (i) offers an attractive alternative to the reference family; one can show that G ( x ; ζ ) F S ( x ; ζ ) for any x R , (ii) is of acceptable mathematical complexity without introducing new parameters, and (iii) has the ability to provide flexible statistical models to accommodate data of varying nature. To illustrate these items, in Reference [4], the exponential distribution is used as a reference to define the SE model. It turns out to be well suited to analyse the important bladder cancer patients dataset of [8]. In another study, the inverse Weibull (IW) distribution developed by [9] was considered to be the reference distribution; the sine IW (SIW) model was introduced by [6]. By analyzing the famous guinea pigs dataset by [10], the SIW model is proven to perform better compared to serious and comparable competing models. An open-source R package on the SIW model is developed in [11], facilitating the use of the model beyond these basic purposes. These works inspired the construction of other trigonometric families of distributions, such as the CS-G family by [12], C-G family by [7], TransSC-G family by [13], NS-G family by [14], STL-G family by [15] and SKum-G family by [16].
In this paper, we contribute to the success of the S-G family by applying it to a specific three-parameter survival distribution: the power Lomax (PL) distribution proposed by [17]. We thus introduce the sine PL (SPL) distribution and model. Thus, a retrospective on the PL distribution is necessary to understand the proposed methodology. First, the Lomax distribution was introduced by [18]. It can be presented as a manageable two-parameter heavy-tailed survival distribution with a tuning polynomial decay and also, as a derivation of the Pareto distribution as described in [19] (page 573). It is governed by the cdf and pdf defined by
G L ( x ; ξ ) = 1 ( 1 + λ x ) α , x > 0
and
g L ( x ; ξ ) = α λ ( 1 + λ x ) ( α + 1 ) , x > 0 ,
respectively, with G L ( x ; ξ ) = g L ( x ; ξ ) = 0 for x 0 , where ξ = ( α , λ ) , α is a shape parameter and λ is a scale parameter, all the parameters taking strictly positive values. It finds numerous applications in reliability engineering and life testing. The theory, inference and applications of the Lomax distribution have been the subjects of the following inevitable references: [20,21,22,23,24,25,26]. The PL distribution proposed by [17] is obtained by making use of the power transformation to the Lomax distribution, aiming to increase its capabilities on several functional aspects. It corresponds to the distribution of the random variable X = Y 1 / β , where Y is a random variable with the Lomax distribution and β > 0 . Consequently, based on (3) and (4), the PL distribution is defined by the cdf and pdf defined by
G P L ( x ; ζ ) = G L ( x β ; ξ ) = 1 ( 1 + λ x β ) α , x > 0
and
g P L ( x ; ζ ) = α β λ x β 1 ( 1 + λ x β ) ( α + 1 ) , x > 0 ,
respectively, with G P L ( x ; ζ ) = g P L ( x ; ζ ) = 0 for x 0 , where ζ = ( α , β , λ ) , α is a shape parameter, and β and λ are scale parameters, all the parameters taking strictly positive values. Contrary to the Lomax distribution, it is established in [17] that the PL distribution adapts to both inverted bathtub and decreasing hazard rates. The practical gain is particularly impressive; the PL model is better than ten competing models for analyzing the bladder cancer patients dataset of [8], all based on the Lomax model. For the sake of optimality, some motivated distributions extending or generalizing the PL distribution was introduced, including the type II Topp-Leone PL (TIITLPL) distribution by [27], type I half logistic PL distribution by [28], inverse PL distribution by [29], Marshall-Olkin PL distribution by [30], exponentiated PL distribution by [31] and Kumaraswamy generalized PL distribution (KPL) by [32]. The main strategy of these proposed distributions is to add more parameters to the PL distribution based on exponentiated, transmuted or truncated schemes. Basically, these schemes give better results but add more parameters to the reference distribution; the problem of manipulating all these parameters simultaneously can present a certain difficulty from the modelling point of view. Thus, the immediate motivation of the SPL model is to use the S-G scheme to improve the efficiency of the PL model with the existing parameters. Deeper motivations come after further investigation which is detailed in the next study. To summarize, the functionality and flexibility of the SPL model are particularly attractive for data fitting. Indeed, the corresponding pdf has different kinds of curves such as uni-modal, symmetrical, asymmetrical on right and left, reversed J-shaped curves. Also, the model exhibits decreasing and increasing, inverted bathtub and reversed-J hazard rates. These properties give the SPL model a constant consistency in the precision of the fits unlike many other comparable models. This statement is illustrated in the practical environment by considering nine published datasets mainly from medicine and engineering, and four competing models derived from the Lomax distribution.
We organize the rest of the paper as follows. Section 2 is devoted to the definition, characteristics and main properties of the SPL distribution. The parametric estimation related to the SPL model is discussed and illustrated by a comprehensive simulation study in Section 3. Concrete applications to datasets are provided in Section 4. Finally, conclusions are stated in Section 5.

2. The SPL Distribution

2.1. Function Anlysis

Here, some mathematics of the SPL distribution are presented. First, by considering (5) and (6) in (1) and (2), we obtain the main distributional functions of the SPL distribution; the corresponding cdf and pdf are given as
F S P L ( x ; ζ ) = cos π 2 ( 1 + λ x β ) α , x > 0
where ζ = ( α , β , λ ) , and
f S P L ( x ; ζ ) = π 2 α β λ x β 1 ( 1 + λ x β ) ( α + 1 ) sin π 2 ( 1 + λ x β ) α , x > 0 ,
with F S P L ( x ; ζ ) = f S P L ( x ; ζ ) = 0 for x 0 . We recall that ζ = ( α , β , λ ) , α is a shape parameter, and β and λ are scale parameters, all the parameters taking strictly positive values. Considering different values of the parameters, variant forms of the pdf can be obtained. More specifically, by differentiating (8), it can be readily verified that f S P L ( x ; ζ ) is decreasing for β 1 and unimodal for β > 1 . The more representative of them are shown in Figure 1.
From Figure 1, we observe that the pdf of the SPL distribution can be decreasing or unimodal, with a very versatile asymmetry in all the directions. This versatility is an attractive point for the use of the SPL model in data fitting.
We complete this functional study by discussing the hazard rate function (hrf). First, in full generality, the hrf measures the tendency of an item to fail or die depending on the age reached. Therefore, it plays a key role in the classification of survival distributions. Basically, the shapes of hazard rates are either monotonic (increasing or decreasing) or non-monotonic (bathtub or inverted bathtub). The hrf of the SPL distribution is given by
h S P L ( x ; ζ ) = π 2 α β λ x β 1 ( 1 + λ x β ) ( α + 1 ) cot π 4 ( 1 + λ x β ) α , x > 0 ,
and h S P L ( x ; ζ ) = 0 for x 0 . Upon differentiation of (9), it can be seen that h S P L ( x ; ζ ) is increasing for β 1 and α 1 . It is also conjectured that h S P L ( x ; ζ ) is decreasing for β 1 and α 1 , and unimodal for β 1 and α 1 . The graphical study in Figure 2 supports these claims.
Figure 2 emphasizes the fact that the proposed SPL distribution possesses increasing and decreasing, and also upside down bathtub hazard rates.
Another important function of the SPL distribution is the quantile function (qf). It is defined as the inverse function of the corresponding cdf. Thus, based on (7), it is specified by
Q ( u ; ζ ) = F S P L 1 ( u ; ζ ) = 1 λ 2 π arccos u 1 / α 1 1 / β , u ( 0 , 1 ) .
As the cdf, the qf determined the SPL distribution. Classically, we can use it for determining the median, as well as the lower and upper quartiles. The qf can also be used to generate values from a random variable with the SPL distribution. Further detail on the quantile-based reliability analysis can be found in [33].

2.2. Moment Analysis

We now conduct a moment analysis. The following result gives a series expansion for the (crude) moments of a random variable with the SPL distribution.
Proposition 1.
Let r 1 be an integer and X be a random variable with the SPL distribution. Then, for r < 2 α β , the r-th moment of X exists and can be expanded as
E ( X r ) = α λ r / β k = 1 + ( 1 ) k + 1 ( 2 k 1 ) ! π 2 2 k B r β + 1 , 2 k α r β ,
where E denotes the mathematical expectation and B ( a , b ) refers to the standard beta function given as B ( a , b ) = 0 1 t a 1 ( 1 t ) b 1 d t = 0 + t a 1 ( 1 + t ) ( a + b ) d t for a , b > 0 .
Proof. 
First, the definition of E ( X r ) is
E ( X r ) = + x r f S P L ( x ; ζ ) d x = 0 + x r f S P L ( x ; ζ ) d x ,
since f S P L ( x ; ζ ) = 0 for x 0 .
Let us study the mathematical existence of this integral by the Riemann integrability criterion. When x 0 , we have x r f S P L ( x ; ζ ) ( π / 2 ) α β λ x r + β 1 , which is integrable over ( 0 , δ ) with δ > 0 since β > 0 . For the case x + , we have x r f S P L ( x ; ζ ) ( π 2 / 4 ) α β λ 2 α x r 2 α β 1 , which is integrable over ( δ , + ) with δ > 0 if and only if r < 2 α β . The desired condition is obtained.
Let us now investigate a linear representation of the cdf expressed in (7), from which we will deduce a series expansion for the pdf as given by (8). By using the Taylor series expansion of the cosine function, for x > 0 , we get
F S P L ( x ; ζ ) = cos π 2 ( 1 + λ x β ) α = k = 0 + ( 1 ) k ( 2 k ) ! π 2 ( 1 + λ x β ) α 2 k = k = 0 + ( 1 ) k ( 2 k ) ! π 2 2 k ( 1 + λ x β ) 2 k α .
By applying a first order differentiation with respect to x, the following series expansion of the pdf comes:
f S P L ( x ; ζ ) = α λ β k = 1 + ( 1 ) k + 1 ( 2 k 1 ) ! π 2 2 k x β 1 ( 1 + λ x β ) 2 k α 1 .
Please note that we ignored the term in k = 0 since the corresponding term disappears. From (11) and (12), by integrating f S P L ( x ; ζ ) with respect to x, swapping the symbols ∫ and ∑ by the dominated convergence theorem, and applying the change of variables y = λ x β , we obtain
E ( X r ) = α λ β k = 1 + ( 1 ) k + 1 ( 2 k 1 ) ! π 2 2 k 0 + x r + β 1 ( 1 + λ x β ) 2 k α 1 d x = α λ r / β k = 1 + ( 1 ) k + 1 ( 2 k 1 ) ! π 2 2 k B r β + 1 , 2 k α r β .
This completes the proof of Proposition 1. □
A computational remark is that, for K large enough, a precise approximation of μ r is obtained as
E ( X r ) α λ r / β k = 1 K ( 1 ) k + 1 ( 2 k 1 ) ! π 2 2 k B r β + 1 , 2 k α r β .
Diverse moment measure can be defined from Proposition 1. Here, we restrict our attention on the variance of X basically defined by Var = E ( X 2 ) [ E ( X ) ] 2 .
The first four moments and variance of X for different parameter values are indicated in Table 1 provided α β > 2 .
From Table 1, we see the numerical versatility of the moment measures considered, varying from small to large values; central and dispersion indicators may be negligible or substantial. This confirms the claim about the overall flexibility of the SPL distribution.
Based on similar developments employed in the proof Proposition 1, it is possible to express various series expansions of moment-type functions. Here, we complete our moment analysis by investigating the incomplete moments of the SPL distribution which are involved in the definition of many applied measures and indicators.
Proposition 2.
Let r 1 be an integer, t 0 and X be a random variable with the SPL distribution. Then, the r-th incomplete moment of X with the truncated value t exists and can be expanded as
E ( X r I ( X t ) ) = α λ r / β k = 1 + ( 1 ) k + 1 ( 2 k 1 ) ! π 2 2 k B λ t β / ( 1 + λ t β ) r β + 1 , 2 k α r β ,
where I denotes the indicator function and B u ( a , b ) refers to the truncated beta function given as B u ( a , b ) = 0 u t a 1 ( 1 t ) b 1 d t for a , b > 0 and u ( 0 , 1 ) .
Proof. 
First, we have
E ( X r I ( X t ) ) = 0 t x r f S P L ( x ; ζ ) d x .
The rest of the development follows the lines of the proof of Proposition 1; From (12) and (13), by integrating f S P L ( x ; ζ ) with respect to x, swapping the symbols ∫ and ∑ owing to the dominated convergence theorem, and applying the change of variables y = λ x β , we obtain
E ( X r I ( X t ) ) = α λ β k = 1 + ( 1 ) k + 1 ( 2 k 1 ) ! π 2 2 k 0 t x r + β 1 ( 1 + λ x β ) 2 k α 1 d x = α λ r / β k = 1 + ( 1 ) k + 1 ( 2 k 1 ) ! π 2 2 k 0 λ t β y r / β ( 1 + y ) 2 k α 1 d y .
Next, with the change of variables z = y / ( 1 + y ) , we get
E ( X r I ( X t ) ) = α λ r / β k = 1 + ( 1 ) k + 1 ( 2 k 1 ) ! π 2 2 k 0 λ t β / ( 1 + λ t β ) z r / β ( 1 z ) 2 k α r / β 1 d z = α λ r / β k = 1 + ( 1 ) k + 1 ( 2 k 1 ) ! π 2 2 k B λ t β / ( 1 + λ t β ) r β + 1 , 2 k α r β .
This completes the proof of Proposition 2. □
Based on the incomplete moments, we can define the mean residual life function, mean waiting time, mean deviation about the mean, and various inequalities measures (Lorenz curve, Gini index, Bonferroni curve, Atkinson index, Zenga index, Pietra index, etc.). In this regard, we may refer the reader to the book of [34]. However, these measures are beyond the applied line of this paper.

3. Inference of the SPL Model

This section is devoted to the inferential treatment of the SPL distribution for the perspectives of statistical modelling. The maximum likelihood method, as described in full generality in [35], is employed. A mathematical description of this method in the context of the SPL distribution is provided below.
First, let x 1 , x 2 , x n be observations drawn from a random variable X with the SPL distribution. Then the corresponding likelihood function and log-likelihood function are
L S P L ( ζ ) = π 2 n ( α β λ ) n i = 1 n x i β 1 ( 1 + λ x i β ) ( α + 1 ) sin π 2 ( 1 + λ x i β ) α
and
log L S P L ( ζ ) = n log π 2 + n log ( α β λ ) + ( β 1 ) i = 1 n log x i ( α + 1 ) i = 1 n log ( 1 + λ x i β ) + i = 1 n log sin π 2 ( 1 + λ x i β ) α ,
respectively. Then, the maximum likelihood estimates (MLEs) are defined by ζ ^ = a r g m a x ζ L S P L ( ζ ) = a r g m a x ζ [ log L S P L ( ζ ) ] . The components of ζ ^ , say α ^ , β ^ and λ ^ , form the MLEs of α , β and λ , respectively. The MLEs can be formalized through non-linear equations involving the partial differentiation of the log-likelihood function with respect to the parameters α , β and λ . These partial derivatives are given as
α log L S P L ( ζ ) = n α i = 1 n log ( 1 + λ x i β ) π 2 i = 1 n ( 1 + λ x i β ) α log ( 1 + λ x i β ) × cot π 2 ( 1 + λ x i β ) α
β log L S P L ( ζ ) = n β + i = 1 n log x i ( α + 1 ) λ i = 1 n x i β 1 + λ x i β log x i π 2 α λ i = 1 n x i β log x i × ( 1 + λ x i β ) α 1 cot π 2 ( 1 + λ x i β ) α
and
λ log L S P L ( ζ ) = n λ ( α + 1 ) i = 1 n x i β 1 + λ x i β π 2 α i = 1 n x i β ( 1 + λ x i β ) α 1 cot π 2 ( 1 + λ x i β ) α .
Simple analytical expressions for α ^ , β ^ and λ ^ remain impossible, but practice only requires numerical evaluations of them. These numerical values can be easily obtained using specific tools in statistical software as the R software (see [36]). Also, the well-established theory on MLEs ensures that the random version of ζ ^ is asymptotically three-dimensional normal with mean vector ζ and variance-covariance matrix V = { ζ 2 log L S P L ( ζ ) ζ = ζ ^ } 1 , where ξ denotes the gradient according to ξ .
In particular, the (asymptotic) estimated standard error (SE) of α ^ is obtained by taking the square-root of the first diagonal component of V, and we can proceed in a similar way to obtain the SEs of the two other parameters. The asymptotic normal distribution is at the basis of diverse statistical tests or confidence intervals. Also, based on ξ ^ , f S P L ( x ; ζ ^ ) is the estimated pdf of f S P L ( x ; ζ ) . This estimated pdf plays a central role in fitting the normalized histogram of the data, as discussed in the next section on applications.
We now evaluate the accuracy of the MLEs of the SPL model. The data are artificial; they are generated by using the qf as defined by (10) through the inverse transform sampling technique. We conduct 1000 Monte Carlo simulations for each sample size n with n = 50 , 100, 200, 300 and 500 to the following different sets of parameters: Set I = ( 0.5 , 1.5 , 0.5 ) , Set II = ( 1.25 , 1.25 , 0.5 ) , Set III = ( 1.5 , 1.5 , 0.5 ) and Set IV = ( 1.5 , 2.5 , 0.5 ) with reference to the usual order ( α , β , λ ) . In each case, the standard mean MLE (MMLE), bias (Bias) and mean squared error (MSE) are calculated. The results are reported in Table 2.
From Table 2, we see that the maximum likelihood method performs quite well to estimate the parameters for the considered sample sizes. Indeed, as the sample size increases, the biases and the SEs of the MLEs decrease as expected. Also, we observe that when the sample size increases, the MMLEs are closed to the true parameter values.
We now present some useful measures of adequacy by using the notation of the SPL distribution for convenience. Let x 1 , x 2 , , x n be the data and x ( 1 ) , x ( 2 ) , , x ( n ) be their ordered values. First, we consider the Cramér-von Mises (W*), Anderson Darling (A*) and Kolmogorov-Smirnov (K-S) statistics ( D n ) defined by
W * = 1 12 n + i = 1 n F S P L ( x ( i ) ; ζ ^ ) 2 i 1 n 2 ,
A * = n i = 1 n 2 i 1 n log ( F S P L ( x ( i ) ; ζ ^ ) ) + log ( 1 F S P L ( x i ; ζ ^ ) )
and
D n = max i = 1 , , n i n F S P L ( x ( i ) ; ζ ^ ) , F S P L ( x ( i ) ; ζ ^ ) i 1 n ,
respectively, where ζ denotes the parameters of the distribution, i.e., ζ = ( α , β , λ ) for the SPL distribution and ζ ^ for its MLE. The p-Value of the K-S test related to D n is also considered. The above definitions can be adapted for any other distribution by changing the definition of the cdf and the notation of the parameters. These adequacy measures are widely used to find out which model is best suited. The model with the minimum value for W* or A*, and maximum value for p-Value, is chosen as the best one that is in adequacy to the data.
Also, we consider the Akaike information criterion (AIC), corrected Akaike information criterion (CAIC), Bayesian information criterion (BIC) and Hannan-Quinn information criterion (HQIC), defined in the context of the SPL distribution as
AIC = 2 log L S P L ( x ; ζ ^ ) + 2 k , BIC = 2 log L S P L ( x ; ζ ^ ) + k log ( n ) , CAIC = 2 log L S P L ( x ; ζ ^ ) + 2 k n n k 1 , HQIC = 2 log L S P L ( x ; ζ ^ ) + 2 k log [ log ( n ) ] ,
respectively, where k is the number of parameters so k = 3 for the SPL distribution. As commonly accepted, the model with the minimum value for AIC or CAIC or BIC or HQIC is chosen as the best one that fits the data. Further informations on the use and interpretation of the measures W*, A*, AIC, CAIC, BIC and HQIC can be found in [37].
In this study, we aim to compare the SPL model related to the SPL distribution with the useful and competitive Lomax-type model listed in Table 3.
We can notice that the Lomax model is nested in the TLGL, EL and PL models. The proposed SPL model is completely different in this sense. In addition, conceptually, the TLGL and EL models are closed; they coincide with a reparametrization of the parameters.

4. Applications of the SPL Model

Based on the above methodology, we apply the SPL model on nine datasets. They differ mainly in size, characteristics or background, but all of them are of modern interest to their respective fields. For each dataset, we proceed as follows:
  • We briefly present the data, with reference(s).
  • We provide a table that summarizes the main statistical characteristics of the data.
  • We assess the quality of the fit measures of the models considered and organize them in a table in order of the model performance.
  • As complementary work, we indicate the MLES of the model parameters as well as the related SEs.
  • We end with a visual approach by plotting the histogram of the data and the fitted pdfs, and, in another graph, the probability-probability (PP) plot for the SPL model only.
Data set 1: We consider a real dataset on the remission times (in months) of a random sample of 128 bladder cancer patients. This dataset is given by Lee and Wang [40] and it contains the following values: 0.08, 2.09, 3.48, 4.87, 6.94, 8.66, 13.11, 23.63, 0.20, 2.23, 3.52, 4.98, 6.97, 9.02, 13.29, 0.40, 2.26, 3.57, 5.06, 7.09, 9.22, 13.80, 25.74, 0.50, 2.46, 3.64, 5.09, 7.26, 9.47, 14.24, 25.82, 0.51, 2.54, 3.70, 5.17, 7.28, 9.74, 14.76, 26.31, 0.81, 2.62, 3.82, 5.32, 7.32, 10.06, 14.77, 32.15, 2.64, 3.88, 5.32, 7.39, 10.34, 14.83, 34.26, 0.90, 2.69, 4.18, 5.34, 7.59, 10.66, 15.96, 36.66, 1.05, 2.69, 4.23, 5.41, 7.62, 10.75, 16.62, 43.01, 1.19, 2.75, 4.26, 5.41, 7.63, 17.12, 46.12, 1.26, 2.83, 4.33, 5.49, 7.66, 11.25, 17.14, 79.05, 1.35, 2.87, 5.62, 7.87, 11.64, 17.36, 1.40, 3.02, 4.34, 5.71, 7.93, 11.79, 18.10, 1.46, 4.40, 5.85, 8.26, 11.98, 19.13, 1.76, 3.25, 4.50, 6.25, 8.37, 12.02, 2.02, 3.31, 4.51, 6.54, 8.53, 12.03, 20.28, 2.02, 3.36, 6.76, 12.07, 21.73, 2.07, 3.36, 6.93, 8.65, 12.63, 22.69.
A summary measure of descriptive statistics of dataset 1 is provided in Table 4.
We see in Table 4 that the data are right skewed and highly leptokurtic with high variance. With respect to model adequacy, the measures W*, A*, D n , p-Value, AIC, CAIC, BIC and HQIC are reported in Table 5.
From Table 5, we observe that the SPL model possesses the lowest values for W*, A*, D n , AIC, CAIC, BIC and HQIC, and the highest value for p-Value compared to the other models. It can be considered the best. The second best model is the PL model.
Please note that for this dataset, the results for the TLGL and EL models are almost identical due to their similar nature, but small numerical variations are observed without rounding.
For additional information, the MLEs of the model parameters as well as their SEs are reported in Table 6.
From Table 6, among other, we see that the parameters α , β and λ of the SPL model have been estimated by α ^ = 1.0216200 , β ^ = 1.3956063 and λ ^ = 0.0371991 , respectively, with quite small SEs.
Figure 3 shows two graphics: the histogram of the data fitted by the estimated pdfs, and the PP plot for the SPL model only.
In Figure 3, we observe that the empirical objects are almost perfectly adjusted by the estimated objects. In particular, in the PP plot, the black line is almost confused with the estimated red line related to the SPL model.
Data set 2: The considered data represent the failure times of the mechanical components of the aircraft windshield. They are taken from [41]. They were recently reviewed by [42]. The data are: 0.040, 1.866, 2.385, 3.443, 0.301, 1.876, 2.481, 3.467, 0.309, 1.899, 2.610, 3.478, 0.557, 1.911, 2.625, 3.578, 0.943, 1.912, 2.632, 3.595, 1.070, 1.914, 2.646, 3.699, 1.124, 1.981, 2.661, 3.779, 1.248, 2.010, 2.688, 3.924, 1.281, 2.038, 2.823, 4.035, 1.281, 2.085, 2.890, 4.121, 1.303, 2.089, 2.902, 4.167, 1.432, 2.097, 2.934, 4.240, 1.480, 2.135, 2.962, 4.255, 1.505, 2.154, 2.964, 4.278, 1.506, 2.190, 3.000, 4.305, 1.568, 2.194, 3.103, 4.376, 1.615, 2.223, 3.114, 4.449, 1.619, 2.224, 3.117, 4.485, 1.652, 2.229, 3.166, 4.570, 1.652, 2.300, 3.344, 4.602, 1.757, 2.324, 3.376, 4.663.
A summary of descriptive statistics for dataset 2 is provided in Table 7.
Based on the information of Table 7, we can say that the data are approximately symmetric and platykurtic, with little dispersion. One more point, we observe that the data have a negative kurtosis value which means that the underlying distributions should have lighter tails.
The statistical measures considered for the comparison of the models are given in Table 8.
From Table 8, the values of the model adequacy measures and goodness of fit test are clearly in favor of the SPL model. The second best model is the PL model.
The MLEs of the parameters of the SPL model and other models with their SEs are reported in Table 9.
In addition, the estimated pdfs over the histogram and PP plot of the SPL model are displayed in Figure 4.
From Figure 4, it is obvious that the light tails of the SPL model are instrumental in having a better fit. In addition, the PP plot underlines this power of adaptation; the black line is almost confused with the estimated red line.
Data set 3: We now consider a dataset containing 27 observations of time of successive failures of the air conditioning system of jets in a fleet of Boeing 720 as reported in Proschan [43]. Recently, this data was studied by [44] and the data are: 1, 4, 11, 16, 18, 18, 18, 24, 31, 39, 46, 51, 54, 63, 68, 77, 80, 82, 97, 106, 111, 141, 142, 163, 191, 206, 216.
Some descriptive measures of dataset 3 are provided in Table 10.
From Table 10, we see that the data are right skewed and platykurtic with a high variance.
Table 11 indicates the values of the statistical measures considered to compare the models.
The analysis of Table 11 ensures that the SPL model is the best with, in particular, p-Value = 0.9399 . The second best model is the EL model.
The MLEs of the model parameters as well as their SEs are reported in Table 12.
The estimated pdfs over the histogram and the PP plot of the SPL model are shown in Figure 5.
In Figure 5, the fitted power of the SPL model is flagrant; the corresponding estimated pdf has captured the decreasing roundness shape of the histogram, contrary to the other estimated pdfs. In addition, the red line of the PP plot is generally close to the black line.
Data set 4: The data represent 69 strength measures for single carbon fibers (and impregnated 1000-carbon fiber tows). They are given by [45]. The measures in GPA by subtracting 1 are: 0.0312, 0.314, 0.479, 0.552, 0.700, 0.803, 0.861, 0.865, 0.944, 0.958, 0.966, 0.977, 1.006, 1.021, 1.027, 1.055, 1.063, 1.098, 1.140, 1.179, 1.224, 1.240, 1.253, 1.270, 1.272, 1.274, 1.301, 1.301, 1.359, 1.382, 1.382, 1.426, 1.434, 1.435, 1.478, 1.490, 1.511, 1.514, 1.535, 1.554, 1.566, 1.570, 1.586, 1.629, 1.633, 1.642, 1.648, 1.684, 1.697, 1.726, 1.770, 1.773, 1.800, 1.809, 1.818, 1.821, 1.848, 1.880, 1.954, 2.012, 2.067, 2.084, 2.090, 2.096, 2.128, 2.233, 2.433, 2.585, 2.585,4.32.
A statistical description of dataset 4 is given in Table 13.
Table 13 shows that the data are almost symmetric and leptokurtic, with a low variance.
The fitting performance of the considered models are investigated numerically in Table 14.
From Table 14, we see that the SPL model is more relevant for the fit of the dataset than the other models. Indeed, it has the lowest value for all the statistical measures considered, except for the p-Value where it has the highest value. The second best model is the PL model.
Table 15 contains the MLEs of the considered models along with their SEs.
The fitted histogram of the data is shown in Figure 6, along with the PP plot of the SPL model.
From Figure 6, the curve of the estimated pdf of the SPL model is close to the shape of the histogram and has captured the ‘elbow phenomena’ in the right. The corresponding PP plot is also convincing.
Data set 5: We now consider a dataset containing 100 observations on breaking stress of carbon fibers (in Gba). It was studied by [46] and the data are: 3.7, 2.74, 2.73, 2.5, 3.6, 3.11, 3.27, 2.87, 1.47, 3.11,4.42, 2.41, 3.19, 3.22, 1.69, 3.28, 3.09, 1.87, 3.15, 4.9, 3.75, 2.43, 2.95, 2.97, 3.39, 2.96, 2.53,2.67, 2.93, 3.22, 3.39, 2.81, 4.2, 3.33, 2.55, 3.31, 3.31, 2.85, 2.56, 3.56, 3.15, 2.35, 2.55, 2.59,2.38, 2.81, 2.77, 2.17, 2.83, 1.92, 1.41, 3.68, 2.97, 1.36, 0.98, 2.76, 4.91, 3.68, 1.84, 1.59, 3.19,1.57, 0.81, 5.56, 1.73, 1.59, 2, 1.22, 1.12, 1.71, 2.17, 1.17, 5.08, 2.48, 1.18, 3.51, 2.17, 1.69,1.25, 4.38, 1.84, 0.39, 3.68, 2.48, 0.85, 1.61, 2.79, 4.7, 2.03, 1.8, 1.57, 1.08, 2.03, 1.61, 2.12,1.89, 2.88, 2.82, 2.05, 3.65.
A summary of descriptive statistics for these data is presented in Table 16.
From Table 16, we see that the data are approximately symmetric and platykurtic with a low variability.
The statistical measures considered for the comparison of the models are given in Table 17.
In our framework, Table 17 attests to the superior adequacy of the SPL model.
The MLEs of the model parameters and their SEs are reported in Table 18.
A visual work is performed in Figure 7, showing the histogram and PP plot of the SPL model.
In Figure 7, the flexible skewness of the SPL model is clearly the key, allowing the symmetrical nature of the data to be fully captured. The observation of the PP plot confirm the high quality of the fit of the SPL model.
Data set 6: The data correspond to times in days between 109 successive mining catastrophes in Great Britain, for the period 1875-1951, as published in [47]. The sorted data are given as follows: 1, 4, 4, 7, 11, 13, 15, 15, 17, 18, 19, 19, 20, 20, 22, 23, 28, 29, 31, 32, 36, 37, 47, 48, 49, 50, 54, 54, 55, 59, 59, 61, 61, 66, 72, 72, 75, 78, 78, 81, 93, 96, 99, 108, 113, 114, 120, 120, 120, 123, 124, 129, 131, 137, 145, 151, 156, 171, 176, 182, 188, 189, 195, 203, 208, 215, 217, 217, 217, 224, 228, 233, 255, 271, 275, 275, 275, 286, 291, 312, 312, 312, 315, 326, 326, 329, 330, 336, 338, 345, 348, 354, 361, 364, 369, 378, 390, 457, 467, 498, 517, 566, 644, 745, 871, 1312, 1357, 1613, 1630.
A descriptive statistical summary of dataset 6 is presented in Table 19.
From Table 19, we can say that the data are right skewed and leptokurtic, with a very high variance.
The goodness of fit measures of the considered models are calculated and collected in Table 20.
From Table 20, the SPL model shows the best results, far superior to those of the competition. The second best model is the EL model.
The MLEs of the model parameters along with their SEs are reported in Table 21.
Figure 8 illustrates the nice fit of the SPL model by two different graphical approaches.
From Figure 8, we observe that the adjustment of the SPL model proposes a slope more adapted to the form of the histogram of the data, compared to those of the other models. A nice result in the PP plot is also observed.
Data set 7: The data are measures of life of Kevlar 373/epoxy fatigue fractures that are subjected to constant pressure (at the 90% stress level) until all has failed. These data was recently studied by [13] and they are: 0.0251, 0.0886, 0.0891, 0.2501, 0.3113, 0.3451, 0.4763, 0.5650, 0.5671, 0.6566, 0.6748, 0.6751, 0.6753, 0.7696, 0.8375, 0.8391, 0.8425, 0.8645, 0.8851, 0.9113, 0.9120, 0.9836, 1.0483, 1.0596, 1.0773, 1.1733, 1.2570, 1.2766, 1.2985, 1.3211, 1.3503, 1.3551, 1.4595, 1.4880, 1.5728, 1.5733, 1.7083, 1.7263, 1.7460, 1.7630, 1.7746, 1.8275, 1.8375, 1.8503, 1.8808, 1.8878, 1.8881, 1.9316, 1.9558, 2.0048, 2.0408, 2.0903, 2.1093, 2.1330, 2.2100, 2.2460, 2.2878, 2.3203, 2.3470, 2.3513, 2.4951, 2.5260, 2.9911, 3.0256, 3.2678, 3.4045, 3.4846, 3.7433, 3.7455, 3.9143, 4.8073, 5.4005, 5.4435, 5.5295, 6.5541, 9.0960.
Table 22 presents a brief summary of descriptive statistics for these data.
From Table 22, it can be deduced that the data are right skewed and leptokurtic, with a low variability.
According to Table 23, for the purpose of optimal data fit, the SPL model is more pertinent than the other models. The second best model is the PL model.
We numerically complete the above results by showing the MLEs of the model parameters as well as the SEs inTable 24.
The histogram and PP plot of the data with the model fits are shown in Figure 9.
From Figure 9, in the fitting exercise, we see that the SPL model is slightly better than the competing models. A favorable PP plot is also observed.
Data set 8: Data on service times for a particular model windshield are now considered. They are given from [41]. The unit for measurement is 1000 h and the data are: 0.046, 1.436, 2.592, 0.140, 1.492, 2.600, 0.150, 1.580, 2.670, 0.248, 1.719, 2.717,0.280, 1.794, 2.819, 0.313, 1.915, 2.820, 0.389, 1.920, 2.878, 0.487, 1.963, 2.950, 0.622, 1.978, 3.003, 0.900, 2.053, 3.102, 0.952, 2.065, 3.304, 0.996, 2.117, 3.483, 1.003, 2.137, 3.500, 1.010, 2.141, 3.622, 1.085, 2.163, 3.665, 1.092, 2.183, 3.695, 1.152, 2.240, 4.015, 1.183, 2.341, 4.628, 1.244, 2.435, 4.806, 1.249, 2.464, 4.881, 1.262, 2.543, 5.140.
Table 25 presents a concise statistical description of these data.
We see in Table 25 that the data are right skewed and platykurtic, with a moderate variability.
Table 26 indicates that the SPL model is the most appropriate fitted model. The second best model is the PL model.
Some additional elements are now given. The MLEs of the models along with their SEs are shown in Table 27.
We visually see the adjustability of the SPL model in Figure 10.
From Figure 10, it is evident that the histogram of the data is better fitted by the estimated pdf of the SPL model. The red line of the PP plot is relatively close to the black line, confirming the SPL model fitting power.
Data set 9: Data relating to the strengths of 1.5 cm glass fibres which was obtained by workers at the UK National Physical Laboratory are now used. They were previously analysed by [48]. The data are: 0.55, 0.74, 0.77, 0.81, 0.84, 1.24, 0.93, 1.04, 1.11, 1.13, 1.30, 1.25, 1.27, 1.28, 1.29, 1.48, 1.36, 1.39, 1.42, 1.48, 1.51, 1.49, 1.49, 1.50, 1.50, 1.55, 1.52, 1.53, 1.54, 1.55, 1.61, 1.58, 1.59, 1.60, 1.61, 1.63, 1.61, 1.61, 1.62, 1.62, 1.67, 1.64, 1.66, 1.66, 1.66, 1.70, 1.68, 1.68, 1.69, 1.70, 1.78, 1.73, 1.76, 1.76, 1.77, 1.89, 1.81, 1.82, 1.84, 1.84, 2.00, 2.01, 2.24.
A first statistical approach of these data is proposed in Table 28.
From Table 28, we observe that the data are left skewed and platykurtic, with almost negligible dispersion.
The goodness of fit measures of the considered models are calculated and collected in Table 29.
According to Table 29, we assert that the SPL model has a better goodness of fit than the other models. The second best model is the PL model.
The MLEs of the model parameters and their SEs are shown in Table 30.
Estimated pdfs over the histogram of the data and PP plot of the SPL model are shown in Figure 11.
From Figure 11, unsurprisingly in view of Table 30, the SPL model shows the best fit curve of the histogram. A nice fit of the SPL model is also validated by the PP plot.

5. Conclusions

The main contribution of the article is to propose a new efficient statistical modelling strategy through a flexible trigonometric extension of the famous power Lomax model. In this regard, we use the functionalities of the sine generalized (S-G) family of distributions and introduce the sine power Lomax (SPL) distribution. We exhibited some of its interesting characteristics, with an emphasis on the modelling ability of the corresponding probability density and hazard rate functions, and discussed the moments and incomplete moments. Simulations and applications illustrate the usefulness of the considered SPL model. In particular, we carried out nine practical datasets for the evaluation of the SPL model with the main existing models derived from the Lomax model. Whenever the data is symmetric or skewed, the SPL model performs better than the competing models considered. Thus, the results obtained are quite satisfactory, showing that the SPL model can be used fairly to efficiently analyse a large panel of datasets.

Author Contributions

V.B.V.N., R.V.V. and C.C. have contributed equally to this work. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Acknowledgments

The authors are very grateful to the two anonymous referees for all the constructive comments that improved this paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Freedman, D.A. Statistical Models: Theory and Practice; Cambridge University Press: Cambridge, UK, 2005; ISBN 978-0-521-67105-7. [Google Scholar]
  2. McCullagh, P. What is a statistical model ? Ann. Stat. 2002, 30, 1225–1310. [Google Scholar] [CrossRef]
  3. Brito, C.R.; Rêgo, L.C.; Oliveira, W.R.; Gomes-Silva, F. Method for generating distributions and classes of probability distributions: The univariate case. Hacet. J. Math. Stat. 2019, 48, 897–930. [Google Scholar]
  4. Kumar, D.; Singh, U.; Singh, S.K. A new distribution using sine function: its application to bladder cancer patients data. J. Stat. Appl. Probab. 2015, 4, 417–427. [Google Scholar]
  5. Souza, L. New Trigonometric Classes of Probabilistic Distributions. Ph.D. Thesis, Universidade Federal Rural de Pernambuco, Recife, Brazil, 2015. [Google Scholar]
  6. Souza, L.; Junior, W.R.O.; de Brito, C.C.R.; Chesneau, C.; Ferreira, T.A.E.; Soares, L. On the Sin-G class of distributions: theory, model and application. J. Math. Model. 2019, 7, 357–379. [Google Scholar]
  7. Souza, L.; Junior, W.R.O.; de Brito, C.C.R.; Chesneau, C.; Ferreira, T.A.E.; Soares, L. General properties for the Cos-G class of distributions with applications. Eurasian Bull. Math. 2019, 2, 63–79. [Google Scholar]
  8. Lee, C.; Famoye, F.; Olumolade, O. Beta-Weibull distribution: Some properties and applications to censored data. J. Mod. Appl. Stat. Methods 2007, 6, 173–186. [Google Scholar] [CrossRef]
  9. Nelson, W. Applied Life Data Analysis; John Wiley and Sons: New York, NY, USA, 1982. [Google Scholar]
  10. Bjerkedal, T. Acquisition of resistance in guinea pigs infected with different doses of virulent tubercle bacilli. Am. J. Hyg. 1960, 72, 130–148. [Google Scholar]
  11. Souza, L.; Gallindo, L.; Serafim-de-Souza, L. SinIW: The SinIW Distribution. R Package Version 0.2. 2016. Available online: https://CRAN.R-project.org/package=SinIW (accessed on 2 February 2021).
  12. Chesneau, C.; Bakouch, H.S.; Hussain, T. A new class of probability distributions via cosine and sine functions with applications. Commun. Stat. Simul. Comput. 2019, 48, 2287–2300. [Google Scholar] [CrossRef]
  13. Jamal, F.; Chesneau, C. A new family of polyno-expo-trigonometric distributions with applications, Infinite Dimensional Analysis. Quantum Probab. Relat. Top. 2019, 22, 1950027. [Google Scholar] [CrossRef] [Green Version]
  14. Mahmood, Z.; Chesneau, C.; Tahir, M.H. A new sine-G family of distributions: properties and applications. Bull. Comput. Appl. Math. 2019, 7, 53–81. [Google Scholar]
  15. Al-Babtain, A.A.; Elbatal, I.; Chesneau, C.; Elgarhy, M. Sine Topp-Leone-G family of distributions: Theory and applications. Open Phys. 2020, 18, 574–593. [Google Scholar] [CrossRef]
  16. Jamal, F.; Chesneau, C. The sine Kumaraswamy-G family of distributions. J. Math. Ext. 2021, in press. [Google Scholar]
  17. Rady, E.H.A.; Hassanein, W.A.; Elhaddad, T.A. The power Lomax distribution with an application to bladder cancer data. SpringerPlus 2016, 5, 1–22. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Lomax, K.S. Business Failures; Another example of the analysis of failure data. J. Am. Stat. Assoc. 1954, 49, 847–852. [Google Scholar] [CrossRef]
  19. Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions 1, 2nd ed.; Wiley: New York, NY, USA, 1994. [Google Scholar]
  20. Abdullah, M.A.; Abdullah, H.A. Estimation of Lomax parameters based on generalized probability weighted moment. J. King Abdulaziz Univ. Sci. 2010, 22, 171–184. [Google Scholar]
  21. Afaq, A.; Ahmad, S.P.; Ahmed, A. Bayesian Analysis of shape parameter of Lomax distribution under different loss functions. Int. J. Stat. Math. 2010, 2, 55–65. [Google Scholar]
  22. Ahsanullah, M. Record values of Lomax distribution. Stat. Ned. 1991, 41, 21–29. [Google Scholar] [CrossRef]
  23. Balakrishnan, N.; Ahsanullah, M. Relations for single and product moments of record values from Lomax distribution. Sankhya B 1994, 56, 140–146. [Google Scholar]
  24. Balkema, A.; de Haan, L. Residual life time at great age. Ann. Probability 1974, 2, 792–804. [Google Scholar] [CrossRef]
  25. Bryson, M.C. Heavy-tailed distributions: properties and tests. Technometrics 1974, 16, 61–68. [Google Scholar] [CrossRef]
  26. Ferreira, P.H.; Ramos, E.; Ramos, P.L.; Gonzales, J.F.B.; Tomazella, V.L.D.; Ehlers, R.S.; Silva, E.B.; Louzada, F. Objective Bayesian analysis for the Lomax distribution. Stat. Probab. Lett. 2020, 159, 108677. [Google Scholar] [CrossRef] [Green Version]
  27. Al-Marzouki, S.; Jamal, F.; Chesneau, C.; Elgarhy, M. Type II Topp Leone power Lomax distribution with applications. Mathematics 2020, 8, 4. [Google Scholar] [CrossRef] [Green Version]
  28. Fayomi, A. Type I half logistic power Lomax distribution: Statistical properties and application. Adv. Appl. Stat. 2019, 54, 85–98. [Google Scholar] [CrossRef]
  29. Hassan, A.S.; Abd-Allah, M. On the inverse power Lomax distribution. Ann. Data Sci. 2019, 6, 259–278. [Google Scholar] [CrossRef]
  30. Haq, M.A.; Srinivasa-Rao, G.; Albassam, M.; Aslam, M. Marshall-Olkin power lomax distribution for modeling of wind speed data. Energy Rep. 2020, 6, 1118–1123. [Google Scholar] [CrossRef]
  31. Abd El-Monsef, M.M.E.; Sweilam, N.H.; Sabry, M.A. The exponentiated power Lomax distribution and its applications. Qual. Reliab. Eng. Int. 2021. [Google Scholar] [CrossRef]
  32. Nagarjuna, V.B.V.; Vardhan, R.V.; Chesneau, C. Kumaraswamy generalized power Lomax distribution and its applications. Stats 2021, 4, 28–45. [Google Scholar] [CrossRef]
  33. Nair, N.U.; Sankaran, P.; Balakrishnan, N. Quantile-Based Reliability Analysis; Birkhäuser: Basel, Switzerland, 2013. [Google Scholar]
  34. Cordeiro, G.M.; Silva, R.B.; Nascimento, A.D.C. Recent Advances in Lifetime and Reliability Models; Bentham Books: Sharjah, United Arab Emirates, 2020. [Google Scholar]
  35. Casella, G.; Berger, R.L. Statistical Inference; Duxbury Advanced Series Thomson Learning: Pacific Grove, CA, USA, 2002. [Google Scholar]
  36. R Development Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2005; ISBN 3-900051-07-0. [Google Scholar]
  37. Konishi, S.; Kitagawa, G. Information Criteria and Statistical Modeling; Springer: New York, NY, USA, 2007. [Google Scholar]
  38. Oguntunde, P.E.; Khaleel, M.A.; Okagbue, H.I.; Odetunmibi, O.A. The Topp-Leone lomax (TLLO) distribution with applications to airbone communication transceiver dataset. Wirel. Pers. Commun. 2019, 109, 349–360. [Google Scholar] [CrossRef]
  39. Lemonte, A.J.; Cordeiro, G.M. An extended Lomax distribution. Statistics 2013, 47, 800–816. [Google Scholar] [CrossRef]
  40. Lee, E.T.; Wang, J.W. Statistical Methods for Survival Data Analysis; Wiley: New York, NY, USA, 2003. [Google Scholar] [CrossRef]
  41. Murthy, D.N.P.; Xie, M.; Jiang, R. Weibull Models; John Wiley and Sons: New York, NY, USA, 2004. [Google Scholar]
  42. Silva, R.V.; Silva, F.G.; Ramos, M.W.A.; Cordeiro, G.M. A new extended gamma generalized model. Int. J. Pure Appl. Math. 2015, 100, 309–335. [Google Scholar] [CrossRef] [Green Version]
  43. Proschan, F. Theoretical explanation of observed decreasing failure rate. Technometrics 1963, 5, 375–383. [Google Scholar] [CrossRef]
  44. Lorenzo, E.; Malla, G.; Mukerjee, H. A new test for decreasing mean residual lifetimes. Commun. Stat. Theory Methods 2018, 47, 2805–2812. [Google Scholar] [CrossRef]
  45. Bader, M.; Priest, A. Statistical aspects of fibre and bundle strength in hybrid composites. In Progress in Science and Engineering Composites; Hayashi, T., Kawata, K., Umekawa, S., Eds.; ICCM-IV: Tokyo, Japan, 1982; pp. 1129–1136. [Google Scholar]
  46. Nichols, M.D.; Padgett, W.J. A bootstrap control chart for Weibull percentiles. Qual. Reliab. Eng. Int. 2006, 22, 141–151. [Google Scholar] [CrossRef]
  47. Maguire, B.A.; Pearson, E.; Wynn, A. The time intervals between industrial accidents. Biometrika 1952, 39, 168–180. [Google Scholar] [CrossRef]
  48. Smith, R.L.; Naylor, J.C. A comparison of maximum likelihood and bayesian estimators for the three-parameter weibull distribution. Appl. Stat. 1987, 36, 258–369. [Google Scholar] [CrossRef]
Figure 1. Curves of the pdf of the SPL distribution at different parameter values.
Figure 1. Curves of the pdf of the SPL distribution at different parameter values.
Modelling 02 00005 g001
Figure 2. Curves of the hrf of the SPL distribution at different parameter values.
Figure 2. Curves of the hrf of the SPL distribution at different parameter values.
Modelling 02 00005 g002
Figure 3. (a) Plot of the estimated pdfs over the histogram and (b) PP plot of the SPL model for dataset 1.
Figure 3. (a) Plot of the estimated pdfs over the histogram and (b) PP plot of the SPL model for dataset 1.
Modelling 02 00005 g003
Figure 4. (a) Plot of the estimated pdfs over the histogram and (b) PP plot of the SPL model for dataset 2.
Figure 4. (a) Plot of the estimated pdfs over the histogram and (b) PP plot of the SPL model for dataset 2.
Modelling 02 00005 g004
Figure 5. (a) Plot of the estimated pdfs over the histogram and (b) PP plot of the SPL model for dataset 3.
Figure 5. (a) Plot of the estimated pdfs over the histogram and (b) PP plot of the SPL model for dataset 3.
Modelling 02 00005 g005
Figure 6. (a) Plot of the estimated pdfs over the histogram and (b) PP plot of the SPL model for dataset 4.
Figure 6. (a) Plot of the estimated pdfs over the histogram and (b) PP plot of the SPL model for dataset 4.
Modelling 02 00005 g006
Figure 7. (a) Plot of the estimated pdfs over the histogram and (b) PP plot of the SPL model for dataset 5.
Figure 7. (a) Plot of the estimated pdfs over the histogram and (b) PP plot of the SPL model for dataset 5.
Modelling 02 00005 g007
Figure 8. (a) Plot of the estimated pdfs over the histogram and (b) PP plot of the SPL model for dataset 6.
Figure 8. (a) Plot of the estimated pdfs over the histogram and (b) PP plot of the SPL model for dataset 6.
Modelling 02 00005 g008
Figure 9. (a) Plot of the estimated pdfs over the histogram and (b) PP plot of the SPL model for dataset 7.
Figure 9. (a) Plot of the estimated pdfs over the histogram and (b) PP plot of the SPL model for dataset 7.
Modelling 02 00005 g009
Figure 10. (a) Plot of the estimated pdfs over the histogram and (b) PP plot of the SPL model for dataset 8.
Figure 10. (a) Plot of the estimated pdfs over the histogram and (b) PP plot of the SPL model for dataset 8.
Modelling 02 00005 g010
Figure 11. (a) Plot of the estimated pdfs over the histogram and (b) PP plot of the SPL model for dataset 9.
Figure 11. (a) Plot of the estimated pdfs over the histogram and (b) PP plot of the SPL model for dataset 9.
Modelling 02 00005 g011
Table 1. Moments of a random variable X with SPL distribution for different parameter values.
Table 1. Moments of a random variable X with SPL distribution for different parameter values.
Parameters α E ( X ) E ( X 2 ) E ( X 3 ) E ( X 4 ) Var
β = 1.15    λ = 0.05 52.09829588.058780546.731802380.7505653.6559354
101.09659612.10214575.76665020.8432740.8996228
150.75936550.99418831.8350144.3899990.4175523
200.58696200.59005770.8303891.5033600.2455333
β = 0.95    λ = 0.15 2.51.996964310.875856141.2282745891.0006286.8879901
3.51.29509624.06576224.541340276.7933582.3884878
41.09910662.83847113.451765110.2083861.6304353
60.68022991.0191792.5697889.9060190.5564665
Table 2. Results of the simulation study for the SPL model.
Table 2. Results of the simulation study for the SPL model.
n α ^ β ^ λ ^
MMLEBiasMSEMMLEBiasMSEMMLEBiasMSE
Set I
501.1088390.608839111.326781.6142030.11420340.2434570.71969260.21969266.40414
1000.60115190.10115190.14363951.5401660.040166370.083033070.5475550.047555020.1272187
2000.52930780.029307760.030479321.5267630.02676280.035829710.52899760.028997620.04151895
3000.51331770.013317680.013828651.5222770.022276880.02218750.51968450.019684550.02336737
5000.51289720.012897240.0080659971.507050.0070499750.012709580.50598560.00598560.01260153
Set II
506.4689015.218901159.09661.3295570.079556830.10835181.1101710.610170537.25226
1003.5935582.34355854.053081.2657950.015795270.032095070.55246360.052463650.2133836
2001.8590370.60903715.8170421.2577940.0077942380.017955860.52492960.02492960.1045979
3001.5144970.26449691.5942991.2571450.0071453330.011711770.52571280.025712770.06209381
5001.383470.133470.41422431.2514820.0014822670.0064784370.50395280.00395280.02902843
Set III
509.3618447.861844305.60111.590880.090879550.14091310.92029530.42029537.984993
1004.7591373.25913772.483321.5193140.019314410.053901780.57788750.077887490.42830
2002.4679680.967968314.134031.5212030.021203010.022782870.54433010.044330060.1120899
3002.0067220.50672194.9193881.5065620.0065624970.016530960.51644820.016448210.07516359
5001.664710.16471040.49009111.5070260.0070258080.0090091180.51547910.015479070.03658865
Set IV
508.9115217.411521314.7562.6754210.17542050.39284051.1047840.604783937.61909
1004.9480693.448069100.05952.5460410.046041290.142320.59197490.091974950.3972336
2002.4779610.97796114.330942.5295030.029503040.073705630.55406130.054061250.1307966
3001.8774040.37740442.7989782.5180450.018044880.045785720.53307290.033072890.07655854
5001.7647780.26477821.4385292.5011130.001112610.026427560.4970354-0.0029645610.03740574
Table 3. Competitive models of the SPL model.
Table 3. Competitive models of the SPL model.
ModelsAbbreviationsCdfs ( x > 0 )References
Topp-Leone LomaxTLGL 1 ( 1 + α x ) 2 β λ [38]
power LomaxPL 1 1 + λ x β α [17]
exponentiated LomaxEL 1 β x + β α λ [39]
LomaxLomax 1 β x + β λ [18]
Table 4. Descriptive statistics of dataset 1.
Table 4. Descriptive statistics of dataset 1.
MeanMedianVarianceSkewnessKurtosisMinimumMaximum
9.365626.395110.4253.2865715.483080.0879.05
Table 5. Goodness of fit measures of the models for dataset 1
Table 5. Goodness of fit measures of the models for dataset 1
ModelsW*A* D n p-ValueAICCAICBICHQIC
SPL0.01860.12390.03490.9977825.3925825.5861833.9486828.8689
PL0.01950.13080.03510.9974825.4798825.6733834.0359828.9562
TLGL0.02830.19020.04050.9847826.1436826.3372834.6997829.6200
EL0.02830.19020.04040.9847826.1436826.3372834.6997829.6200
Lomax0.08070.48760.09660.1831831.6658831.7618837.3698833.9834
Table 6. MLEs of the model parameters for dataset 1 (in parenthesis are the SEs).
Table 6. MLEs of the model parameters for dataset 1 (in parenthesis are the SEs).
Models α β λ
SPL1.0216200 (0.45875225)1.3956063 (0.18303304)0.0371991 (0.01408465)
PL2.070725 (0.9705209)1.427499 (0.1782097)34.861099 (13.9162924)
TLGL1.586149 (0.2798032)2.292993 (1.1137263)24.744613 (16.6935617)
EL4.589053 (2.2316031)24.763807 (16.7230668)1.586145 (0.2798554)
Lomax-13.96063 (15.45659)121.24393 (143.40888)
Table 7. Descriptive statistics of dataset 2
Table 7. Descriptive statistics of dataset 2
MeanMedianVarianceSkewnessKurtosisMinimumMaximum
2.557452.35451.251770.09949−0.652320.044.663
Table 8. Goodness of fit measures of the models for dataset 2.
Table 8. Goodness of fit measures of the models for dataset 2.
ModelsW*A* D n p-ValueAICCAICBICHQIC
SPL0.06260.64470.05630.9531268.5388268.8388275.8312271.4703
PL0.10310.96860.10610.3016275.1259275.4259282.4184278.0574
EL0.23931.87770.12360.1536288.6155288.9155295.9079291.5470
TLGL0.24651.92320.12040.1751289.4639289.7639296.7563292.3954
Lomax0.19331.58240.30772.49 × 10 7 337.4818337.6299342.3434339.4361
Table 9. MLEs of the model parameters for dataset 2 (in parenthesis are the SEs).
Table 9. MLEs of the model parameters for dataset 2 (in parenthesis are the SEs).
Models α β λ
SPL2.97275466 (1.266693742)2.44917417 (0.234312750)0.01610661 (0.007105774)
PL2.510918 (1.0039915)2.501948 (0.2813778)24.858636 (8.8454850)
EL24.107930 (13.9109419)30.212370 (18.6585652)3.661293 (0.6506768)
TLGL3.721336 (0.7759183)9.745047 (5.5473841)24.585348 (16.2933590)
Lomax-8.650051 (3.207235)21.150309 (8.180986)
Table 10. Descriptive statistics of dataset 3.
Table 10. Descriptive statistics of dataset 3.
MeanMedianVarianceSkewnessKurtosisMinimumMaximum
76.81481634059.3110.80235−0.426691216
Table 11. Goodness of fit measures of the models for dataset 3.
Table 11. Goodness of fit measures of the models for dataset 3.
ModelsW*A* D n p-ValueAICCAICBICHQIC
SPL0.03790.28110.10230.9399296.0297297.0732299.9172297.1857
EL0.13960.92520.16090.4865304.4668305.5103308.3543305.6228
TLGL0.16231.06830.17380.3880306.2415307.285310.129307.3975
PL0.09320.63680.23450.1025309.2454310.2889313.133310.4014
Lomax0.10520.70900.21610.1605306.0443306.5443308.6359306.8149
Table 12. MLEs of the model parameters for dataset 3 (in parenthesis are the SEs).
Table 12. MLEs of the model parameters for dataset 3 (in parenthesis are the SEs).
Models α β λ
SPL1.382763741 (0.6448660998)1.221321892 (0.1405012622)0.002328135 (0.0004797677)
EL1.123151 (0.3229477)17.681731 (10.9965985)2.336629 (0.7915584)
TLGL2.8564521 (1.0394244)0.5234784 (0.1417636)11.9763851 (7.9755583)
PL1.1193937 (0.5529827)0.8687552 (0.1596078)24.1383129 (10.1687168)
Lomax-0.9108902 (0.2758631)29.3494386 (12.4143031)
Table 13. Descriptive statistics of dataset 4.
Table 13. Descriptive statistics of dataset 4.
MeanMedianVarianceSkewnessKurtosisMinimumMaximum
1.488021.4840.37021.241915.468690.03124.32
Table 14. Goodness of fit measures of the models for dataset 4.
Table 14. Goodness of fit measures of the models for dataset 4.
ModelsW*A* D n p-ValueAICCAICBICHQIC
SPL0.12180.86880.07200.8614131.0550131.4187137.8005133.7344
PL0.13450.94500.07780.7909131.9917132.3553138.7372134.6711
TLGL0.40602.52910.14430.1085152.7280153.0916159.4734155.4073
EL0.42652.64400.14400.1098153.4122153.7758160.1577156.0916
Lomax0.32132.05400.35544.18 × 10 8 204.3163204.4954208.8133206.1026
Table 15. MLEs of the model parameters for dataset 4 (in parenthesis are the SEs).
Table 15. MLEs of the model parameters for dataset 4 (in parenthesis are the SEs).
Models α β λ
SPL1.78152330 (1.05814438)3.06914764 (0.43240005)0.08628047 (0.05177348)
PL3.457354 (2.0440478)3.162505 (0.4336058)13.575912 (7.9684278)
TLGL4.817675 (0.9783213)12.333991 (7.2953270)16.079007 (10.2327362)
EL21.972027 (12.564484)13.246999 (7.875189)5.455173 (1.080415)
Lomax-12.35939 (5.819302)17.92672 (8.717093)
Table 16. Descriptive statistics of dataset 5.
Table 16. Descriptive statistics of dataset 5.
MeanMedianVarianceSkewnessKurtosisMinimumMaximum
2.62142.71.027960.368150.104940.395.56
Table 17. Goodness of fit measures of the models for dataset 5.
Table 17. Goodness of fit measures of the models for dataset 5.
ModelsW*A* D n p-ValueAICCAICBICHQIC
SPL0.07150.39490.06280.8248288.6900288.9400296.5055291.8530
PL0.17500.89140.12570.0848296.9140297.1640304.7295300.0770
EL0.25491.34620.11030.1751300.7922301.0422308.6077303.9553
TLGL0.27061.43680.11310.1552302.1661302.4161309.9816305.3292
Lomax0.16760.86050.31395.52 × 10 9 405.1160405.2397410.3263407.2247
Table 18. MLEs of the model parameters for dataset 5 (in parenthesis are the SEs).
Table 18. MLEs of the model parameters for dataset 5 (in parenthesis are the SEs).
Models α β λ
SPL2.55459370 (0.908492280)2.93269704 (0.268147463)0.01073138 (0.003521699)
PL1.624010 (0.5246620)3.169221 (0.3380815)29.455632 (8.5643898)
TLGL25.408341 (15.707237)22.975388 (15.610496)8.504096 (1.789886)
EL8.964875 (1.934670)8.283858 (3.997527)14.222593 (7.879147)
Lomax-9.946361 (3.517630)25.833924 (9.683001)
Table 19. Descriptive statistics of dataset 6.
Table 19. Descriptive statistics of dataset 6.
MeanMedianVarianceSkewnessKurtosisMinimumMaximum
233.321114587873.332.95729.9943911630
Table 20. Goodness of fit measures of the models for dataset 6.
Table 20. Goodness of fit measures of the models for dataset 6.
ModelsW*A* D n p-ValueAICCAICBICHQIC
SPL0.08110.50280.06460.75341407.7121407.9411415.7861410.986
EL0.55253.14870.12460.06791442.1151442.3441450.1891445.389
TLGL0.57313.26690.13000.04991443.5601443.7891451.6341446.834
PL0.23741.33740.19170.00061458.1611458.391466.2351461.436
Lomax0.37752.14250.21140.00011463.4461463.5591468.8291465.629
Table 21. MLEs of the model parameters for dataset 6 (in parenthesis are the SEs).
Table 21. MLEs of the model parameters for dataset 6 (in parenthesis are the SEs).
Models α β λ
SPL1.667282340 (0.7019358659)0.985393302 (0.0964258687)0.002185021 (0.0001928461)
EL0.7859451 (0.09768566)11.4402958 (8.24677574)3.8369019 (1.53466623)
TLGL4.3433527 (3.63593087)0.4023303 (0.07476114)10.1888715 (15.43343884)
PL1.0290758 (0.22032790)0.7672704 (0.06388817)30.6523845 (6.75638611)
Lomax-0.5771954 (0.07560304)30.9556050 (6.48208435)
Table 22. Descriptive statistics of dataset 7.
Table 22. Descriptive statistics of dataset 7.
MeanMedianVarianceSkewnessKurtosisMinimumMaximum
1.959241.736152.477411.979565.160790.02519.096
Table 23. Goodness of fit measures of the models for dataset 7.
Table 23. Goodness of fit measures of the models for dataset 7.
ModelsW*A* D n p-ValueAICCAICBICHQIC
SPL0.08570.51280.08160.6620248.7158249.0491255.7080251.5102
PL0.09240.55130.08650.5904249.0608249.3941256.0530251.8552
TLGL0.11790.70570.08450.6184250.9647251.2980257.9569253.7591
EL0.11830.70830.09080.528319251.0226251.3559258.0148253.8170
Lomax0.11620.69280.17550.016153260.8785261.0429265.540262.7415
Table 24. MLEs of the model parameters for dataset 7 (in parenthesis are the SEs).
Table 24. MLEs of the model parameters for dataset 7 (in parenthesis are the SEs).
Models α β λ
SPL1.5772126 (1.0282246)1.5747198 (0.2344551)0.1439615 (0.1010120)
PL3.720675 (3.720675)1.583297 (0.2352289)10.034301 (8.7770949)
TLGL1.870763 (0.3248518)6.903672 (6.2287549)17.059630 (16.7566780)
EL12.677044 (10.829670)16.160300 (15.788961)1.821291 ( 0.344925)
Lomax-11.57571 (6.638425)21.51162 (13.080670)
Table 25. Descriptive statistics of dataset 8.
Table 25. Descriptive statistics of dataset 8.
MeanMedianVarianceSkewnessKurtosisMinimumMaximum
2.085272.0651.550590.43959−0.267410.0465.14
Table 26. Goodness of fit measures of the models for dataset 8.
Table 26. Goodness of fit measures of the models for dataset 8.
ModelsW*A* D n p-ValueAICCAICBICHQIC
SPL0.10690.65230.098440.5418207.4985207.9052213.9279210.0272
PL0.14790.90250.11700.3283210.1077210.5145216.5371212.6364
EL0.22871.38370.16130.0672214.9548215.3616221.3842217.4835
TLGL0.24731.49550.15490.0875216.3146216.7214222.744218.8433
Lomax0.22111.33800.21650.0045227.3478227.5478231.634229.0336
Table 27. MLEs of the model parameters for dataset 8 (in parenthesis are the SEs).
Table 27. MLEs of the model parameters for dataset 8 (in parenthesis are the SEs).
Models α β λ
SPL4.98613008 (3.14343174)1.67430554 (0.18377985)0.02945598 (0.01925146)
PL4.607661 (2.4059839)1.771149 (0.1990476)17.766353 (10.1037955)
EL20.786925 (16.9150052)26.841521 (21.8605907)2.035379 (0.3637032)
TLGL1.994239 (0.3651147)5.493725 (3.0004375)14.121823 (8.4044275)
Lomax-8.558363 (3.989779)16.854870 (8.309326)
Table 28. Descriptive statistics of dataset 9.
Table 28. Descriptive statistics of dataset 9.
MeanMedianVarianceSkewnessKurtosisMinimumMaximum
1.506831.590.10506−0.899930.923760.552.24
Table 29. Goodness of fit measures of the models for dataset 9.
Table 29. Goodness of fit measures of the models for dataset 9.
ModelsW*A* D n p-ValueAICCAICBICHQIC
SPL0.26371.44440.16390.067836.9581937.3649743.3875939.4869
PL0.41572.29160.22320.003846.0943446.5011252.5237548.62306
TLGL0.80174.36910.22580.003270.4496370.8564176.8790472.97835
EL0.82224.47670.22630.003171.8244272.231278.2538274.35313
Lomax0.58543.21010.42103.98 × 10 10 186.006186.206190.2923187.6918
Table 30. MLEs of the model parameters for dataset 9 (in parenthesis are the SEs).
Table 30. MLEs of the model parameters for dataset 9 (in parenthesis are the SEs).
Models α β λ
SPL3.02842979 (1.619006155)5.77611926 (0.666909675)0.01281263 (0.007095599)
PL1.945485 (0.6988577)6.010487 (0.7239624)23.913933 (8.2279128)
TLGL32.13406 (10.38371)24.84223 (19.00308)18.21983 (14.95689)
EL27.01510 (15.272308)9.35084 (5.989516)35.10721 (11.975072)
Lomax-13.77391 (6.867307)19.96670 (9.995415)
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Nagarjuna, V.B.V.; Vardhan, R.V.; Chesneau, C. On the Accuracy of the Sine Power Lomax Model for Data Fitting. Modelling 2021, 2, 78-104. https://0-doi-org.brum.beds.ac.uk/10.3390/modelling2010005

AMA Style

Nagarjuna VBV, Vardhan RV, Chesneau C. On the Accuracy of the Sine Power Lomax Model for Data Fitting. Modelling. 2021; 2(1):78-104. https://0-doi-org.brum.beds.ac.uk/10.3390/modelling2010005

Chicago/Turabian Style

Nagarjuna, Vasili B. V., R. Vishnu Vardhan, and Christophe Chesneau. 2021. "On the Accuracy of the Sine Power Lomax Model for Data Fitting" Modelling 2, no. 1: 78-104. https://0-doi-org.brum.beds.ac.uk/10.3390/modelling2010005

Article Metrics

Back to TopTop