Next Article in Journal
Bayesian and Frequentist Approaches for a Tractable Parametric General Class of Hazard-Based Regression Models: An Application to Oncology Data
Next Article in Special Issue
Representative Points from a Mixture of Two Normal Distributions
Previous Article in Journal
An Active School Transport Instrument to Measure Parental Intentions: The Case of Indonesia
Previous Article in Special Issue
Huber Regression Analysis with a Semi-Supervised Method
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Quantitative Analysis of the Balance Property in Factorial Experimental Designs 24 to 28

by
Ricardo Ramírez-Tapia
1,
Armando Javier Ríos-Lira
2,*,†,
Yaquelin Verenice Pantoja-Pacheco
2,
José Antonio Vázquez-López
2 and
Edgar Augusto Ruelas-Santoyo
2
1
Doctorado en Ciencias de la Ingeniería, Tecnológico Nacional de México en Celaya, Celaya 38010, Guanajuato, Mexico
2
Tecnológico Nacional de México/Instituto Tecnológico de Celaya, Celaya 38010, Guanajuato, Mexico
*
Author to whom correspondence should be addressed.
Departamento de Ingeniería Industrial, Tecnológico Nacional de México/Instituto Tecnológico de Celaya, Antonio García Cubas No. 600, Fovissste, Celaya 38010, Guanajuato, Mexico.
Submission received: 27 August 2022 / Revised: 20 September 2022 / Accepted: 21 September 2022 / Published: 15 October 2022
(This article belongs to the Special Issue Distribution Theory and Application)

Abstract

:
Experimental designs are built by using orthogonal balanced matrices. Balance is a desirable property that allows for the correct estimation of factorial effects and prevents the identity column from aliasing with factorial effects. Although the balance property is well known by most researchers, the adverse effects caused by the lack or balance have not been extensively studied or quantified. This research proposes to quantify the effect of the lack of balance on model term estimation errors: type I error, type II error, and type I and II error as well as R2, R2adj, and R2pred statistics under four balance conditions and four noise conditions. The designs considered in this research include 24–28 factorial experiments. An algorithm was developed to unbalance these matrices while maintaining orthogonality for main effects, and the general balance metric was used to determine four balance levels. True models were generated, and a MATLAB program was developed; then a Monte Carlo simulation process was carried out. For each true model, 50,000 replications were performed, and percentages for model estimation errors and average values for statistics of interest were computed.

1. Introduction

An experiment can be defined as a test or a series of tests in which deliberate changes are made to the input variables of a process to observe the reasons for the changes in the output variable [1]. Figure 1 shows the representation of a process where inputs are affected by both controllable ( x p ) and uncontrollable ( z q ) factors, having an impact on the output or response variable ( y ) .
Factorial designs are commonly used because they contain all possible combinations of factors levels and can estimate factorial effects and interactions. In addition, they are commonly used as factor-filtering designs to identify the factors that have the greatest significance or influence on the response variable.
Two commonly used types of factorial designs include the 2 k and 3 k (where k represents the number of factors to evaluate, and the base represents the number of levels). Other designs include two-level fractional factorial designs, three-level fractional factorial designs, and mixed level designs. This research focuses only on two-level factorial designs, also called 2k designs.
The 2k factorial design is a design in which each factor has two levels commonly called high and low, represented by 1 and −1 respectively. These designs can be used to analyze both quantitative and qualitative factors [1]. The design matrix is an orthogonal array with 2 k runs and k columns; this is illustrated in Table 1.
A design is balanced when each column contains the same number of runs for each factor level [2]. According to Birkes et al. (1999), the balance exists when there is an equal number of observations for all combinations of factor levels [3]. Ríos (2009) mentions that balance is an important property in experimental design, and a design is balanced when in each column each possible factor level appears the same number of times [4].
Balanced experiments have two major advantages when it comes to analysis. First, each experimental combination is estimated with the same precision. Secondly, in a balanced factorial experiment, the effect of each factor can be evaluated independently from other factors [5]. Pantoja et al. (2009), point out that balance is an important property because it prevents the main effects and interactions from aliasing with the interception column β 0 [6]. Balance also plays another important role; it leads to a considerable simplification of the calculations [7].
Balance property has been extensively mentioned in literature. In 1935, the term balance was already included in articles published by Yates; in these articles, it was mentioned that to avoid sacrificing all the information about a possibly important interaction, a balanced arrangement was used wherein the interaction was confused as little as possible. In 1938, Yates conducted research evaluating the extent to which a balanced experimental design on human nutrition resulted in an efficiency gain, compared to several alternative randomized block designs that could have been used to solve the same problem [8]. In 1965, Nelder introduced the concept of general equilibrium for multi-stratum experiments. He mentions that only in general equilibrium designs can information on any treatment effect be obtained and combined from more than one [9]. In 1972, John and Smith introduced the concept of orthogonal factorial structure for factorial designs; they mentioned that if all the estimation is done in the lowest stratum, only in these designs can the factorial effects be estimated independently [10]. Because of this feature, several later research papers were restricted to this type of design (see, for example, Cotter et al. (1973) [11], John (1981) [12], Mukerjbe (1979) [13], Mukerjee (1981) [14] and Gupta (1983) [15]). In 1978, different existing methods for analyzing unbalanced designs were evaluated by using the computer programs of the time [16]. In 1986, David G. Herr published an article in which he talks about the history of unbalanced designs and the methods to deal with them [17].
In 1993, Shaw and Mitchel mention that unfortunately, for various reasons, it is rare for a biological or ecological study to have fully balanced data, and that unbalance requires care in analysis and interpretation. They also try to explain some of the consequences of unbalance and give some guidelines for analyzing unbalanced data for models involving fixed effects [18]. In 1996, Huber and Zwerina mentioned the importance of balance in optimal experimental designs [19]. In 2001, balanced experimental designs were used in conjunction with microarray models to choose an optimal experimental design suitable for gene expression [20]. In 2009, Guo et al. presented the general metric balance, a parameter that explains how to measure the degree of balance in a factorial design [21].
In 2010, Andy Hector et al. performed a study of variance for unbalanced data in the areas of ecology and evolution. They summarized the main developments to deal with unbalanced designs and highlighted the search for the right ANOVA method by which to present one or several models that best fit the objectives of the analysis [22].
Ríos et al. (2011) present a sequential experimentation approach to increase Resolution III fractions. This method was able to overcome the drawbacks of the general methods while maintaining some of their benefits. They mention the importance of maintaining balance and orthogonality [23]. Landsheer and Van Den Wittenboer in 2015 compared balanced and unbalanced 2 × 2 designs. They used ANOVA correction methods with the sum of squares type II and type III. The purpose was to determine which type of SS provides lower H0 rejection rates, in their study. They compared 2 × 2 balanced and 2 × 2 unbalanced designs with an interaction to determine whether ANOVA correction methods provide satisfactory results in the presence of an interaction. In this study, they compared the methods for calculating ANOVA with balanced data, i.e., with the SS I method, to calculate ANOVA with unbalanced designs they used the ANOVA SS II and SS III methods. They concluded that ANOVA with SS II only works satisfactorily for unbalanced designs when an interaction can be excluded, whereas the application of SS III, when there is an interaction, showed a power about 1% to 5% lower than for the balanced datasets that were simulated. The application of SS II cannot be recommended when there is an interaction in an unbalanced design [24].
Voelkel in 2019 uses systematic methods to create optimal designs by using the balance property and orthogonal arrays [25]. Balanced experiments have two major advantages when it comes to analysis. First, each experimental combination is estimated with the same precision. Secondly, in a balanced factorial experiment, the effect of the intercept or identity column can be evaluated independently from the other factors [5].
Currently, one of the key factors for the success of any organization, whether industrial or service, is to improve and make its products and processes more efficient. This is where experimentation comes in, as it is one of the elements that can contribute the most to the improvement of products and processes. The use of design of experiments (DOE) is an effective tool for understanding and improving processes and products in the industry.
However, in industrial experimentation, it is not uncommon to find that companies resort to unbalanced experimental designs. These designs usually occur due to several circumstances. One such circumstance may arise when the experimenter has made a mistake at the time of experimenting or because some experimental conditions cannot be created once the experiment is running. For example, this often occurs when a machine is unavailable due to a breakdown or failure, or when a batch of raw material is unavailable, either due to a shortage of material or a change of supplier. Another event that happens in industries is the fact that an operator who was being taken as a level of the experiment is missing. It also happens in industries such as the agri-food industry that, due to weather conditions or natural disasters, crops that could be under experimentation are lost or pests arise that are difficult to destroy.
In situations such as this, experimenters may be interested in knowing the consequences of running an unbalanced design. With this research, experimenters will be able to answer whether it is feasible to run an unbalanced design and the consequences of running it. Experimenters will also be able to find out what levels of imbalance are most favorable for the experiment if it is necessary to use an unbalanced design.
In this investigation, we worked with designs 2 4 to 2 8 by using four different balance levels. The levels included: balanced, low unbalance, medium unbalance, and high unbalance. The general balance metric (GBM) was used to measure the balance property given that it is simple and effective minimum aberration criterion that measures the balance property of experimental designs. GBM is a parameter that determines the degree of balance of a given design matrix. According to Guo et al. (2009), GBM can be calculated for a matrix d of size n × k , where n is the number of rows and k is the number of columns, d t ( t = 1 , , k ) indicates the columns of interactions for the factors from 1 to k , and d 1 represents the main effects matrix [21].
As shown in Equation (1), GBM considers the difference between the number of times that a factor level appears in a column and the number of times that it should appear if the design were balanced. Equation (2) shows how these quantities are squared and added for each submatrix of main effects two-factor interactions and three-factor interactions. Finally, an array consisting of k numbers is obtained (Equation (3)) where H 1 indicates the balance for main effects, H 2 for 2-factor interactions, and H k for k-factor interactions. When a design is balanced for main effects, two-factor interactions, and three-factor interactions, the GBM is a vector of zeros ( 0 , 0 , 0 ) [26]. Alternatively, these three quantities can be added to obtain a single number. If this number is zero, the model matrix is completely balanced. The higher this number, the more unbalanced the design will be.
Therefore,
H j t = r = 1 l j t ( C r j t n l j t ) 2
H t = j = 1 ( t k ) H j t = j = 1 ( t k ) r = 1 l j t ( C r j t n l j t ) 2
G B M = ( H 1 , H 2 , , H k ) ,
where
C r j t   is   the number of times a level r appears in column j , and
l j t is the number of levels that the column j contains.
The design matrices are governed by another property in addition to balance. This property is known as orthogonality. Orthogonality ensures that the effects of the different factors to which the experimental material is subjected. Orthogonality can be estimated separately and without confusion [27]. In other words, orthogonality ensures that effects can be estimated independently so each column provides different information to the design [6].
The orthogonality property can be measured through several highly sophisticated criteria such as minimum generalized aberration [28], minimum aberration moment [29], moment aberration projection [30], J2-optimality [31], J2-modified optimality, variance inflation factors (VIF), dot product, simple correlation, and Pearson’s correlation index. For practical purposes, we used VIF to measure orthogonality. VIF is a measure of the amount of multicollinearity in a set of multiple regression variables. Experience indicates that if any of the VIF are greater than 5, it is an indication that the associated regression coefficients are poorly estimated due to multicollinearity. VIF is calculated as follows,
V I F i = 1 1 R i 2 ,
where R i 2 is the coefficient of determination of the auxiliary regression of the variable xi.
To complete this research, we made use of Monte Carlo simulation. The first step consisted of the creation of true models. A true model is an equation that relates a response variable, also known as dependent variable, to a set of regressors or independent variables [32] (see Equation (5)). Independent variables may also include two-factor interactions given that these interactions occur frequently in practice. To establish this relationship, two characteristics of the regression coefficients are determined by the experimenter, magnitude (size of the coefficient), and direction (sign of the coefficient). In addition, a random error from a normal distribution with mean zero and variance σ2 is introduced to emulate the variability that occurs naturally in the process. The level of significance of the independent variables is directly related to the size of the regression coefficient with respect to the variance of the random error.
It is called a true model because the variables involved, the sizes and signs of the regression coefficients, the level of noise, and the level of significance are determined by the experimenter. True models are useful because they permit the user to simulate data, to manipulate this data, and to observe changes in the regression coefficient estimates with respect to their true values.
The general true model structure depicted in Equation (5) was chosen according to the sparsity of the effects principle, which establishes that most processes are affected by main effects and low order interactions (2FIs and 3FIs):
y = β 0 + β 1 x 1 + β 2 x 2 + β 3 x 3   + β 12 x 1 x 2 + β 123 x 1 x 2   x 3 + ε .
Several unbalanced matrices were created, and their response variables were generated by using the true models previously described; then these arrays were introduced in a MATLAB computer program to perform simulations. The Monte Carlo method is used to solve problems derived from stochastic phenomena, in which physical experimentation is impracticable, and the creation of an exact formula is impossible; it is also adapted to the resolution of complex deterministic problems [33].
The method by which the datasets were analyzed was stepwise regression. The stepwise regression procedure starts by choosing an equation that contains the best variable X and then tries to build with the subsequent additions of X ’s one at a time, as long as it is worthwhile. The order of addition is determined by using the partial values of the F test to select which variable to enter next. The highest partial F-value is compared with an F-value to enter (selected or default). After a variable is added, the equation is examined to see if any variables should be removed [34]. In addition to regression coefficients estimates, several performance indicators were analyzed. These include the following.
(i)
% No error: indicates the efficiency of the method in picking up the right model:
%   no   error   = number   of   models   without   error × 100   number   of   simulated   models .
(ii)
Type I error [30] in statistics means the number of insignificant terms in the model. %Type I error was calculated and measured as the percentage of times that the models showed insignificant terms during the simulations [35]:
%   Type   I   error = number   of   models   with   insignificant   terms × 100 number   of   simulated   models .
(iii)
Type II error is the number of significant terms missing in the model [36]. % Type II measure the percentage of times that the models showed significant missing terms during the simulations:
%   Type   II   error =   number   of   models   with   missing   terms × 100 number   of   simulated   models .
(iv)
Type I and II error refers to a situation in which both errors are present. % Type I and II error measure the percentage of times that the models presented insignificant and significant missing terms in the model during the simulations:
  %   Type   I   and   II   error = number   of   models   with   insignificant   and   significant   missing   terms × 100   number   of   simulated   models .
(v)
R2 is defined as the proportion of the variability in the response that can be explained by the regressors included in the model (see Equation (10)). Clearly, values closer to 1 are more desirable. We have
R 2 = S S M o d e l   S S T o t a l ,
where
S S M o d e l is the sum of squares corresponding to the model, and
S S T o t a l is the total sum of squares.
The use of AIC and BIC statistics would be a good complement to determine if the best model has been chosen, but the computational complexity would increase, making the simulations longer; however it would be interesting for future research to see how these indicators behave in the absence of balance.
(vi)
The R2adj is a variation of the ordinary R2 statistic that reflects the number of factors in the model (Equation (11)). It can be a useful statistic for more complex experiments with several design factors when we wish to evaluate the impact of increasing or decreasing the number of model terms [1]. We have
R 2 a d j = 1 S S E / d f E S S T o t a l d f T o t a l   ,
where d f E and d f T o t a l correspond to degrees of freedom (Equation (12)) from error and total degrees of freedom (Equation (13)), respectively. We have
d f E = n k 1
d f T o t a l = n 1 ,
where:
n is the number of treatments in the experiment, and
k is the number of significant terms in the regression model.
(vii)
R2 can be calculated for prediction based on the PRESS (see Equation (14)). Values closer to 1 for this statistic are desirable because this indicates that the model will give reasonable performance in prediction [1]. PRESS stands for prediction error sum of squares; it is a measure of how accurate the model is to predict future observations. Small values of PRESS are desirable. We have
R 2 p r e d = 1 P R E S S   S S T o t a l .

2. Method

This research was conducted in four stages, which are represented in Figure 2.

2.1. Stage 1 Generate Unbalanced and Orthogonal Design Matrices for 2k Experiments

The first stage consisted of generating arrays with four balance conditions that preserve the orthogonality property. Given that the objective was to quantify the effect of the lack of balance on the errors and statistics described in the previous section, we did not want orthogonality to be affected. By conducting the experiments in this way, we ensured that the effect on the statistics can be attributed purely to the lack of balance.
An algorithm was developed to alter the signs in a of two-level factorial experimental design to create arrays that are unbalanced but maintain orthogonality for main effects. This algorithm is based on the additive multiplicative properties of equality.
These properties refer to the fact that if we change the sign of the elements of an array, that is, multiplying by minus −1 each element in a row, the array will continue being orthogonal because each element in the row has been multiplied by the same sign, thus respecting the property of multiplicative equality. In addition, by changing the sign of each element of the row, the balance of the matrix will be affected. This procedure would unbalance a design matrix while maintaining the orthogonality property. The metric that we used to measure the level of balance in an array was GBM. By using this metric, we were able to determine different degrees for the lack of balance.
Figure 3 shows a design matrix with the respective VIFs and GBM to measure orthogonality and balance. In this figure, it can be seen that the design matrix is balanced because the GBM takes a value of 0; on the contrary, when the GBM takes a value greater than zero, the array is unbalanced. VIFs have also been observed in 1 so it is an orthogonal design.
Figure 4 shows how the matrix can be unbalanced by multiplying 2 rows by −1. This figure shows the design matrix shown in Figure 3 with two modified lines. The lines that were modified are 2 and 4 respectively. After the modifications, the GBM takes a value of 16 which indicates that the matrix is now unbalanced. The table also shows that the VIFs are less than or equal to 5; this means that the design matrix is still orthogonal for main effects.
An analysis was performed to determine the number of rows that should be multiplied by −1 to achieve a given level of unbalance. The levels of unbalance were defined as follows: low unbalance refers to the lowest unbalance that can be achieved by altering n1 rows. High unbalance refers to the highest level of unbalance (when a design column becomes a column of 1s or −1s values) that can be achieved by altering n3 rows. Medium unbalance is achieved by altering n2 rows where n2 is the number between n1 and n3. Table 2 shows these quantities. Note that the number of rows to modify is different depending on the level of unbalance one wants to achieve and the experimental design in which one is working. In addition, Table 2 shows the expected GBM.
Note that the rows to be modified do not follow a specific order. They can be chosen randomly from the design matrix. For example, if we want to create a medium level of unbalance in a design with five factors, we could invert signs in any subset of nine rows. This could be the first nine rows, the last nine rows, or any other possible combination.

2.2. Stage 2 True Model Generation

The true models generated were first-order models with two-factor and three-factor interactions; interactions between two and three factors were included given that these interactions are present in many real-world situations [37]. The factors and interactions included in the true models were those that presented unbalance after some rows were altered (multiplied by −1); all possible combinations for design size (five designs), level of unbalance (four levels) and level of noise (four levels), were analyzed, and three different models were created for each combination. These models were different regarding the number of factors involved, starting with a model with few terms, then a model with more terms, and finally a model with many terms. These conditions were established in order to observe how the errors behave in the presence of more or fewer factors in the model. In the end, 5 × 3 × 4 × 4 = 240 true models were generated.
The noise level was induced by modifying the size of the regression coefficients in the true model with respect to the error variance. A low noise level is obtained when the regression coefficients are approximately three times the error variance, a medium level is when this relation is 2 to 1, a high level is obtained when the relation becomes 1 to 1, and finally a very high noise level is obtained with a ½ to 1 relation; that is, regression coefficients that are half the error variance. If the error variance is fixed to 2, the low, medium, high, and very high noise levels are obtained by assigning values of 6, 4, 2, and 1 to the regression coefficients. The true models used for the different simulations are shown in Table 3, Table 4, Table 5 and Table 6. Each model has been assigned an item consisting of a number and the initial of the noise level to which it belongs to identify it in later tables.

2.3. Stage 3 MATLAB Simulations

This stage consisted in performing simulations for each of the true models generated under different levels of balance and noise. To do this, it was necessary to develop a computer program in MATLAB software. Each model was used to generate a response variable by using a Monte Carlo simulation, and 50,000 iterations were performed for each model. In each iteration, the regression coefficients were slightly varied at 10% above and below the original value. This was done to create a diversity of models. Stepwise regression was used to compute the regression coefficient estimates. Then the model errors were computed and averaged to obtain a single data point. This process is shown in Figure 5.
The simulation strategy was carried out by using design matrices with balance, low unbalance, medium unbalance, and high unbalance. All of them are orthogonal for main effects. The number of iterations was determined by observing that as the number of iterations increased, convergence was achieved for the different errors. By convergence, we mean that the averages stabilized; that is, it did not change significantly as the number of simulations increased, indicating that the averages of the estimated errors were close to their real value. This was accomplished by simulating 50,000 iterations for each model.

3. Results of Simulations

This section presents the results of the simulations in a series of tables and graphs. The objective is to identify how the model errors behave under different conditions of balance and noise. The results are show in Table 7, Table 8, Table 9 and Table 10.
The values of the tables were synthesized by taking the averages for each type of error and balance level. Table 11, Table 12, Table 13 and Table 14 show the synthesized values.
Table 11, Table 12 and Table 13 show similar behaviors for the different types of errors under low, medium, and high noise.
Table 14 shows that when the noise is very high, the percentage of no error decreases dramatically, predominating the type II error and type I and II error.
These tables show a tendency for balanced and low unbalanced designs to have high percentages of no error and type I error. However, it is observed that when the designs have medium and high unbalance, the percentages of no error and error type I decrease, increasing in turn, the percentages of type II error and type I and II error.
The same procedure was used to evaluate the behavior of the R2, R2adj, and R2pred statistics under different balance and noise levels. The results are shown Table 15, Table 16 and Table 17.
Table 15 and Table 16 show that the R2 and R2adj statistics are robust to the different levels of unbalance because their values do not change as the unbalance increases if the noise is maintained in a fixed value. It can also be observed that they resist in a good way the low, medium, and high noise level; however, a significant decrease in their values is noticed in the presence of very high noise.
From Table 17 it can be observed that designs with a high level of unbalance are not good for prediction because the R2pred reduces drastically. It can also be seen that the designs are not good at predicting when the noise level is very high.

4. Conclusions

This investigation analyzed how the lack of balance affects the experimental design properties, specifically error types and R2, R2adj, and R2pred statistics. We analyzed how the different types of errors change as the unbalance increases and how the R2, R2adj, and R2pred statistics are modified in the presence of unbalance and experimental noise.
As a result of this research, it was observed that as the degree of unbalance increases, the type II, as well as type I and II errors, become more probable in the model. It was also observed that, in general, a design with any level of balance resists low, medium, and high noise levels, given that errors become only significant when a very high degree of noise is present.
Balanced designs are more desirable for obvious reasons, but if we have no option and need to run an unbalanced design and it is intended that the errors generated are as minimum as possible, it is advisable to run a design with a low level of unbalance. This is because a design with low unbalance will provide the correct model about 40% of the times as long as the noise level is low to high, and the errors will be distributed in the following way: type I error 20%, type II error 13%, and type I and II error 26%, approximately.
When the design has medium unbalance and the level of noise is low to high, the percentages are distributed in the following way: no error 30%, type I error 15%, type II error 35%, and type I and II error 20%, approximately. If the design has high unbalance and the level of noise is low to high, the percentages are as follows: no error 19%, type I error 8%, type II error 41%, and type I and II error 31%, approximately.
When the design has a very high noise level, only a balanced design will provide acceptable percentages for error types. In this situation (very high noise), any degree of unbalance will produce results that are not favorable given that no error will decrease and error type II as well as error type I and II will increase significantly.
Regarding the R2 and R2adj statistics, they are robust to the different levels of unbalance. Under low, medium, and high noise levels, the R2 presents vales that range from 0.92 to 0.99 and the R2adj ranges from 0.87 to 0.98. The R2 and R2adj statistics reduce significantly in the presence of a very high level of noise, taking values of 0.78 and 0.66, respectively.
Designs with a high level of unbalance are not good at predicting given that values for R2pred become low, and this situation is even worse if the noise level is very high. The R2pred begins to have significant changes when a medium level of unbalance is present and reaches its lowest values when the noise level is very high.
As a general conclusion, perfect balance is not necessary in experimentation; a low level of unbalance is tolerable given that it will produce acceptable values for errors and statistics under low to high noise conditions.
Finally, similar behavior for errors and statistics can be expected in other types of designs, such as fractional factorial designs, response surface designs, and mixed-level designs, but an in-depth study is needed to determine for sure how they behave in the presence of unbalance.

Author Contributions

Conceptualization, R.R.-T.; methodology, R.R.-T. and A.J.R.-L.; validation, R.R.-T.; formal analysis, Y.V.P.-P. and J.A.V.-L.; investigation, R.R.-T., A.J.R.-L., Y.V.P.-P. and E.A.R.-S.; writing—original draft preparation, R.R.-T. and A.J.R.-L.; writing—review and editing, R.R.-T., A.J.R.-L. and Y.V.P.-P. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

We thank the editor and reviewers for their helpful comments and suggestion that greatly improved the content and quality of paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Montgomery, D.C. Design and Analysis of Experiments; John Wiley & Sons: Hoboken, NJ, USA, 2017. [Google Scholar]
  2. Gutierrez, H.; De la Vara, R. Analisis y Diseño de Experimentos, 3rd ed.; McGrawHill: New York, NY, USA, 2012. [Google Scholar]
  3. Birkes, D.S.; Seely, J.F.; Vanleeuwen, D.M. Balance and orthogonality in designs for mixed classification models. Ann. Stat. 1999, 27, 1927–1947. [Google Scholar] [CrossRef]
  4. Ríos, A.J. Sequential Experimentation, A New Experimentation Approach for Resolution III, Mixed-Level and Robust Designs. VDM Verlag Dr. Müller. 2009. Available online: http://www.vdm-verlag.de/ (accessed on 6 September 2018).
  5. Van Belle, G.; Kerr, K.F. Design and Analysis of Experiments in the Health Sciences; Jonh Wiley & Sons: Hoboken, NJ, USA, 2012. [Google Scholar] [CrossRef]
  6. Pantoja, Y.V.; Ríos, A.J.; Tapia, M. A method for construction of mixed-level fractional designs. Qual. Reliab. Eng. Int. 2019, 35, 1646–1665. [Google Scholar] [CrossRef]
  7. Yates, F. Complex experiments. Suppl. J. R. Stat. Soc. 1935, 2, 181. [Google Scholar] [CrossRef]
  8. Yates, F. The gain in efficiency resulting from the use of balanced designs. Suppl. J. R. Stat. Soc. 1938, 5, 70. [Google Scholar] [CrossRef]
  9. Nelder, J. The analysis of randomized experiments with orthogonal block structure. I. Block structure and the null analysis of variance. Proc. R. Soc. Lond. Ser. A Math. Phys. Sci. 1965, 283, 147–162. [Google Scholar] [CrossRef]
  10. John, J.A.; Smith, T.M.F. Two-factor experiments in non-orthogonal designs. J. R. Stat. Soc. Ser. B Methodol. 1972, 34, 401–409. [Google Scholar] [CrossRef]
  11. Cotter, S.C.; John, J.A.; Smith, T.M.F. Multi-factor experiments in non-orthogonal designs. J. R. Stat. Soc. Ser. B Methodol. 1973, 35, 361–367. [Google Scholar] [CrossRef]
  12. John, J.A. Factorial-balance and the analysis of designs with factorial structure. J. Stat. Plan. Inference 1981, 5, 99–105. [Google Scholar] [CrossRef]
  13. Mukerjbe, R. Inter-effect-orthogonality in factorial experiments. Calcutta Stat. Assoc. Bull. 1979, 28, 83–108. [Google Scholar] [CrossRef]
  14. Mukerjee, R. Construction of effect-wise orthogonal factorial designs. J. Stat. Plan. Inference 1981, 5, 221–229. [Google Scholar] [CrossRef]
  15. Gupta, S.C. Some new methods for constructing block designs having orthogonal factorial structure. J. R. Stat. Soc. Ser. B Methodol. 1983, 45, 297–307. [Google Scholar] [CrossRef]
  16. Speed, F.M.; Hocking, R.R.; Hackney, O.P. Methods of analysis of linear models with unbalanced data. J. Am. Stat. Assoc. 1978, 73, 105–112. [Google Scholar] [CrossRef]
  17. Herr, D.G. On the history of ANOVA in unbalanced, factorial designs: The first 30 years. Am. Stat. 1986, 40, 265–270. [Google Scholar] [CrossRef]
  18. Shaw, R.G.; Mitchell-Olds, T. ANOVA for unbalanced data: An overview. Ecology 1993, 74, 1638–1645. [Google Scholar] [CrossRef]
  19. Huber, J.; Zwerina, K. The Importance of Utility Balance in Efficient. J. Mark. Res. 1996, 33, 307–317. [Google Scholar] [CrossRef]
  20. Kerr, M.K.; Churchill, G.A. Experimental design for gene expression microarrays. Biostatistics 2001, 2, 183–201. [Google Scholar] [CrossRef]
  21. Guo, Y.; Simpson, J.R.; Pignatiello, J.J., Jr. The General Balance Metric for Mixed-level Fractional Factorial Designs. Qual. Reliab. Eng. Int. 2009, 25, 335–344. [Google Scholar] [CrossRef]
  22. Hector, A.; Von Felten, S.; Schmid, B. Analysis of variance with unbalanced data: An update for ecology & evolution. J. Anim. Ecol. 2010, 79, 308–316. [Google Scholar] [CrossRef] [PubMed]
  23. Rios, A.J.; Simpson, J.R.; Vázquez, J.A. Sequential experimentation approach for augmenting of resolution III fractions. Commun. Stat.-Theory Methods 2011, 40, 2337–2357. [Google Scholar] [CrossRef]
  24. Landsheer, J.A.; van den Wittenboer, G. Unbalanced 2 X 2 factorial designs and the interaction effect: A troublesome combination. PLoS ONE 2015, 10, e0121412. [Google Scholar] [CrossRef]
  25. Voelkel, J.G. The design of order-of-addition experiments. J. Qual. Technol. 2019, 51, 230–241. [Google Scholar] [CrossRef]
  26. Naranjo, F.; Ríos, A.J.; Pantoja, Y.V.; Tapia, M. Diseños Factoriales de Taguchi fraccionados. Ing. Investig. Tecnol. 2020, 21, 1–12. [Google Scholar] [CrossRef]
  27. Yates, F. The Principles of Orthogonality and Confounding in Replicated Experiments; Cambridge University Press: Cambridge, UK, 1933; pp. 108–145. [Google Scholar] [CrossRef]
  28. Wu, C.F.J.; Xu, H. Generalized Minimum Aberration for Asymmetrical Fractional Factorial Designs. Ann. Stat. 2001, 29, 1066–1077. [Google Scholar] [CrossRef]
  29. Xu, H. Minimum Moment aberration for nonregular designs and supersaturated designs. Stat. Sin. 2003, 13, 691–708. [Google Scholar]
  30. Xu, H.; Deng, L.Y. Moment Aberration Projection for Nonregular Fractional Factorial Designs. Technometrics 2005, 47, 121–131. [Google Scholar] [CrossRef] [Green Version]
  31. Xu, H. An Algorithm for Constructing Orthogonal and Nearly-Orthogonal Arrays with Mixed Levels and Small Runs. Technometrics 2002, 44, 356–368. [Google Scholar] [CrossRef] [Green Version]
  32. Arias, E.H.; Ríos, A.J.; Vázquez, J.A.; Pérez, R. Estudio comparativo entre los enfoques de diseño experimental robusto de Taguchi y tradicional en presencia de interacciones de control por control. Ing. Investig. Tecnol. 2015, 16, 131–142. [Google Scholar] [CrossRef] [Green Version]
  33. Mancilla, A.M. Simulación Herramienta Para el Estudio de Sistemas Reales. Revista Científica Ingeniería y Desarrollo. 1999. Available online: https://rcientificas.uninorte.edu.co/index.php/ingenieria/article/view/2226 (accessed on 19 November 2021).
  34. Draper, N.R.; Smith, H. Applied Regression Analysis; John Wiley & Sons: Hoboken, NJ, USA, 1998; Volume 326. [Google Scholar] [CrossRef]
  35. Misra, H.; Ríos, A.J.; Simpson, J.R.; Vázquez, J.A. The Quarterfold, a Sequential Augmentation Procedure for Resolution IV Fractions. Qual. Eng. 2013, 25, 118–135. [Google Scholar] [CrossRef]
  36. Ríos, A.J.; Guerrero, G. Sequential experimentation approach for robust design. Qual. Reliab. Eng. Int. 2018, 34, 1556–1565. [Google Scholar] [CrossRef]
  37. Jaccard, J.J. Interaction Effects in Factorial Analysis of Variance; SAGE Publications: Thousand Oaks, CA, USA, 1998. [Google Scholar]
Figure 1. The general model of a process.
Figure 1. The general model of a process.
Mathematics 10 03812 g001
Figure 2. Research method.
Figure 2. Research method.
Mathematics 10 03812 g002
Figure 3. Balanced design matrix 2 3 .
Figure 3. Balanced design matrix 2 3 .
Mathematics 10 03812 g003
Figure 4. Unbalanced design matrix 23.
Figure 4. Unbalanced design matrix 23.
Mathematics 10 03812 g004
Figure 5. Simulation process.
Figure 5. Simulation process.
Mathematics 10 03812 g005
Table 1. Example of a 2 3 factorial design matrix.
Table 1. Example of a 2 3 factorial design matrix.
RunFactors
ABC
1−1−1−1
21−1−1
3−11−1
411−1
5−1−11
61−11
7−111
8111
Table 2. Number of rows to be modified to achieve a desired level of unbalance.
Table 2. Number of rows to be modified to achieve a desired level of unbalance.
DesignLow UnbalanceGBM ValueMedium UnbalanceGBM ValueHigh UnbalanceGBM Value
2318216324
24232342574
255140926813396
26739814816211246
27151808303640455488
283382106716,51410124,898
Table 3. True models with low noise.
Table 3. True models with low noise.
DesignItemTrue Models
241LY = 10 − 6A − 6D + N(0,2)
2LY = 10 − 6A − 6D + 6AC + N(0,2)
3LY = 10 − 6A − 6D + 6AC + 6AD + 6ACD + N(0,2)
254LY = 10 + 6A − 6D + 6E + 6AE + 6BD + N(0,2)
5LY = 10 + 6A − 6D + 6E + 6AD + 6AE − 6BC + 6BD + N(0,2)
6LY = 10 + 6A − 6D + 6E + 6AD + 6AE − 6BC + 6BD − 6DE + 6ADE + N(0,2)
267LY = 10 − 6A + 6E − 6F + 6AC − 6AE + 6BE − 6BF + N(0,2)
8LY = 10 − 6A + 6E − 6F + 6AC − 6AE + 6BE − 6BF + 6CF + 6DE+ N(0,2)
9LY = 10 − 6A + 6E − 6F + 6AC − 6AE + 6BE − 6BF + 6CF + 6DE − 6DF + 6AEF + N(0,2)
2710LY = 10 + 6A − 6F + 6G − 6AE + 6AF + 6AG − 6BC − 6BD + 6BE + N(0,2)
11LY = 10 + 6A − 6F + 6G − 6AE + 6AF + 6AG − 6BC − 6BD + 6BE + 6CF + 6DF + N(0,2)
12LY = 10 + 6A − 6F + 6G − 6AE + 6AF + 6AG − 6BC − 6BD + 6BE + 6CF + 6DF − 6FG + 6AFG + N(0,2)
2813LY = 10 + 6A + 6G + 6H − 6AF − 6AG + 6AH + 6BE + 6BF − 6BG + 6CF − 6CG + N(0,2)
14LY = 10 + 6A + 6G + 6H − 6AF − 6AG + 6AH + 6BE + 6BF − 6BG + 6CF − 6CG + 6DF − 6DH + N(0,2)
15LY = 10 + 6A + 6G + 6H − 6AF − 6AG + 6AH + 6BE + 6BF − 6BG + 6CF − 6CG + 6DF − 6DH + 6GH + 6AGH + N(0,2)
Table 4. True models with medium noise.
Table 4. True models with medium noise.
DesignItemTrue Models
241MY = 10 − 4A − 4D + N(0,2)
2MY = 10 − 4A − 4D + 4AC + N(0,2)
3MY = 10 − 4A − 4D + 4AC + 4AD + 4ACD + N(0,2)
254MY = 10 + 4A − 4D + 4E + 4AE + 4BD + N(0,2)
5MY = 10 + 4A − 4D + 4E + 4AD + 4AE − 4BC + 4BD + N(0,2)
6MY = 10 + 4A − 4D + 4E + 4AD + 4AE − 4BC + 4BD − 4DE + 4ADE + N(0,2)
267MY = 10 − 4A + 4E − 4F + 4AC − 4AE + 4BE − 4BF + N(0,2)
8MY = 10 − 4A + 4E − 4F + 4AC − 4AE + 4BE − 4BF + 4CF + 4DE+ N(0,2)
9MY = 10 − 4A + 4E − 4F + 4AC − 4AE + 4BE − 4BF + 4CF + 4DE − 4DF + 4AEF + N(0,2)
2710MY = 10 + 4A − 4F + 4G − 4AE + 4AF + 4AG − 4BC − 4BD + 4BE + N(0,2)
11MY = 10 + 4A − 4F + 4G − 4AE + 4AF + 4AG − 4BC − 4BD + 4BE + 4CF + 4DF + N(0,2)
12MY = 10 + 4A − 4F + 4G − 4AE + 4AF + 4AG − 4BC − 4BD + 4BE + 4CF + 4DF − 4FG + 4AFG + N(0,2)
2813MY = 10 + 4A + 4G + 4H − 4AF − 4AG + 4AH + 4BE + 4BF − 4BG + 4CF − 4CG + N(0,2)
14MY = 10 + 4A + 4G + 4H − 4AF − 4AG + 4AH + 4BE + 4BF − 4BG + 4CF − 4CG + 4DF − 4DH + N(0,2)
15MY = 10 + 4A + 4G + 4H − 4AF − 4AG + 4AH + 4BE + 4BF − 4BG + 4CF − 4CG + 4DF − 4DH + 4GH + 4AGH + N(0,2)
Table 5. True models with high noise.
Table 5. True models with high noise.
DesignItemTrue Models
241HY = 10 − 2A − 2D + N(0,2)
2HY = 10 − 2A − 2D + 2AC + N(0,2)
3HY = 10 − 2A − 2D + 2AC + 2AD + 2ACD + N(0,2)
254HY = 10 + 2A − 2D + 2E + 2AE + 2BD + N(0,2)
5HY = 10 + 2A − 2D + 6E + 2AD + 2AE − 2BC + 2BD + N(0,2)
6HY = 10 + 2A − 2D + 2E + 2AD + 2AE − 2BC + 2BD − 2DE + 2ADE + N(0,2)
267HY = 10 − 2A + 2E − 2F + 2AC − 2AE + 2BE − 2BF + N(0,2)
8HY = 10 − 2A + 2E − 2F + 2AC − 2AE + 2BE − 2BF + 2CF + 2DE+ N(0,2)
9HY = 10 − 2A + 2E − 2F + 2AC − 2AE + 2BE − 2BF + 2CF + 2DE − 2DF + 2AEF + N(0,2)
2710HY = 10 + 2A − 2F + 2G − 2AE + 2AF + 2AG − 2BC − 2BD + 2BE + N(0,2)
11HY = 10 + 2A − 2F + 2G − 2AE + 2AF + 2AG − 2BC − 2BD + 2BE + 2CF + 2DF + N(0,2)
12HY = 10 + 2A − 2F + 2G − 2AE + 2AF + 2AG − 2BC − 2BD + 2BE + 2CF + 2DF − 2FG + 2AFG + N(0,2)
2813HY = 10 + 2A + 2G + 2H − 2AF − 2AG + 2AH + 2BE + 2BF − 2BG + 2CF − 2CG + N(0,2)
14HY = 10 + 2A + 2G + 2H − 2AF − 2AG + 2AH + 2BE + 2BF − 2BG + 2CF − 2CG + 2DF − 2DH + N(0,2)
15HY = 10 + 2A + 2G + 2H − 2AF − 2AG + 2AH + 2BE + 2BF − 2BG + 2CF − 2CG + 2DF − 2DH + 2GH + 2AGH + N(0,2)
Table 6. True models with very high noise.
Table 6. True models with very high noise.
DesignItemTrue Models
241VHY = 10 − A − D + N(0,2)
2VHY = 10 − A − D + AC + N(0,2)
3VHY = 10 − A − D + AC + AD + ACD + N(0,2)
254VHY = 10 + A − D + E + AE + BD + N(0,2)
5VHY = 10 + A − D + E + AD + AE − BC + BD + N(0,2)
6VHY = 10 + A − D + E + AD + AE − BC + BD − DE + ADE + N(0,2)
267VHY = 10 − A + E − F + AC − AE + BE − BF + N(0,2)
8VHY = 10 − A + E − F + AC − AE + BE − BF + CF + DE+ N(0,2)
9VHY = 10 − A + E − F + AC − AE + BE − BF + CF + DE − DF + AEF + N(0,2)
2710VHY = 10 + A − F + G − AE + AF + AG − BC − BD + BE + N(0,2)
11VHY = 10 + A − F + G − AE + AF + AG − BC − BD + BE + CF + DF + N(0,2)
12VHY = 10 + A − F + G − AE + AF + AG − BC − BD + BE + CF + DF − FG + AFG + N(0,2)
2813VHY = 10 + 2A + 2G + 2H − 2AF − 2AG + 2AH + 2BE + 2BF − 2BG + 2CF − 2CG + N(0,2)
14VHY = 10 + A + G + H − AF − AG + AH + BE + BF − BG + CF − CG + DF − DH + N(0,2)
15VHY = 10 + A + G + H − AF − AG + AH + BE + BF − BG + CF − CG + DF − DH + GH + AGH + N(0,2)
Table 7. % Error behavior with low noise.
Table 7. % Error behavior with low noise.
Design2425262728
True Model1L2L3L4L5L6L7L8L9L10L11L12L13L14L15L
BalancedNo error626674687685626977535966485459
Error I383425322415383123474134524641
Error II001000000000000
Error I and II 000000000000000
Unbalance levelLowNo error6367387090636977546065120
Error I3733123020373123464034110
Error II 00300850000000060
Error I and II004708915000000979740
MediumNo error 64684400062697749550000
Error I36321400038312336300000
Error II00377181860000066525861
Error I and II005291914000151634484239
HighNo error 50460000626977000000
Error I21170000383123000000
Error II007818186000566366515760
Error I and II 293722991914000443734494340
Table 8. % Error behavior with medium noise.
Table 8. % Error behavior with medium noise.
Design2425262728
True Model1M2M3M4M5M6M7M8M9M10M11M12M13M14M15M
BalancedNo error666657687685616977605965495459
Error I343419322414393123404135514641
Error II0024000000000000
Error I and II 000000000000000
Unbalance levelLowNo error67671970110626977546065220
Error I323362920383123464034110
Error II 002700850000010059
Error I and II004818715000000979741
MediumNo error 68682200063707745510000
Error I3232700037302333280000
Error II00417181850000066515960
Error I and II0030291915000222134494140
HighNo error 47470000636977000000
Error I17170000373123000000
Error II007718185000576365515760
Error I and II 363623991915000433735494340
Table 9. % Error behavior with high noise.
Table 9. % Error behavior with high noise.
Design2425262728
True Model1H2H3H4H5H6H7H8H9H10H11H12H13H14H15H
BalancedNo error565535687584626977535966495460
Error I353012312415383123474134514640
Error II81352011000000000
Error I and II 111000000000000
Unbalance levelLowNo error5542461140626977495445330
Error I322122330383123413524220
Error II 92863358500000200060
Error I and II3831137815000101011969540
MediumNo error 52473000636977323535000
Error I29221000373123211818000
Error II111967708085000101212525862
Error I and II81228302015000373435484238
HighNo error 34280000626876000000
Error I16110000373022000000
Error II2428538086011566366515860
Error I and II263325972014110443734494240
Table 10. % Error behavior with very high noise.
Table 10. % Error behavior with very high noise.
Design2425262728
True Model1V2V3V4V5V6V7V8V9V10V11V12V13V14V15V
BalancedNo error7722014950515454065475559
Error I661126230241746034534641
Error II6968885268821419240571000
Error I and II 17199161277650430000
Unbalance levelLowNo error330514044464127312560
Error I32023026191323191440
Error II 7070796058619243711630059
Error I and II24242033781410119494934919141
MediumNo error 440000143826440000
Error I2200007167220000
Error II676777707786432749414863525760
Error I and II272623302314361818534637484340
HighNo error 220000151212000000
Error I110000744000000
Error II656577217985435162525760515758
Error I and II 313223792115353222484340494342
Table 11. Values for different levels of unbalance with low noise.
Table 11. Values for different levels of unbalance with low noise.
Type Error2k Design with Low Noise
BalancedUnbalanced Level
LowMediumHigh
No error0.650.430.330.20
Error I0.350.220.160.09
Error II0.000.100.340.40
Type I and II error0.000.260.170.31
Table 12. Values for different levels of unbalance with medium noise.
Table 12. Values for different levels of unbalance with medium noise.
Type Error2k Design with Medium Noise
BalancedUnbalanced Level
LowMediumHigh
No error0.650.420.310.20
Error I0.340.210.150.08
Error II0.020.110.340.40
Type I and II error0.000.260.200.32
Table 13. Values for different levels of unbalance with high noise.
Table 13. Values for different levels of unbalance with high noise.
Type Error2k Design with High Noise
BalancedUnbalanced Level
LowMediumHigh
No error0.610.360.280.18
Error I0.330.180.130.08
Error II0.050.180.360.43
Type I and II error0.000.270.230.31
Table 14. Values for different levels of unbalance with very high noise.
Table 14. Values for different levels of unbalance with very high noise.
Type Error2k Design with very High Noise
BalancedUnbalanced Level
LowMediumHigh
No error0.330.150.060.03
Error I0.220.080.030.01
Error II0.360.380.590.59
Type I and II error0.090.390.320.37
Table 15. Behavior of the R2 statistic at different noise and unbalanced levels.
Table 15. Behavior of the R2 statistic at different noise and unbalanced levels.
R2
Level NoiseBalancedUnbalanced Level
LowMediumHigh
Low0.990.990.990.99
Medium0.980.980.980.98
High0.930.930.930.92
Very high0.780.790.790.77
Table 16. Behavior of the R2adj statistic at different noise and unbalanced levels.
Table 16. Behavior of the R2adj statistic at different noise and unbalanced levels.
R2adj
Level NoiseBalancedUnbalanced Level
LowMediumHigh
Low0.980.990.990.98
Medium0.960.980.980.96
High0.870.880.880.87
Very high0.650.670.670.66
Table 17. Behavior of the R2pred statistic at different noise and unbalanced levels.
Table 17. Behavior of the R2pred statistic at different noise and unbalanced levels.
R2pred
Level NoiseBalancedUnbalanced Level
LowMediumHigh
Low0.960.930.880.79
Medium0.920.860.850.77
High0.740.720.720.70
Very high0.470.490.500.50
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Ramírez-Tapia, R.; Ríos-Lira, A.J.; Pantoja-Pacheco, Y.V.; Vázquez-López, J.A.; Ruelas-Santoyo, E.A. Quantitative Analysis of the Balance Property in Factorial Experimental Designs 24 to 28. Mathematics 2022, 10, 3812. https://0-doi-org.brum.beds.ac.uk/10.3390/math10203812

AMA Style

Ramírez-Tapia R, Ríos-Lira AJ, Pantoja-Pacheco YV, Vázquez-López JA, Ruelas-Santoyo EA. Quantitative Analysis of the Balance Property in Factorial Experimental Designs 24 to 28. Mathematics. 2022; 10(20):3812. https://0-doi-org.brum.beds.ac.uk/10.3390/math10203812

Chicago/Turabian Style

Ramírez-Tapia, Ricardo, Armando Javier Ríos-Lira, Yaquelin Verenice Pantoja-Pacheco, José Antonio Vázquez-López, and Edgar Augusto Ruelas-Santoyo. 2022. "Quantitative Analysis of the Balance Property in Factorial Experimental Designs 24 to 28" Mathematics 10, no. 20: 3812. https://0-doi-org.brum.beds.ac.uk/10.3390/math10203812

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop