Next Article in Journal
Multivariate Statistical and Correlation Analysis between Acoustic and Geotechnical Variables in Soil Compression Tests Monitored by the Acoustic Emission Technique
Previous Article in Journal
A Numerical Approach for Dealing with Fractional Boundary Value Problems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Risk Evaluation Method Based on Fault Propagation and Diffusion

1
School of Mechatronic Engineering, Xi’an Technological University, Xi’an 710021, China
2
School of Mechanical and Aerospace Engineering, Jilin University, Changchun 130022, China
3
School of Materials Science and Chemical Engineering, Xi’an Technological University, Xi’an 710021, China
*
Author to whom correspondence should be addressed.
Submission received: 21 August 2023 / Revised: 22 September 2023 / Accepted: 25 September 2023 / Published: 26 September 2023

Abstract

:
The high reliability demand of the machining center emphasizes the accuracy of the fault risk evaluation. In the traditional fault risk evaluation research of the machining center, the influence of fault mode is mostly based on subjective recommendation or does not consider the propagation and diffusion of fault, which makes the risk evaluation results different from the real situation. Therefore, this paper presents a framework to evaluate the fault risk for machining center components. A certain type of machining center is considered as a case study. The fault mode frequency ratio of components is calculated by fault mode analysis. The fault rate calculation is conducted based on the Johnson method. Considering that different fault modes have different influences on fault propagation breadth and depth, the hypergraph theory is used to build a hypernetwork model. The propagation and diffusion influence degree are defined to describe the propagation and diffusion process of faults. Then, the comprehensive influence degree of fault mode is calculated. The risk evaluation is realized by considering the component fault rate, fault mode frequency ratio, and the comprehensive influence degree of fault mode. The method proposed in this paper can provide a reference for the formulation of risk strategies for the machining center.

1. Introduction

As the “core equipment” of the manufacturing industry, the machining center has the characteristics of high precision, high efficiency, and high flexibility [1]. Affected by the internal structure and the external working conditions, its use process is more and more complicated. Minor damage to the components will cause a series of cascade reactions in the whole machine [2]. If the component fails and cannot be handled in time, it will affect the whole production schedule. In serious cases, it may even threaten the personal safety of on-site operators. This puts forward higher requirements for the reliability and safety of the machining center. Therefore, risk evaluation plays a significant role in ensuring the safe and stable operation of the machining center.
Risk is the possibility of encountering hazards in the future [3,4,5]. Risk evaluation is not only to test, measure, and estimate risks but also to reprocess the results of risk identification. At present, the commonly used risk evaluation methods for complex systems are based on expert knowledge [6], fault mode and effect analysis (FMEA) [7], fault mode effect and criticality analysis (FMECA) [8], fault tree [9], probabilistic risk analysis [10], Bayesian network [11], and other methods [12].
Combining the functional fault identification and propagation of the system with fuzzy logic, Wang et al. proposed a functional fault risk analysis method for natural gas fracturing system based on expert knowledge [13]. Wang et al. improved the FMEA method and constructed a hybrid FMEA risk analysis framework by applying the cloud model, Choquet integral, and gain–loss dominance scoring method [14]. This method considered the risk of groups and individuals to prioritize fault risks and carried out machine tool risk analysis accordingly. On the basis of the traditional FMEA method, Li et al. transformed the method into a cloud model and then applied the best–worst method (BWM) to obtain the cloud weight [15]. Combined with the cloud distance to evaluate the expert’s weight on the fault mode risk factors, the comprehensive cloud expression of each fault mode was obtained to evaluate the risk of the machine tool. Wang et al. applied multi-criteria decision-making and probabilistic hesitant fuzzy linguistic term sets to improve traditional FMEA [16]. This method increased the cognitive uncertainty and group risk and then achieved the risk evaluation of fault modes via expert groups. Yucesan et al. applied the fuzzy best–worst method to weigh the risk parameters in FMEA and integrated the Fuzzy Bayesian Network (FBN) with FMEA to clarify the correlation between equipment fault events [17]. Then, they carried out the risk analysis of industrial equipment. Shen et al. proposed an improved FMEA method based on multi-attribute group decision making for the problems of unreasonable allocation of expert weights [18]. Rouabhia-Essalhi, Boukrouh, and Ghemari applied the FMECA method in the risk analysis of industrial equipment [19]. By taking measures to reduce the risk priority number to an acceptable risk value, the potential fault of equipment was reduced, thereby reducing the downtime of equipment. Aiming at the lack of evaluation data and poor interpretability in the risk evaluation process of complex systems, Zhu et al. constructed a risk evaluation model based on belief rule base (BRB) and fault tree analysis (FTA) [20]. To identify system risks, Kwag, Gupta, and Dinh constructed a performance-based probabilistic risk evaluation method, which uses Bayesian statistics to clarify the correlation between non-Boolean relationships and different levels of system events [21]. Considering that many factors will affect the reliability of high-speed trains, Zeng et al. constructed a risk evaluation model based on a Bayesian network array to reduce the impact of faults on the operation reliability of trains [22]. Based on fault tree analysis, integrated fuzzy set theory, Bayesian network, and Markov chain, Mohammadi et al. conducted the top event probability and reliability evaluation of cranes to determine the appropriate risk evaluation methods [23].
As described in the above literature, the research on risk evaluation of complex systems has made important progress, but it also faces certain challenges. In the process of fault risk evaluation of complex systems, the fault propagation breadth and depth of different modes are different, and there are certain functional dependencies between components. It is necessary to further explore the influence of fault modes. In addition, most of the risk evaluation models established in the past are static models, which cannot realize real-time dynamic evaluation of fault risk. Therefore, this paper proposes a novel framework for risk evaluation. A case study on the CNC machine center is conducted to elaborate the framework. The framework integrates the fault mode frequency ratio, the fault rate function, and the comprehensive influence degree of fault mode to dynamically evaluate the component risk.
The main contributions of this paper are as follows: (1) A framework to evaluate the risk of the CNC machining center component is presented. (2) A component fault mode hypernetwork model is constructed based on the hypergraph theory, considering the heterogeneity of the network between components and the network between fault modes. (3) A calculation method of comprehensive influence of fault mode is developed, which takes into account the propagation and diffusion of faults. (4) A case study using the real data of a CNC machining center is conducted, and we finally realized the evaluation of the risk of a certain type of CNC machining center at different times.
The rest of this paper is organized as follows: Section 2 presents the framework for risk evaluation. Section 3 elaborates on the fault mode analysis. Section 4 calculates the fault rate of the component. Section 5 provides the method of comprehensive influence of fault mode. In Section 6, the risk evaluation of the component is conducted. Conclusions are given in Section 7.

2. A Framework for Risk Evaluation

The machining center considered in this paper is a typical manufacturing equipment. Its healthy operation is crucial for the entire production system. We collected fault data from customers who purchased this type of machining center. The time span of the fault data collection was one year. Figure 1 presents a framework for risk evaluation, and the main technologies included in this framework are as follows:
(1)
Fault mode analysis. Fault mode analysis is the first step in developing the risk evaluation of machining center components. The fault frequency and fault mode of the components are recorded during the operation of the machining center. Then, we calculate the frequency ratio of component fault modes. This step acts as a basis for risk evaluation of the machining center.
(2)
Fault rate calculation. Fault rate acts as a core role in component risk evaluation. We use the Johnson method to correct the fault data rank. The two-parameter Weibull model fits in as a hypothetical distribution of the fault data. The least-squares method can obtain accurate parameter estimates. Eventually, the fault rate function of each component is determined.
(3)
Comprehensive influence degree of fault mode. The comprehensive fault mode influence degree is to reflect the influence degree of each fault mode more accurately. First, we construct a hypernetwork model of the component fault mode. Second, the propagation and diffusion influence of fault modes is calculated. Finally, the comprehensive influence degree of the fault mode can be obtained.
(4)
Risk evaluation. In this paper, the risk evaluation considers the component fault rate, the frequency ratio of fault mode, and the comprehensive influence degree of fault modes. The accuracy and reliability of the risk evaluation results can provide a reference for the formulation of the risk strategy of the machining center components.

3. Fault Mode Analysis

In this section, we conduct the fault mode analysis of the components. According to the functional structure mapping relationship of the machining center, we divide it into components, and clarify the composition and relationship of each component. We compiled the operation record table and the fault record table of the machining center. The record table mainly includes the start time of the record, fault occurrence time, fault location, fault causes, and solutions. Then, the fault information is collected, summarized, and sorted out. Therefore, we carried out fault mode analysis, explored the fault mechanism, and obtained the fault mode frequency ratio of components.
This paper takes a certain series of machining centers as the research object. It is divided into 11 components: tool magazine (M), spindle system (S), cooling system (W), hydraulic system (D), electrical system (V), worktable (T), pneumatic system (G), XYZ axis feed system (J), lubrication system (L), auxiliary system (K), and numerical control system (NC). Via fault analysis, we draw the fault distribution histogram of the machining center components, as shown in Figure 2.
Via the fault mode analysis, the frequency histogram of the fault mode is obtained, as shown in Figure 3, and the code is detailed in Table 1.
As can be found in Figure 3, the most common fault mode of this series of machining centers is 101, 103, and 301. Therefore, it is of great significance to ensure the reliable quality of components for the reliable operation of the machining center.

4. Fault Rate Calculation

The Johnson method is applied to correct the fault time rank of machining center components [24]. It is assumed that the fault data of machining center components obey the Weibull distribution [25]. The reliability modeling process is shown in Figure 4. As shown in Figure 4, the Johnson method is applied to correct the order of the collected fault data, and then a two-parameter Weibull model is constructed. Using the least squares method for parameter estimation.
The relationship between the fault rate function and the reliability function is shown below.
λ ( t ) = f ( t ) R ( t ) = γ θ γ t γ - 1 , t 0
where λ ( t ) is fault rate function, f ( t ) is probability density function, R ( t ) is reliability function, t is time variable, θ is scale parameter, and γ is shape parameter.
According to the reliability modeling process in Figure 4, combined with the collected fault data of components, the component fault rate function can be obtained, as shown in Table 2. Since the fault rate of components is a function of time, different times will correspond to different values. This paper arbitrarily takes 4000 h as an example to illustrate the method. The fault rate of the component at 4000 h can be obtained based on Table 2, as shown in Table 3.

5. Comprehensive Influence Degree of Fault Mode

In traditional risk evaluation, the influence degree of fault mode is mostly based on the reference value given by recommended standards or quantitative estimation [26]. In this paper, a hypernetwork model is used to calculate the propagation and diffusion value of each fault mode.

5.1. Construction of Component Fault Mode Hypernetwork

The concept of the hypergraph was proposed by C. Berge in 1970. Hypergraphs can be used to analyze multi-layer heterogeneous networks. A network that can be described by a hypergraph is a hypernetwork [27,28]. A component of the machining center has multiple fault modes, and a fault mode may occur in multiple components. The components and fault modes are associated in the form of a set. The component fault mode network is a heterogeneous network composed of two types of nodes with different properties. Therefore, the machining center component fault mode hypernetwork model can be established to describe the interactions and effects between component fault modes.
During the operation of the machining center, different components may have different fault modes, and the same mode may occur multiple times in the same component. Therefore, this paper uses the number of modes occurring in the components as weights to build a weighted component fault mode hypergraph with components as hyperedges and fault modes as nodes.
Defining the component fault mode hypergraph as H = ( V , E , C ) , then
1.
V = { v 1 , v 2 , , v g } , g is the number of fault modes and v i V ( 1 i g ) is the fault mode with code i , called the node of the hypergraph.
2.
E = { e 1 , e 2 , , e h } is a subset of V , e j E ( 1 j h ) represents the component of the code j , which is called hyperedge, and has the following properties:
(1)
e j V ( e j , 1 j h ) , which means that the component can have one or more modes.
(2)
j = 1 h e j = V , the set of elements of all hyperedges in the hypergraph is the set of modes.
(3)
v i e j means that fault mode v i is a unit of e j , indicating that the component has the v i th fault mode.
(4)
If v i , v j e j , node v i and v k are adjacent in component e j , denoted as v i e j v k . If there is no component making node v i adjacent to v k , denoted as v i v k .
(5)
If e i e j , the hyperedge e i is adjacent to e j , denoted as e i e j , and the adjacent hyperedges have a common node. For v i , e j make v i e j , there is no isolated fault mode that does not belong to any component.
3.
C = { c ij | v i e j , 1 i g , 1 j h } represents the set of times the fault mode occurs in the component. c ij is the weight of the node v i belonging to the hyperedge e j , if v i e j , the value of c ij is not 0, otherwise it is 0.
4.
The fault mode node v i V is described by the fault mode code i and the fault mode occurrence probability μ i j ( t ) . The larger μ i j ( t ) is, the greater the probability of the fault mode v i occurring in the component e j is.
5.
Component hyperedge e j E , described by code j and reliability R e j ( t ) . The larger R e j ( t ) is, the higher the reliability of component e j is, and the smaller the probability of fault is.
6.
The component fault mode hypergraph H can be expressed in two forms: set or matrix. Since the matrix can express the relationship between elements more clearly, this paper uses the form of matrix to describe the relationship between components and fault modes. The correlation matrix corresponding to the hypergraph H is expressed by H .
The h row in the matrix H corresponds to the component hyperedge e 1 , e 2 , , e h , and the g column corresponds to the fault mode node v 1 , v 2 , , v g . Let H [ i , j ] = c ij denote the element in the i th row and j th column of the matrix.
The edge load attribute will be an important structural feature of key fault propagation path identification in subsequent systems. From the process of building a component fault mode hypernetwork model, combined with the collected fault information, a machining center weighted component fault mode hypergraph is obtained, as shown in Figure 5.
According to Figure 5, the correlation matrix H can be obtained as follows:
H = 101 301 103 102 605 108 201 203 404 506 [ 6 2 2 0 3 0 4 0 1 0 3 0 0 0 0 0 2 2 0 2 8 1 0 1 0 0 0 1 1 0 4 5 1 0 0 0 7 0 0 0 2 0 0 0 2 0 2 0 0 0 0 0 0 0 1 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 ]     M J S V K G W NC L D T

5.2. Propagation and Diffusion Influence Degree of Fault Mode

When one fault mode occurs in a component, it will cause the occurrence of other fault modes of the component, and it will diffuse to other components via the hyperedge connection in the hypernetwork, causing faults. The influence of a certain fault mode on the entire network has always been a concern of researchers. The calculation of the fault influence degree can quantitatively guide the risk evaluation. Therefore, on the basis of establishing the component fault mode hypernetwork, the fault propagation and diffusion influence degree are studied. It should be noted that the following definitions should be made when calculating the fault propagation and diffusion influence degree of components:
1.
The propagation influence of fault mode. If fault mode v i occurs and there is v i e j v k , the occurrence of fault mode v i will cause the occurrence of v k , and this process is called the propagation influence of fault mode.
2.
Fault propagation capability. The probability of occurrence of a fault mode in a component that causes other fault modes in the component to occur. The fault propagation capability is related to the occurrence probability of the mode v i .
3.
The influence degree of fault propagation. The propagation influence degree represents the probability that the occurrence of a certain fault mode will cause the occurrence of a specific fault mode. It is related to the propagation capability of the faults.
4.
The diffusion influence of fault mode. If v i e x v j , v j e y v k , and v i v k , the phenomenon that the occurrence of fault mode v i eventually leads to the occurrence of mode v k is called fault mode diffusion influence.
5.
The influence degree of fault diffusion. The diffusion influence degree indicates the probability that the occurrence of a certain fault mode will cause the occurrence of other fault modes.
6.
Fault diffusion trajectory. The diffusion trajectory from v 1 to v t + 1 is composed of a set of fault mode node sequences ( v 1 , v 2 , , v t , v t + 1 ) . Adjacent nodes belong to the same component and have an association relationship because they belong to the same component. Non-adjacent nodes belong to different components. The length of the trajectory from v i to v i + 1 is counted as one step, the length of the trajectory from v i to v i + k is counted as k steps, and v i + k is called the k -step reachable node of v i .
7.
The influence degree of fault propagation diffusion. The sum of the propagation and diffusion influences caused by a certain fault mode in the hypernetwork.
(1)
Fault mode propagation influence capability.
The fault propagation influence capability of a fault mode is related to the probability of occurrence of the fault mode and the number of times it occurs in a component. According to the established hypergraph, when a certain fault mode of a component occurs many times during the operation of the machining center, it indicates that the mode of the component has a strong fault propagation capability. The function φ ( v i , e j ) represents the fault propagation capability of fault mode v i in component e j , is shown as follows:
φ ( v i , e j ) = μ i j ( t ) × H [ i , j ] k = 1 n ( H [ k , j ] ) 2 , ( k i )
where 0 φ ( v i , e j ) 1 . H [ i , j ] = c ij indicates the number of times of fault mode v i appearing in component e j . μ i ( t ) represents the occurrence probability of fault mode v i , and the calculation equation is shown in Equation (4):
μ i j ( t ) = α ji ( t ) λ j ( t ) λ Z ( t )
where α ji ( t ) is the frequency of fault mode v i in component e j at time t . λ j ( t ) represents the fault rate of component e j at time t , and λ z ( t ) represents the fault rate of the whole machine at time t .
According to Equation (4), combined with Table 1 and Table 3, the fault mode frequency ratio and fault mode occurrence probability of components at 4000 h are obtained and summarized, as shown in Table 4.
Using the correlation properties of the correlation matrix, the propagation capability matrix B of the fault mode in each component can be calculated as B [ i , j ] = φ ( v i , e j ) . The stronger the propagation capability of a certain fault mode, the greater the probability of causing other fault modes, and the larger the corresponding element value in matrix B .
B = 101 301 103 203 404 506 [ 0.1096 0.0769 0.0042 0.0025 0 0.0235 0 0 0 0.0100 0.0362 0.0026 0 0.0192 0 0.0400 0.0142 0.0026 0 0 0.0010 0 0 0 0 0 0 0 0 0 0.0030 0 0 0 0 0 ]     M       J     S       L   D   T  
(2)
Calculation of fault mode propagation influence degree
According to the fault mode propagation capability matrix, the propagation influence matrix between the fault modes can be further calculated. Corresponding to the fault mode-component hypergraph, which can represent the fault propagation situation between the fault modes. The fault mode propagation influence matrix represented by the matrix A is shown in Equation (6):
A = B B T I
where B T is the transpose of matrix B and I is the diagonal matrix of B B T . Since the propagation of faults occurs between different modes, it is not necessary to consider the diagonal elements in A , and set it to 0. The elements in the matrix can be represented as
A = { 0 ,       i = j k = 1 n ( B [ i , k ] B [ k , j ] ) , i j
This matrix characterizes the fault propagation between fault modes within the same component. At this time, the fault is only propagated inside the component. The off-diagonal elements represent the influence of fault propagation between modes v i and v j , that is, the probability of causing another fault mode when one of the fault modes occurs. If the value of the off-diagonal element is 0, it means that there is no fault propagation between the fault modes.
We bring the matrix B into Equation (7) to obtain the matrix A , as shown below. The one-step fault propagation trajectory of each fault mode can be obtained from the propagation influence matrix A . Taking mode 102 as an example, its one-step propagation trajectory is v 102 e NC v 501 , v 102 e V v 108 , and v 102 e V v 409 .
A =           101       301       103             203       404       506 101 301 103 203 404 506 [ 0 0.0311 0.1700 0.0004 0 0.0334 0.0311 0 0 0 0.0006 0 0.1700 0.0192 0 0 0.0002 0 0.0004 0 0 0 0 0 0 0.0006 0.0002 0 0 0 0.0334 0 0 0 0 0 ] × 10 2
(3)
Calculation of fault mode diffusion influence degree
The fault mode propagation influence matrix A indicates the situation of fault propagation via “shared” hyperedges. In fact, the fault not only propagates once but also further diffuses. As shown in Figure 6, the relationship between nodes and edges is v 1 , v 2 e 1 , v 2 , v 3 e 2 . v 1 can propagate faults to v 2 , v 2 can propagate faults to v 3 , and v 1 cannot directly propagate faults to v 3 , but v 2 also belongs to the hyperedges e 1 and e 2 . v 1 fault can still transmit the fault to v 3 via the “springboard” effect of v 2 . This situation is known as fault diffusion.
It should be noted that the fault probability of the k-step reachable node is a conditional probability based on the fault of the (k − 1) step reachable node.
P = r = 1 n , r i , j ( A ( k 1 ) [ i , r ] A [ r , j ] )
In the process of fault propagation, the reliability of the component will also affect the diffusion probability of the fault mode. The higher the reliability of the component, the stronger the ability to resist the fault. Therefore, combining the conditional probability with the fault probability, the diffusion influence matrix is obtained as follows:
A ( k ) [ i , j ] = { A ( k ) [ i , j ] = 0 ,   i = j r = 1 n , r i , j ( A ( k 1 ) [ i , r ] A [ r , j ] ) ,   i j , A ( k 1 ) [ i , j ] 0 r = 1 n , r i , j { A ( k 1 ) [ i , r ] A [ r , j ] ( 1 R e j ( t ) ) } ,   i j , A ( k 1 ) [ i , j ] = 0
According to Equations (1) and (10), combined with Table 4, the two-step fault diffusion influence matrix A ( 2 ) can be obtained, as shown below.
A ( 2 ) =           101       301       103               203       404       506 101 301 103 203 404 506 [ 0 0.1736 0.2688 0.0017 0.0005 0.0397 0.1736 0 0.0542 0.0001 0.0002 0.0104 0.2688 0.0542 0 0.0008 0.0006 0.0567 0.0017 0.0001 0.0007 0 0 0.0002 0.0004 0.0002 0.0006 0 0 0 0.0397 0.0090 0.0503 0.0002 0 0 ] × 10 5
We still take mode 102 as an example, whose two-step diffusion trajectory is shown in Table 5. It is important to note that the fault will not propagate continuously. When the influence value is less than a certain value, the fault will not propagate. According to literature research [29], this paper assumes that the fault will not diffuse when the influence degree of fault diffusion is less than 10−8.
In theory, the two-step diffusion trajectory of mode 102 can realize fault diffusion, but via calculation, it is found that the two-step diffusion influence value of mode 102 to some nodes is less than 10−8, as shown in the dotted line node in Table 5. At this time, the fault will not be transmitted. Therefore, the two-step diffusion trajectory of mode 102 has a total of 8 paths reachable.
Meanwhile, according to Table 5, it can be seen that there may be multiple diffusion paths between the two modes. For example, mode 102 can pass e NC and then propagate the fault to the mode 101 through e K , or it can pass e NC and then propagate to mode 101 through e M .
Similarly, the three-step fault diffusion influence matrix A ( 3 ) of each fault mode can be calculated, as shown below.
A ( 3 ) =           101       301       103               203       404       506 101 301 103 203 404 506 [ 0 0.0263 0.0537 0.0002 0.0002 0.0026 0.1095 0 0.0422 0.0001 0 0.0071 0.4542 0.0111 0 0.0004 0 0.0158 0.0014 0.0001 0.0004 0 0 0.0001 0.0002 0.0001 0.0001 0 0 0 0.0939 0.0064 0.0156 0.0001 0 0 ] × 10 7
From the matrix A ( 3 ) , it can be found that when the fault diffuses in the third step, there have been many modes with the fault diffusion influence degree less than 10−8. After the calculation of the fault diffusion in the fourth step, the fault diffusion influence degree of all modes is less than the threshold of 10−8. Therefore, the fault of the machining center occurs in three steps at most at 4000 h.
As the number of diffusion steps of fault mode increases, the diffusion influence of fault is constantly decreasing, but the influence of the comprehensive fault mode is on the rise. Therefore, although the influence of diffusion is becoming smaller and smaller, its influence on the whole fault propagation cannot be ignored.

5.3. Comprehensive Influence Degree of Fault Mode

It is assumed that after the occurrence of fault mode v i , k-step propagation and diffusion are conducted, and its influence on other nodes in the hypernetwork is expressed by the comprehensive influence degree of the fault mode:
U ( i , k ) = u ( i , 1 ) + u ( i , 2 ) + + u ( i , k )
where, 1 i g , 1 k h . When k = 1, it means that the fault has one-step propagation, and only needs to calculate U ( i , 1 ) . At this time, the direct propagation influence of the fault can be obtained according to the influence degree matrix of the direct propagation of the fault. The calculation equation is as follows:
U ( i , k ) = u ( i , 1 ) = j = 1 g A [ i , j ]   ( i j )
When k > 1 , it means that the fault has diffused further. It should be noted that when calculating U ( i , k ) , it is not necessary to re-calculate the influence of the node that has failed in the step k.
According to the multi-step propagation diffusion influence matrix, combined with the hypergraph model, the propagation diffusion influence value of each step of the fault mode can be obtained. Furthermore, the comprehensive influence value of the fault mode can be calculated by Equation (13), which is summarized in Table 6.
As can be found from Table 6, the modes with high comprehensive influence degree at 4000 h are 101, 503, 103, 502, 601, 603, and 803. However, the traditional methods for determining the influence of fault modes are mostly based on the subjective evaluation of humans, lacking objective quantitative indicators.

6. Risk Evaluation

The main factors affecting the fault risk of components are the fault rate of the component, the frequency ratio of fault modes, and the influence degree of fault modes. According to Equation (15), the risk of components can be calculated as
R j ( t ) = i = 1 g α ji ( t ) λ j ( t ) U j ( i , k )
where R j ( t ) is the value of risk for the component j . t is a time variable. g is the number of types of fault modes. a ji ( t ) is the fault mode frequency ratio of component j in mode i . λ j ( t ) represents the fault rate of component j , and U ( i , k ) is the i th fault mode comprehensive influence degree.
According to Equation (15), combined with the analysis of the calculation results in Table 5 and Table 6, the risk values of the components at 4000 h are calculated, as shown in Table 7.
It can be seen from Table 7 that when the machining center runs for 4000 h, the risk values of components are in descending order: M > J > K > W > T > L > S > D > N C > G > V . The component with the highest risk of fault is M, then J, and the least is V. Therefore, attention should be paid to the maintenance and health management of the tool magazine system and XYZ axis feed system.
The risk evaluation method for components of machining centers proposed in this paper is a time-varying model, which can obtain the risk value corresponding to components at any time, and fully consider the influence of propagation and diffusion of fault modes on components. In order to show its superiority, 300 h and 1000 h are selected as examples for application promotion. Considering the influence of fault propagation and diffusion, the fault risk value of the component is calculated. The comparison chart of the sorting trend of component risk value at different moments is obtained, as shown in Figure 7.
As can be found in Figure 7, the sorting results of the fault risk of the components at different times are different. The sorting result of the fault risk value at 300 h is N C > D > L > K > W > T > J > M > V > S > G , 1000 h is J > M > W > K > T > L > D > S >   N C > V > G and 4000 h is M > J > K > W > T > L > S > D > N C > G > V . From this, it can be seen that in the early stages of machining center operation, the components with a higher risk of fault are the numerical control system and hydraulic system. As time goes by, there is a greater risk of faults in the tool magazine and feed system. This is because, with the extension of the operation time of the machining center, the comprehensive influence degree of the tool magazine and the feed system is increasing. This is in line with the engineering.

7. Conclusions

In this paper, we presented a framework for the evaluation of the fault risk. Fault mode analysis is conducted to explore the fault mechanism and obtain the fault mode frequency ratio of components. The fault rate function is built based on Weibull distribution. A hypergraph model is established to calculate the comprehensive influence degree of fault mode. The risk evaluation is carried out based on the fault mode frequency ratio, fault rate, and the comprehensive influence of fault. A real fault data set of the CNC machining center has been used to present the procedure of risk evaluation. The main concluding remarks are given as follows:
(1)
The most common fault mode of this series of machining centers is 101, 103, and 301. Therefore, it is of great significance to ensure the reliable quality of components for the reliable operation of the machining center.
(2)
In the process of fault propagation, although the diffusion influence is becoming smaller and smaller, its cumulative influence on the whole fault propagation cannot be ignored. The modes with a high comprehensive influence degree at 4000 h are 101, 503, 103, 502, 601, 603, and 803.
(3)
Based on the proposed fault risk evaluation method, the risk value of the machining center at 4000 h is obtained, and the sorting result is M > J > K > W > T > L > S > D > NC > G > V . The component with the highest risk of fault is M, then J, and the least one is V. Therefore, attention should be paid to the maintenance and health management of the tool magazine system and XYZ axis feed system.
(4)
Via the calculation, we found that the sorting of the risk value of the machining center components is different at different times. At 300 h is N C > D > L > K > W > T > J > M > V > S > G , 1000 h is J > M > W > K > T > L > D > S > N C > V > G , and 4000 h is M > J > K > W > T > L > S > D > N C > G > V . The influence of fault is not static, but dynamic.
However, this paper still has some shortcomings and needs to be improved in the future. (1) The research on fault propagation and diffusion in this paper is based on statistical data, which can be integrated into real-time online monitoring data to obtain more accurate risk evaluation results. (2) This paper does not consider the impact of different maintenance strategies on the risk of component fault in machining centers. However, different levels of maintenance will inevitably have different impacts on component faults. Therefore, in future research, a more comprehensive risk evaluation framework should be established considering the impact of maintenance strategies.

Author Contributions

Conceptualization, L.M. and Y.Z.; methodology, L.M.; software, Q.Z.; validation, L.M. and Y.Z.; formal analysis, Y.Z.; investigation, L.M. and Q.Z.; resources, Y.Z.; data curation, Y.Z.; writing—original draft preparation, L.M.; writing—review and editing, L.M.; visualization, Q.Z.; supervision, Y.Z.; project administration, Y.Z.; funding acquisition, Y.Z. and L.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Jilin Province Science and Technology Development Plan Project (Grant No. 20190302104GX).

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zhang, Y.Z.; Mu, L.M.; Liu, J.L.; Liu, J.T.; Tian, Z.F.; Zhang, Y.L. Application of fault propagation intensity in fault diagnosis of CNC machine tool. J. Chin. Inst. Eng. 2020, 43, 153–161. [Google Scholar] [CrossRef]
  2. Zhang, Y.Z.; Liang, S.B.; Liu, J.L.; Cao, P.L.; Luan, L. Evaluation for machine tool components importance based on improved LeaderRank. Proc. Inst. Mech. Eng. Part O-J. Risk Reliab. 2021, 235, 331–337. [Google Scholar] [CrossRef]
  3. Joubert, F.; Steyn, E.; Pretorius, L. Using the HAZOP Method to Conduct a Risk Assessment on the Dismantling of Large Industrial Machines and Associated Structures: Case Study. J. Constr. Eng. Manag. 2021, 147, 1–15. [Google Scholar] [CrossRef]
  4. Chen, Y.F.; Zhang, G.B.; Ran, Y. Risk Analysis of Coupling Fault Propagation Based on Meta-Action for Computerized Numerical Control (CNC) Machine Tool. Complexity 2019, 2019, 3237254. [Google Scholar] [CrossRef]
  5. Ou-Yang, C.; Chen, W.L. Applying a risk assessment approach for cost analysis and decision-making: A case study for a basic design engineering project. J.-Chin. Inst. Eng. 2017, 40, 378–390. [Google Scholar] [CrossRef]
  6. Awodi, N.J.; Liu, Y.-K.; Ayodeji, A.; Adibeli, J.O. Expert judgement-based risk factor identification and analysis for an effective nuclear decommissioning risk assessment modeling. Prog. Nucl. Energy 2021, 136, 103733. [Google Scholar] [CrossRef]
  7. Lo, H.W.; Liou, J.J.H.; Huang, C.N.; Chuang, Y.C. A novel failure mode and effect analysis model for machine tool risk analysis. Reliab. Eng. Syst. Saf. 2019, 183, 173–183. [Google Scholar] [CrossRef]
  8. Singh, J.; Singh, S.; Singh, A. Distribution transformer failure modes, effects and criticality analysis (FMECA). Eng. Fail. Anal. 2019, 99, 180–191. [Google Scholar] [CrossRef]
  9. Yasli, F.; Bolat, B. A risk analysis model for mining accidents using a fuzzy approach based on fault tree analysis. J. Enterp. Inf. Manag. 2018, 31, 577–594. [Google Scholar] [CrossRef]
  10. Sakurahara, T.; Reihani, S.; Kee, E.; Mohaghegh, Z. Global importance measure methodology for integrated probabilistic risk assessment. Proc. Inst. Mech. Eng. Part O-J. Risk Reliab. 2020, 234, 377–396. [Google Scholar] [CrossRef]
  11. Fan, S.Q.; Yang, Z.L.; Blanco-Davis, E.; Zhang, J.F.; Yan, X.P. Analysis of maritime transport accidents using Bayesian networks. Proc. Inst. Mech. Eng. Part O-J. Risk Reliab. 2020, 234, 439–454. [Google Scholar] [CrossRef]
  12. Chemweno, P.; Pintelon, L.; Muchiri, P.N.; Van Horenbeek, A. Risk assessment methodologies in maintenance decision making: A review of dependability modelling approaches. Reliab. Eng. Syst. Saf. 2018, 173, 64–77. [Google Scholar] [CrossRef]
  13. Wang, Q.; Diao, X.; Zhao, Y.; Chen, F.; Yang, G.; Smidts, C. An expert-based method for the risk analysis of functional failures in the fracturing system of unconventional natural gas. Energy 2021, 220, 119570. [Google Scholar] [CrossRef]
  14. Wang, W.Z.; Liu, X.W.; Liu, S.L. Failure Mode and Effect Analysis for Machine Tool Risk Analysis Using Extended Gained and Lost Dominance Score Method. IEEE Trans. Reliab. 2020, 69, 954–967. [Google Scholar] [CrossRef]
  15. Li, X.L.; Ran, Y.; Zhang, G.B.; He, Y. A failure mode and risk assessment method based on cloud model. J. Intell. Manuf. 2020, 31, 1339–1352. [Google Scholar] [CrossRef]
  16. Wang, Z.C.; Ran, Y.; Chen, Y.F.; Yang, X.; Zhang, G.B. Group risk assessment in failure mode and effects analysis using a hybrid probabilistic hesitant fuzzy linguistic MCDM method. Expert Syst. Appl. 2022, 188, 116013. [Google Scholar] [CrossRef]
  17. Yucesan, M.; Gul, M.; Celik, E. A holistic FMEA approach by fuzzy-based Bayesian network and best-worst method. Complex Intell. Syst. 2021, 7, 1547–1564. [Google Scholar] [CrossRef]
  18. Shen, G.X.; Zhen, J.; Zhang, Y.Z.; Song, J.; Li, Z. Risk analysis of machining center failure mode based on multi-attribute group decision making. J. Jilin Univ. (Eng. Technol. Ed.) 2022, 52, 338–344. [Google Scholar] [CrossRef]
  19. Rouabhia-Essalhi, R.; Boukrouh, E.; Ghemari, Y. Application of failure mode effect and criticality analysis to industrial handling equipment. Int. J. Adv. Manuf. Technol. 2022, 120, 5269–5280. [Google Scholar] [CrossRef]
  20. Zhu, H.L.; Liu, S.S.; Qu, Y.Y.; Han, X.X.; He, W.; Cao, Y. A new risk assessment method based on belief rule base and fault tree analysis. Proc. Inst. Mech. Eng. Part O-J. Risk Reliab. 2022, 236, 420–438. [Google Scholar] [CrossRef]
  21. Kwag, S.; Gupta, A.; Dinh, N. Probabilistic risk assessment based model validation method using Bayesian network. Reliab. Eng. Syst. Saf. 2018, 169, 380–393. [Google Scholar] [CrossRef]
  22. Zeng, Y.C.; Song, D.L.; Zhang, W.H.; Zhou, B.; Xie, M.Y.; Qi, X.Y. Risk assessment of wheel polygonization on high-speed trains based on Bayesian networks. Proc. Inst. Mech. Eng. Part O-J. Risk Reliab. 2021, 235, 182–192. [Google Scholar] [CrossRef]
  23. Mohammadi, H.; Fazli, Z.; Kaleh, H.; Azimi, H.R.; Hanifi, S.M.; Shafiee, N. Risk Analysis and Reliability Assessment of Overhead Cranes Using Fault Tree Analysis Integrated with Markov Chain and Fuzzy Bayesian Networks. Math. Probl. Eng. 2021, 2021, 6530541. [Google Scholar] [CrossRef]
  24. Zhang, Y.Z.; Mu, L.M.; Shen, G.X.; Yu, Y.; Han, C.Y. Fault diagnosis strategy of CNC machine tools based on cascading failure. J. Intell. Manuf. 2019, 30, 2193–2202. [Google Scholar] [CrossRef]
  25. Ebeling, C.E. An Introduction to Reliability and Maintainability Engineering; McGraw-Hill Education Asia Co.: New York, NY, USA, 2008. [Google Scholar]
  26. Gupta, G.; Mishra, R.P. A Failure Mode Effect and Criticality Analysis of Conventional Milling Machine Using Fuzzy Logic: Case Study of RCM. Qual. Reliab. Eng. Int. 2017, 33, 347–356. [Google Scholar] [CrossRef]
  27. Gazdík, I. Modelling systems by hypergraphs. Kybernetes 2006, 35, 1369–1381. [Google Scholar] [CrossRef]
  28. Wang, L.; Egorova, E.K.; Mokryakov, A.V. Development of Hypergraph Theory. J. Comput. Syst. Sci. Int. 2018, 57, 109–114. [Google Scholar] [CrossRef]
  29. Zhu, G.; Chen, Y.H.; Hou, J.L. Design and Realization of K-steps Fault Pervasion Algorithm. Comput. Autom. Meas. Control 2005, 13, 784–787. [Google Scholar] [CrossRef]
Figure 1. Framework for risk evaluation.
Figure 1. Framework for risk evaluation.
Mathematics 11 04083 g001
Figure 2. Fault distribution histogram of machining center components.
Figure 2. Fault distribution histogram of machining center components.
Mathematics 11 04083 g002
Figure 3. Frequency histogram of the fault mode of components.
Figure 3. Frequency histogram of the fault mode of components.
Mathematics 11 04083 g003
Figure 4. Reliability modeling process diagram.
Figure 4. Reliability modeling process diagram.
Mathematics 11 04083 g004
Figure 5. Machining center weighted components-fault mode hypergraph where the circle represents the component. The triangle represents the fault mode. The number on the connecting line represents the number of times the fault mode has occurred.
Figure 5. Machining center weighted components-fault mode hypergraph where the circle represents the component. The triangle represents the fault mode. The number on the connecting line represents the number of times the fault mode has occurred.
Mathematics 11 04083 g005
Figure 6. Example diagram of propagation and diffusion of fault mode in the machining center.
Figure 6. Example diagram of propagation and diffusion of fault mode in the machining center.
Mathematics 11 04083 g006
Figure 7. Comparison chart of the sorting trend of component fault risk value at different times.
Figure 7. Comparison chart of the sorting trend of component fault risk value at different times.
Mathematics 11 04083 g007
Table 1. Summary table of fault mode of the machining center.
Table 1. Summary table of fault mode of the machining center.
NumberMode CodeFault Mode
1101Parts are damaged
2103Damage to liquid, gas, oil parts
3301Liquid, gas, and oil leakage
4201Fasteners are loose
5203The pretight parts are loose
6404Improper pressure adjustment
7105Protective plate and cover are damaged
8108The fuse is damaged
9409Control line connection is wrong
10601Temperature rises too high
29506Moving parts
Table 2. Fault rate function of components.
Table 2. Fault rate function of components.
CodeFault Rate
M λ ( t ) = ( 1.312 / 2458.200 1.312 ) t ( 1.312 1 )
J λ ( t ) = ( 1.295 / 2606.44 0 1.295 ) t ( 1.295 1 )
S λ ( t ) = ( 0.641 / 7832.664 0.641 ) t ( 0.641 1 )
V λ ( t ) = ( 0.771 / 7935.696 0.771 ) t ( 0.771 1 )
K λ ( t ) = ( 0.931 / 5960.776 0.931 ) t ( 0.931 1 )
G λ ( t ) = ( 0.524 / 27,503.400 0.524 ) t ( 0.524 1 )
W λ ( t ) = ( 0.910 / 4145.720 0.910 ) t ( 0.910 1 )
NC λ ( t ) = ( 0.542 / 7912.399 0.542 ) t ( 0.542 1 )
L λ ( t ) = ( 0.791 / 8161.512 0.791 ) t ( 0.791 1 )
D λ ( t ) = ( 0.656 / 8905.062 0.656 ) t ( 0.656 1 )
T λ ( t ) = ( 0.773 / 6869.766 0.773 ) t ( 0.773 1 )
Table 3. Fault rate of components at 4000 h.
Table 3. Fault rate of components at 4000 h.
CodeFault RateCodeFault Rate
V0.000114G0.000048
W0.000220J0.000564
K0.000161M0.000621
D0.000097S0.000104
L0.000112T0.000127
NC0.000094
Table 4. Fault mode frequency ratio and fault mode occurrence probability of components at 4000 h.
Table 4. Fault mode frequency ratio and fault mode occurrence probability of components at 4000 h.
CodeMode α j i μ i j CodeMode α j i μ i j CodeMode α j i μ i j
M1010.31580.1403T01010.33330.0303W1010.28570.0450
5030.15790.070101030.11110.01013020.21430.0337
2020.10530.046803010.11110.01013010.14290.0225
6050.10530.046805030.11110.01018050.14290.0225
2010.05260.023406030.11110.01011030.07140.0112
2050.05260.023406050.11110.01016010.07140.0112
5010.05260.023406070.11110.01016100.07140.0112
5020.05260.0234L1030.57140.0458J1010.28570.1153
5060.05260.02343010.28570.02291030.14290.0577
6160.05260.02341010.14290.01145020.14290.0577
S4100.21430.0159V1020.77780.06346010.14290.0577
1010.14290.01061080.11110.00936030.14290.0577
2020.14290.01064090.11110.00938030.14290.0577
6050.14290.0106K01010.50000.0576G4030.42860.0146
2030.07140.005305010.16670.01923010.28570.0097
5030.07140.005302050.16670.01921030.14290.0049
6090.07140.005301050.16670.01924040.14290.0049
6030.07140.0053D03010.61540.0427NC1020.66700.0447
6060.07140.005301030.38460.02675010.33300.0223
Table 5. Two-step diffusion trajectory of mode 102.
Table 5. Two-step diffusion trajectory of mode 102.
Diffusion TrajectoryDiffusion Trajectory
v 102 e NC v 501 e k v 101 v 102 e NC v 501 e M v 605
v 102 e NC v 501 e K v 105 v 102 e NC v 501 e M v 503
v 102 e NC v 501 e K v 205 v 102 e NC v 501 e M v 201
v 102 e NC v 501 e M v 101 v 102 e NC v 501 e M v 502
v 102 e NC v 501 e M v 202 v 102 e NC v 501 e M v 506
v 102 e NC v 501 e M v 205 v 102 e NC v 501 e M v 616
Table 6. Comprehensive influence degree value at 4000 h.
Table 6. Comprehensive influence degree value at 4000 h.
ModeComprehensive Influence DegreeModeComprehensive Influence Degree
1010.017636010.00307
3010.001566070.00010
1030.004256100.00012
1020.000566160.00054
6050.002248030.00301
5030.004374090.00008
2020.002216060.00003
5010.001296090.00003
4100.000171050.00034
6030.003071080.00008
3020.000862010.00054
4030.000092030.00003
2050.000894040.00002
8050.000455060.00054
5020.00352
Table 7. Risk value of machining center component (10−4).
Table 7. Risk value of machining center component (10−4).
Code R j Code R j
V0.00052G0.00053
W0.01329J0.04204
K0.01487M0.04416
D0.00252S0.00388
L0.00604T0.00966
NC0.00075
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mu, L.; Zhang, Y.; Zhang, Q. Risk Evaluation Method Based on Fault Propagation and Diffusion. Mathematics 2023, 11, 4083. https://0-doi-org.brum.beds.ac.uk/10.3390/math11194083

AMA Style

Mu L, Zhang Y, Zhang Q. Risk Evaluation Method Based on Fault Propagation and Diffusion. Mathematics. 2023; 11(19):4083. https://0-doi-org.brum.beds.ac.uk/10.3390/math11194083

Chicago/Turabian Style

Mu, Liming, Yingzhi Zhang, and Qiyan Zhang. 2023. "Risk Evaluation Method Based on Fault Propagation and Diffusion" Mathematics 11, no. 19: 4083. https://0-doi-org.brum.beds.ac.uk/10.3390/math11194083

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop