Article

Trustworthiness Measurement Algorithm for TWfMS Based on Software Behaviour Entropy

1 School of Computer Science and Engineering, North Minzu University, Yinchuan 750021, China
2 Key Laboratory of Trustworthy Distributed Computing and Services (Ministry of Education), Beijing University of Posts and Telecommunications, Beijing 100871, China
Submission received: 3 February 2018 / Revised: 10 March 2018 / Accepted: 10 March 2018 / Published: 14 March 2018

Abstract: As the virtual mirror of the complex real-time business processes of organisations’ underlying information systems, the workflow management system (WfMS) has emerged in recent decades as a new self-autonomous paradigm in the open, dynamic, distributed computing environment. In order to construct a trustworthy workflow management system (TWfMS), the design of a software behaviour trustworthiness measurement algorithm is an urgent task for researchers. Accompanying the trustworthiness mechanism, a measurement algorithm that can cope with uncertain information about the software behaviour trustworthiness of the WfMS must be provided as part of the infrastructure. Based on the framework presented in our prior research, we first introduce a formal model for WfMS trustworthiness measurement, in which the main properties are reasoned about using calculus operators. Secondly, this paper proposes a novel measurement algorithm that derives the software behaviour entropy of the calculus operators through the principle of maximum entropy (POME) and a data mining method. Thirdly, the trustworthiness measurement algorithm for incomplete software behaviour tests and runtime information is discussed and compared in a detailed explanation. Finally, we provide conclusions and discuss future research areas for the TWfMS.

1. Introduction

The workflow management system (WfMS) is considered a multidisciplinary, system-of-systems (SoS)-oriented software under the modern IT background of cloud computing, the internet of things (IoT), big data, and other advanced technologies for the future. When a WfMS encounters unexpected accidents, human intervention or offline adjustment is insufficient for its complex commissions and high availability requirements. Similar to real-time systems, a WfMS requires online adjustment to accommodate random changes occurring in its surroundings or software architecture under human control strategies, which may affect its functionalities or non-functionalities and transaction data consistency, and even its trustworthiness level. For example, addressing course timetabling problems [1] from the case-based reasoning (CBR) system viewpoint, to adapt to the limited classroom capacity for the ever-increasing number of students, the CBR system may change the venue to a new classroom with larger capacity to prevent the basic teaching functionality from being interrupted. In addition, to enable more effective teaching, the CBR system may adjust the venue to a new classroom that better suits the preferences of the teacher and students, thereby optimising the potential teaching non-functionality. Furthermore, to achieve more consistent teaching transaction log data, the CBR system may recover or repair the transactions into a uniform state; otherwise, complete student achievement records could not be generated from the conflicting data. Although the CBR system works for education-oriented business process management tasks, i.e., a special type of WfMS, the random changes mentioned above could mislead the WfMS software behaviour and eventually, directly or indirectly, affect the WfMS service behaviour realised by its software, ultimately reducing the WfMS software trustworthiness and resulting in users abandoning the candidate WfMS. In order to solve the abovementioned problem, we present a novel measurement algorithm based on our previous research results. Our approach includes a self-automatic framework for random changes and related algorithms, concentrating on measuring the similarity between software behaviour and its claims through black-box testing with incomplete software behaviour entropy. The remainder of this paper consists of three sections: Section 2 discusses related work on measurement methods regarding the WfMS; Section 3 illustrates the novel measurement algorithm from two perspectives, namely the calculus implemented on the WfMS and the normal forms of the WfMS; and Section 4 discusses the proposed algorithm and summarises future research areas.

2. Related Work

Software trustworthiness [2] is considered a non-functional combination of software quality attributes against the conformance degree oriented to subjective user evaluation. Against this background, the difference between user requirements and actual software behaviour determines the trustworthiness perceived by the user. In recent decades, considerable efforts have been made to design intelligent algorithms to measure the difference between objective software or service behaviour and subjective user evaluation. In references [3,4], the authors define similarity measures based on the firing sequences generated from transition adjacency relations (TARs). The corresponding distance measures between processes were taken as a metric to be applied in artificial processes and in evaluations for clustering real-life processes. As a promising candidate for further research, the authors of reference [5] took into account the active feedback of evaluation data in workflow process modelling, which encompassed the entire lifecycle of the workflow and enabled active real-time controlling using workflow audit trail data from three perspectives, namely process, resource, and object. In order to assist users in selecting a workflow with appropriate quality of service (QoS) to meet their requirements, reference [6] proposed a novel approach to scientific workflow retrieval with cost constraints and defined a distance measure for comparing the similarities among cost constrained graphs (CCGs), through which workflow retrieval and ranking could be conducted based on similarity computations. To allow companies to respond to changing markets by creating product variants derived from different combinations of existing or new modular components, reference [7] sought to define and measure workflow modularity. The authors made three contributions: they designed two important performance measures regarding flow time and flexibility; they proposed an integer nonlinear programming (INLP) optimisation model for designing modular workflows that can be adopted for small processes; and they presented a heuristic model for the same purpose that can be adopted for larger processes. From a workflow standardisation perspective, in order to translate business data into bytes that are consumable by daily information systems with less automation difficulty and more reliability, as well as to make business competition comparisons more precise and invariable, reference [8] focused on health care data and workflow, reengineered the coding workflow and performance benchmarks, and proposed the establishment of useful tools for improving data quality, which can be used by everyone. In the practical computation environment known as grid computing, reference [9] layered the clustering approach for reusable workflows into a hierarchical model consisting of activity, event, condition-action, rule, and process similarity measures based on event-condition-action (ECA) theories. In the industry-integrated manufacturing environment, digital printing should eliminate redundant processes in order to shorten press runs and save costs. In order to select equipment and software for supporting such workflows, reference [10] introduced a workflow configuration tool that can elicit customer printing requirements, and classified the requirements into sets of equivalent workflows according to their configuration using a compression-based dissimilarity measure (CDM) approach.
The continuous improvement of business process management (BPM) is an ongoing issue for the WfMS, and in order to address this, a solid understanding of the success elements and waste experienced during the same workflow process is essential. With the trend of lean production, reference [11] surveyed research and presented a conceptualisation for understanding workflow and simultaneously occurring waste in the production of buildings through three different dimensions: smoothness (a high level of direct work), quality, and intensity. Moreover, they summarised the methods for this purpose, consisting of an integrated method of observation and self-reporting, as well as a last planner system (LPS)-based method that measures workflow as the handover of work between trades.
From an overview of the above related works, we can draw the conclusion that BPM measurement can be transformed from socialised human production to mechanised computing by means of electronic equipment. Therefore, there is a need for a comprehensive approach to connect the macroscopic and microscopic measurements systematically, thereby reflecting the user’s subjective experience of the trustworthiness of BPM according to the objective evidence of trustworthiness of the WfMS. However, in the dynamic and open WfMS runtime environment, the goal of comprehensively interpreting and obtaining the WfMS software behaviour is almost impossible to achieve. Generally, we can only obtain partial software behaviour, which represents incomplete trustworthiness of the entire service QoS without sufficient preciseness when the parameters regarding other software behaviour are uncertain [12]. Accordingly, we first present a quality management system (QMS) [13] of maximum entropy (QMSOME) derived from the trustworthy WfMS (TWfMS) framework, which is a thermodynamics-related interpretive model, as the virtual mirror of complex real-time business processes of the underlying information systems of organisations. Secondly, we present a novel recursive measurement algorithm for WfMS trustworthiness by means of a hybrid method derived from large-scale software black-box testing, as illustrated in the following sections, which is also an application of the principle of maximum entropy (POME) or maximum-entropy principle (MEP) [14,15].

3. Measurement Algorithm for the Workflow Management System Trustworthiness

In line with the related works discussed above, as a special type of QMS for workflow quality management, a WfMS experiences multiple loops consisting of re-engineering and/or reorganisation. In the long term, this process improves the WfMS from low-level disorder (high information-system entropy) into high-level order (relatively low information-system entropy), while consistently maintaining its trustworthiness as defined by users. This cycle, which covers the entire life of the WfMS, is also known as the resilience engineering (RE) [16] model and is illustrated in this section. Thereafter, we introduce its definition, including integrated manufacturing business processes and their translation into models. Prior to proposing the RE model for the WfMS, we illustrate our thoughts regarding the TWfMS according to the standard WfMC reference model, extended with methods and mechanisms, which is the fundamental goal of RE. Considering that the implementation of RE for the WfMS is ultimately imposed on the WfMS components, we first provide the preliminary component definition for the WfMS. Secondly, as the fundamental goal of WfMS RE is to assure its trustworthiness as perceived by users, we provide the trustworthy component definition of the WfMS. Thirdly, in order to certify WfMS recovery as its common function under RE, we present a normal form (NF) set [17] as a so-called paradigm to label the WfMS trustworthiness level at runtime.

3.1. Formal Representation of the Workflow Management System Trustworthiness

For SoS-oriented, complex system software, a formal representation of the mechanisms and methods revealing the internal principles of WfMS trustworthiness is imperative for the design, development, and maintenance tasks covering the entire WfMS lifecycle. In our prior research, we presented a reference model for the TWfMS [18,19], inspired by the RE concept and illustrated in Figure 1. As indicated in Figure 1, we expand the WfMC reference model with interfaces no.0 and no.6–no.11. In the following paragraphs, we explain the differences between the TWfMS and WfMS in terms of each of these interfaces.
(1)
Interface no.0 is linked to the core work engine(s) component of the WfMS and the additional self-configuration-parameter system for the WfMS (SCP4WMS) RE tool in the process execution service module; that is, we consider the SCP4WMS tools an extension of and supplementary to the process execution service module.
(2)
Interfaces no.6 and no.7 are linked to the self-optimization framework system for the WfMS (SOF4WMS) and self-healing model system for WfMS (SHM4WMS) RE tools, respectively, with the management and monitoring tool, which constructs the TWfMS mechanism with the SCP4WMS tool; that is, we consider the SOF4WMS and SHM4WMS tools extensions of and supplementary to the management and monitoring tool.
(3)
Interface no.8 is linked to the tools for communication on called application of typical web services with an additional tool, the auto construction method for the WfMS (ACM4WMS), based on services combination; that is, we consider the ACM4WMS tool an extension of and supplementary to the standard tools linked to the process execution service module via interface no.3.
(4)
Interface no.9 is linked to the requirement auto-analysis tool with the process definition tool, where the former consists of four components known as acquisition, decomposition, combination, and verification based on a Petri net (ADCV-PN); that is, we consider the ADCV-PN tools extensions of and supplementary to the process definition tool.
(5)
Interface no.10 is connected to the management and monitoring tool with the ACM4WMS tool when the WfMS encounters “local break points”, whereby the WfMS trustworthiness can no longer be maintained by the management and monitoring tool, even with the assistance of the tool sets of SCP4WMS, SOF4WMS, and SHM4WMS. In the context of the scenario described above, via interface no.10, the management and monitoring tool transfers the exceptional event unsolved by the SCP4WMS, SOF4WMS, and SHM4WMS tool sets sequentially to the ACM4WMS tool, in order to reconstruct the WfMS by searching for resources in the cloud. At such a time, we consider the WfMS as beginning local resilience engineering (LRE).
(6)
Interface no.11 is connected to the management and monitoring tool with the ADCV-PN tools when the WfMS encounters “global break points”, whereby the WfMS trustworthiness can no longer be sufficiently assured by means of the ACM4WMS tool, even if all of the resources in the cloud are traversed by that tool. In the context of the scenario described above, via interface no.11, the management and monitoring tool finally transfers the exceptional event unsolved by the ACM4WMS tool to the ADCV-PN tools, in order to remodel the WfMS under user validation. At such a time, we consider the WfMS as beginning global resilience engineering (GRE).
Based on the above model, from the implementation view of software architecture, we propose the core components of the methods and mechanisms [18,19] as illustrated in Figure 2. Compared with the method of trustworthiness and the TWfMS in the cloud in Figure 2, here we place emphasis on the mechanisms of trustworthiness, comprising SCP4WMS, SOF4WMS, and SHM4WMS (a minimal code sketch of how these mechanisms hand parameters to one another is given after the following list):
  • SCP4WMS means self-configuration-parameter system for the WfMS. It has the function of analysing the parameters transferred from the trustworthiness data collection (TDC) component, which gathers real-time data from the WfMS at the multilevel of components, component combinations, and application software. According to the analysis, SCP4WMS carries out the following procedures.
    1.1.
    If the parameters of the operating environment variables of workflow engines have changed and will cause WfMS failure and improper operation, SCP4WMS will modify the parameters of the WfMS itself according to predefined rules and return the new parameters of the WfMS itself to the TDC in order to be adaptable to the new environment variables of the workflow engines.
    1.2.
    SCP4WMS should transmit the remaining parameters to SOF4WMS to deal with other WfMS mechanisms.
    1.3.
    SCP4WMS should compute the WfMS behaviour trustworthiness according to the algorithm illustrated in Section 3.2 and transform it into the subsequent mechanism SOF4WMS.
    1.4.
    When SCP4WMS receives the new parameters of the WfMS itself, modified by SOF4WMS, it should return these to the TDC in order to be optimised with the new operating condition variables of the workflow engines.
    1.5.
    When SCP4WMS receives the new parameters of the WfMS itself, modified by SHM4WMS, it should return these to the TDC in order to be recovered with the new transaction consistence variables of the workflow engines.
  • SOF4WMS means self-optimisation framework system for the WfMS, and one of its functions is analysing the parameters transferred from the SCP4WMS component. According to the analysis, SOF4WMS carries out the following procedures.
    2.1.
    If the parameters of the operating condition variables of the workflow engines have changed and will lead to worse or better WfMS performance, SOF4WMS will modify the parameters of the WfMS itself, according to predefined ECA rules, and return the new parameters of the WfMS itself to SCP4WMS to be optimised with the new operating condition variables of the workflow engines.
    2.2.
    SOF4WMS should transmit the remaining parameters to SHM4WMS in order to deal with other WfMS mechanisms.
    2.3.
    SOF4WMS should verify the WfMS behaviour trustworthiness according to the algorithm for software/service behaviour trustworthiness validation and transform it into the subsequent mechanism SHM4WMS.
    2.4.
    When SOF4WMS receives the new parameters of the WfMS itself, modified by SHM4WMS, it should return these to SCP4WMS to be recovered with the new transaction consistence variables of the workflow engines.
  • SHM4WMS means self-healing-model system for the WfMS, and one of its functions is analysing the parameters transferred from the SOF4WMS component. According to the analysis, SHM4WMS carries out the following procedures.
    3.1.
    If the parameters of the transaction consistence variables of the workflow engines have changed and will cause an inconsistent transaction record in the WfMS, SHM4WMS will modify the parameters of the WfMS itself, according to predefined ECA rules, and return the new parameters to SOF4WMS to be recovered with the new transaction consistence variables of the workflow engines.
    3.2.
    SHM4WMS should compute the WfMS behaviour trustworthiness according to the NF algorithm (NF paradigms):
    3.2.1.
    If the WfMS NF is higher than or equal to the requirement NF of users when the ACM4WMS constructs the WfMS at the initial time, jump to step (3.3) directly.
    3.2.2.
    Otherwise, SHM4WMS will transfer it to the TWfMS method; that is, SHM4WMS will suggest that the management and monitoring tool transfer it to ACM4WMS through the local RE path.
    3.2.3.
    The management and monitoring tool transfers the NF that is lower than that of the initial WfMS to ACM4WMS through the local RE path.
    3.3.
    If ACM4WMS can successfully reconstruct a WfMS with a new NF that is higher than or equal to the requirement NF of users from the source services in the cloud, then jump to step (3.5) directly.
    3.4.
    Otherwise, the management and monitoring tool sends the NF that is lower than that of the initial WfMS to the ADCV-PN tools through the global RE path (that is, the ADCV-PN tool set will begin to remodel the WfMS under user validation).
    3.5.
    Register the new WfMS NF into the SHM4WMS database and return to step (1).
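The three mechanisms above can be read as a chain of responsibility over the parameters gathered by the TDC: each mechanism handles the variable category it owns (environment, operating condition, and transaction consistency, respectively), forwards the remainder, and returns the modified parameters back up the chain. The following minimal Python sketch illustrates only this hand-off; the class names, parameter categories, and rule placeholders are hypothetical stand-ins for the predefined ECA rules and trustworthiness computations described above, not the paper's implementation.

```python
# Hypothetical sketch of the SCP4WMS -> SOF4WMS -> SHM4WMS hand-off described above.
# Parameter categories, class names, and rules are illustrative placeholders only.

class Mechanism:
    """Base class: adjust the parameters this mechanism owns, forward the rest."""
    category = None          # which parameter category this mechanism reacts to
    successor = None         # next mechanism in the sequence list (or None)

    def adjust(self, value):
        return value         # placeholder for the predefined (ECA) rules

    def handle(self, params):
        owned = {k: self.adjust(v) for k, v in params.items() if k[0] == self.category}
        rest = {k: v for k, v in params.items() if k[0] != self.category}
        forwarded = self.successor.handle(rest) if self.successor else {}
        # modified parameters are returned back up the chain (procedures 1.4/1.5, 2.4)
        return {**owned, **forwarded}

class SCP4WMS(Mechanism): category = "environment"
class SOF4WMS(Mechanism): category = "condition"
class SHM4WMS(Mechanism): category = "transaction"

# Wire the chain in the order used in the paper: SCP4WMS -> SOF4WMS -> SHM4WMS.
scp, sof, shm = SCP4WMS(), SOF4WMS(), SHM4WMS()
scp.successor, sof.successor = sof, shm

# Parameters gathered by the TDC, keyed by (category, name) -- hypothetical values.
tdc_params = {
    ("environment", "engine_home"): "/opt/wfms",
    ("condition", "pool_size"): 8,
    ("transaction", "isolation"): "read_committed",
}
print(scp.handle(tdc_params))   # new parameters returned to the TDC
```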

3.2. Measurement Algorithm for the Trustworthy Workflow Management System Based on Calculus

In this section, we introduce the general recursive measurement algorithm for the TWfMS, based on calculus priority. We model the application server (AS) of the WfMS as a set of components and operations organised as a tree.
Firstly, the initial trustworthiness of the AS is set to the value of 0.5, which means that, when the ACM4WMS tool constructs the WfMS at the initial time, we cannot judge whether it is more or less trustworthy than this neutral value, as illustrated in the following Equation (1):
$$AS^{Initial\_Trustworthiness} = \langle C, O \rangle = 0.5.$$
Secondly, we add the start and end components as the first and last activities of the original process, with an absolute trustworthiness value of 1, which means they are coded as simple but stable start or stop programs that we trust absolutely. Furthermore, we set the other components, namely the remaining intermediate activities, to an initial trustworthiness value of 0.5 for the same reason as mentioned above. The initial trustworthiness of the components is expressed by the following Equation (2):
$$C = Components = \left\{ Original\_process_{Start}^{Initial\_Trustworthiness = 1},\ Original\_process_{i,j,k \in [1,N]}^{Initial\_Trustworthiness = 0.5},\ Original\_process_{End}^{Initial\_Trustworthiness = 1} \right\}.$$
Thirdly, we classify the operations among these components into three types of calculus. That is, consider an operated component A that is combined with an operating component B via a dependency calculus C:
If, following the operation, the trustworthiness of A is replaced with the minimum trustworthiness of components A and B, we name C strong dependency calculus.
If, following the operation, the trustworthiness of A is replaced with the multiplication value of the trustworthiness of components A and B, we name C indirect dependency calculus.
If, following the operation, the trustworthiness of A is replaced with the average value of the trustworthiness of components A and B, we name C weak dependency calculus.
The operations can be illustrated by the following Equations: (3), from the perspective of calculus classes, or (4), from the sequence-order perspective of the start, intermediate, and end activities.
$$O = Operations = \left\{ \begin{array}{l} SDC^{Unsearched} = \{Strong\_Dependency\_Calculus^{Unsearched}\}, \\ IDC^{Unsearched} = \{Indirect\_Dependency\_Calculus^{Unsearched}\}, \\ WDC^{Unsearched} = \{Weak\_Dependency\_Calculus^{Unsearched}\} \end{array} \right\}$$
$$O = Operations = \left\{ Calculus_{start,1}^{Unsearched};\ Calculus_{i,j}^{Unsearched};\ Calculus_{k,End}^{Unsearched} \right\}.$$
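As a concrete illustration of the three dependency calculi defined above, the minimal sketch below shows how the trustworthiness of an operated component A is updated by an operating component B under each calculus; the numeric values are hypothetical.

```python
# Minimal sketch of the three dependency calculi defined above.
# Trustworthiness values are scalars in [0, 1]; the example values are hypothetical.

def strong_dependency(t_a: float, t_b: float) -> float:
    """SDC: A's trustworthiness becomes the minimum of A and B."""
    return min(t_a, t_b)

def indirect_dependency(t_a: float, t_b: float) -> float:
    """IDC: A's trustworthiness becomes the product of A and B."""
    return t_a * t_b

def weak_dependency(t_a: float, t_b: float) -> float:
    """WDC: A's trustworthiness becomes the average of A and B."""
    return (t_a + t_b) / 2.0

t_a, t_b = 0.8, 0.6
print(strong_dependency(t_a, t_b))    # 0.6
print(indirect_dependency(t_a, t_b))  # 0.48
print(weak_dependency(t_a, t_b))      # 0.7 (up to floating point)
```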
The trustworthiness of AS can be computed by the following Algorithm 1:
Algorithm 1: General_Recursive_Measure($AS^{Initial\_Trustworthiness}$)
Input: $AS^{Initial\_Trustworthiness} = 0.5$; $GRM = 0.5$; Output: $AS^{Trustworthiness} \in [0, 1]$;
1: If $AS.Components = \emptyset$ then
2: Return; End if;
3: For every $Op_{i}^{Initial\_Trustworthiness}$, $i \in [start, 1, 2, \ldots, n, end]$, do
    // Traverse the tree in the proper order of calculus priority, from high to low:
    // $SDC^{Unsearched} > IDC^{Unsearched} > WDC^{Unsearched}$
4: Compute the behaviour trustworthiness $Op_{i}^{Trustworthiness}$ for $Op_{i}^{Initial\_Trustworthiness}$, and replace $Op_{i}^{Initial\_Trustworthiness}$ with $Op_{i}^{Trustworthiness}$ in $AS^{Initial\_Trustworthiness}$;
5: While there exists $Calculus_{i,j}^{Unsearched} \in SDC^{Unsearched}$ do
6: $GRM$ = General_Recursive_Measure($Op_{j}^{Initial\_Trustworthiness}$);
7: Replace $Calculus_{i,j}^{Unsearched}$ with $Calculus_{i,j}^{Searched}$ in $AS^{Initial\_Trustworthiness}$;
8: Set $Op_{i}^{Trustworthiness} \leftarrow Op_{i}^{Trustworthiness}\ (Calculus_{i,j}^{Searched})\ GRM$;
9: End do;
10: While there exists $Calculus_{i,j}^{Unsearched} \in IDC^{Unsearched}$ do
11: $GRM$ = General_Recursive_Measure($Op_{j}^{Initial\_Trustworthiness}$);
12: Replace $Calculus_{i,j}^{Unsearched}$ with $Calculus_{i,j}^{Searched}$ in $AS^{Initial\_Trustworthiness}$;
13: Set $Op_{i}^{Trustworthiness} \leftarrow Op_{i}^{Trustworthiness}\ (Calculus_{i,j}^{Searched})\ GRM$;
14: End do;
15: While there exists $Calculus_{i,j}^{Unsearched} \in WDC^{Unsearched}$ do
16: $GRM$ = General_Recursive_Measure($Op_{j}^{Initial\_Trustworthiness}$);
17: Replace $Calculus_{i,j}^{Unsearched}$ with $Calculus_{i,j}^{Searched}$ in $AS^{Initial\_Trustworthiness}$;
18: Set $Op_{i}^{Trustworthiness} \leftarrow Op_{i}^{Trustworthiness}\ (Calculus_{i,j}^{Searched})\ GRM$;
19: End do;
20: End for;
21: Set $AS^{Initial\_Trustworthiness} \leftarrow Op_{i}^{Trustworthiness}$;
22: Replace $AS^{Initial\_Trustworthiness}$ with $AS^{Trustworthiness}$;
23: Return $AS^{Trustworthiness}$.
Following completion of this algorithm, the trustworthiness of AS should be computed as follows:
$$AS^{Trustworthiness} = \langle C, O \rangle \in [0, 1].$$
$$C = Components = \left\{ Original\_process_{Start}^{Trustworthiness = 1},\ Original\_process_{i,j,k \in [1,N]}^{Trustworthiness \in [0,1]},\ Original\_process_{End}^{Trustworthiness = 1} \right\}.$$
$$O = Operations = \left\{ \begin{array}{l} SDC^{Searched} = \{Strong\_Dependency\_Calculus^{Searched}\}, \\ IDC^{Searched} = \{Indirect\_Dependency\_Calculus^{Searched}\}, \\ WDC^{Searched} = \{Weak\_Dependency\_Calculus^{Searched}\} \end{array} \right\}$$
$$O = Operations = \left\{ Calculus_{start,1}^{Searched};\ Calculus_{i,j}^{Searched};\ Calculus_{k,End}^{Searched} \right\}.$$
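To make the recursion in Algorithm 1 concrete, the following Python sketch traverses a small component tree in calculus-priority order (SDC, then IDC, then WDC) and folds the recursively measured trustworthiness of each operating component into its operated component. The tree, the component names, and the trustworthiness values are illustrative assumptions, not part of the paper.

```python
# Hypothetical sketch of Algorithm 1 (General_Recursive_Measure).
# Each node carries an initial trustworthiness and a list of outgoing
# dependency calculi (SDC/IDC/WDC) to child components.

SDC, IDC, WDC = "SDC", "IDC", "WDC"   # strong / indirect / weak dependency calculus

COMBINE = {
    SDC: min,                          # strong: minimum of the two values
    IDC: lambda a, b: a * b,           # indirect: product
    WDC: lambda a, b: (a + b) / 2.0,   # weak: average
}
PRIORITY = [SDC, IDC, WDC]             # traversal priority, from high to low

class Component:
    def __init__(self, name, trust=0.5):
        self.name = name
        self.trust = trust             # initial trustworthiness
        self.calculi = []              # list of (calculus_type, child Component)

def general_recursive_measure(node: Component) -> float:
    """Recursively measure a component's trustworthiness, Algorithm 1 style."""
    for calc_type in PRIORITY:                           # lines 5-19: by priority
        for kind, child in node.calculi:
            if kind != calc_type:
                continue
            grm = general_recursive_measure(child)       # recursive call (line 6)
            node.trust = COMBINE[kind](node.trust, grm)  # apply the calculus (line 8)
    return node.trust                                    # line 23

# A tiny application-server tree: start and end are absolutely trusted (= 1),
# intermediate components start at the neutral value 0.5.
start, a, b, end = Component("start", 1.0), Component("A"), Component("B"), Component("end")
a.calculi = [(SDC, b), (WDC, end)]
start.calculi = [(IDC, a)]

print(general_recursive_measure(start))   # AS trustworthiness in [0, 1]
```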

3.3. Basic Software Behaviour Trustworthiness Metric for Components

In this section, based on our prior work on data mining [20,21] and now considering an uncertain environment, we introduce the basic component trustworthiness metrics, expanded with POME, in order to obtain the appropriate trustworthiness entropy of software behaviour from the deterministic software behaviour of the AS components.
Definition 1.
Trust is a three-tuple $(E_1, E_2, te_{E_1 E_2})$, where $E_1$ is the trustor, $E_2$ is the trustee, and $te_{E_1 E_2}$ is the value of trust entropy placed by $E_1$ upon $E_2$, with $E_1 \cap E_2 = \emptyset$, $E_1 \neq E_2$, and $te_{E_1 E_2} \in [0, 1]$.
Definition 2.
Software trustworthiness entropy ($TE$) is a combination entropy attribute consisting of sub-attributes according to the requirement, where $TE \in [0, 1]$ and a greater value of $TE$ results in higher trust in the software.
Definition 3.
Software initialisation trustworthiness entropy ($T_{site}(s)$) is set at software start-up, where $T_{site}(s) \in [0, 1]$ and greater values of $T_{site}(s)$ mean that higher trust in the initialised software is required.
Definition 4.
Software trusted threshold entropy ($T_{stte}(s)$) is set by the user prior to the software running, where $T_{stte}(s) \in [0, 1]$ and greater values of $T_{stte}(s)$ mean that higher trust in the terminated software is required.
Definition 5.
Software runtime trustworthiness entropy ($T_{srte}(s)$) is measured at software runtime by a software measurement tool or agent, according to its actual behaviour and user evaluation.
It is clear that the trustworthy software running condition should be $T_{site}(s) \geq T_{srte}(s) \geq T_{stte}(s)$; otherwise, the software should be terminated.
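A runtime monitor would enforce this condition as a simple ordering check. The sketch below assumes the ordering $T_{site}(s) \geq T_{srte}(s) \geq T_{stte}(s)$ as reconstructed above and uses hypothetical threshold values.

```python
# Sketch of the trustworthy running condition; the numeric values are hypothetical.

def is_trustworthy_run(t_site: float, t_srte: float, t_stte: float) -> bool:
    """True iff T_site(s) >= T_srte(s) >= T_stte(s); otherwise the software should be terminated."""
    return t_site >= t_srte >= t_stte

print(is_trustworthy_run(t_site=0.9, t_srte=0.7, t_stte=0.6))  # True: keep running
print(is_trustworthy_run(t_site=0.9, t_srte=0.5, t_stte=0.6))  # False: terminate
```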
From the perspective of software engineering, all of the initial software attributes can be reflected by software test data entropy (STDE). It is well known that any STDE partition can be uniquely associated with an equivalence relation on the STDE. Therefore, we define the STDE as the static trustworthiness data that reflect $T_{site}(s)$ through the equivalence partition of black-box testing prior to delivering the software.
In contrast, all dynamic software attributes can only be reflected by software executed data entropy (SEDE). Thus, we define SEDE as dynamic trustworthiness data that reflect $T_{srte}(s)$ through an equivalence partition approach to black-box testing after delivering the software and comparison with the equivalence partition on STDE.
Definition 6.
Assume that $X$ is an incomplete and finite collection consisting of STDE or SEDE, and recall that an equivalence relation $R$ on $X$ is a mapping $R: X \times X \to \{0, 1\}$.
Therefore, we denote $R_{T_i}$ as test data when $R_{T_i}$ is a real case of $R$ defined above and collected from a software test environment prior to being delivered for use.
In contrast, we denote $R_{E_i}$ as executed data when $R_{E_i}$ is a real case of $R$ defined above and collected from a software runtime environment after being delivered for use.
Definition 7.
Taking the equivalence relation $R$ as rule-type information, according to artificial intelligence theory, we can introduce the theory for software trustworthiness measurement and evaluation. Here, $R$ is represented as follows:
$$\text{if } R \text{ then } H; \quad 0 \leq CF(R),\, CF(R, H) \leq 1,$$
where $H$ denotes the trustworthiness of the owning trustee. The rule can be explained as follows: given that $R$ occurred with probability $CF(R)$, the trustee is the software itself, and the rule $(R, H)$ holds with probability $CF(R, H)$; thus, the trustworthiness of the software is $H$ with probability $CF(H)$.
We can calculate CF(H) by means of criteria 1 to 3.
Criterion 1.
According to the definitions above, CF(H) can be calculated as follows:
$$T_{site}(s) = CF(H) = CF(R, H) \times CF(R).$$
Criterion 2.
Given an equivalence relation $R$ on $X = \{x_1, x_2, \ldots, x_l\}$, assume that we have two partitions of the test space $X$. According to the definition above, $X$ comprises:
$$X = \bigcup_{i=1}^{Card(\{(R_i, H)\})} R_i.$$
$$P_{STDE} = \{R_{T_1}, \ldots, R_{T_p}\}, \quad R_{T_i} \cap R_{T_j} = \emptyset \ (i \neq j), \quad \bigcup_{i=1}^{p} R_{T_i} = STDE \subseteq X.$$
$$P_{SEDE} = \{R_{E_1}, \ldots, R_{E_q}\}, \quad R_{E_i} \cap R_{E_j} = \emptyset \ (i \neq j), \quad \bigcup_{i=1}^{q} R_{E_i} = SEDE \subseteq X.$$
Criterion 3.
According to the definitions above, given an equivalence relation $R$ on $X = \{x_1, x_2, \ldots, x_l\}$ and $n = Card(\{(R_i, H)\})$, $CF(R)$ can be calculated as follows:
$$CF(R) = \sum_{i=1}^{n} \left( \frac{\sum_{j=1,\, x_j \in (R_i, H)}^{l} pr(R_i \mid x_j)}{\sum_{j=1,\, x_j \notin (R_i, H)}^{l} pr(R_i \mid x_j)} \right),$$
where the uncertainty regarding $CF(R)$, consisting of $pr(R_i \mid x_j)$ and measured by the entropy function for $x_j \in X$ with $n = Card(\{(R_i, H)\})$, is given as follows:
$$\text{Maximise:} \quad \max_{x_j \in X} \left( -\sum_{i=1}^{n} pr(R_i \mid x_j) \ln pr(R_i \mid x_j) \right),$$
$$\text{Subject to:} \quad \sum_{i=1}^{n} pr(R_i \mid x_j) = 1, \quad pr(R_i \mid x_j) \geq 0,$$
$$\sum_{i=1}^{n} pr(R_i \mid x_j)\, f_k(R_i) = E[f_k] = F_k, \quad k \in [1, m],$$
where $pr(R_i \mid x_j)$ is the probability of each possible set of information or state $R_i$, related to evidence of whether or not the equivalence relation $R$ belongs to the user requirements, given every test item $x_j$ of $X$. Suppose that we obtain:
$$pr(R_i \mid x_j) = \frac{1}{Z(\lambda_1, \lambda_2, \ldots, \lambda_m)} \exp\left[ \lambda_1 f_1(R_i) + \lambda_2 f_2(R_i) + \cdots + \lambda_m f_m(R_i) \right],$$
where $Z(\lambda_1, \lambda_2, \ldots, \lambda_m) = \sum_{i=1}^{n} \exp\left[ \lambda_1 f_1(R_i) + \lambda_2 f_2(R_i) + \cdots + \lambda_m f_m(R_i) \right]$, and the $\lambda_k$ parameters are Lagrange multipliers whose values are determined by $F_k = \frac{\partial}{\partial \lambda_k} \ln Z(\lambda_1, \lambda_2, \ldots, \lambda_m)$.
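The maximum-entropy distribution above can be computed numerically by minimising the convex dual $\ln Z(\lambda) - \sum_k \lambda_k F_k$, whose gradient vanishes exactly when the moment constraints $E[f_k] = F_k$ hold. The sketch below is a minimal illustration under assumed feature functions and target moments; it follows the sign convention of the equation above and is not the paper's implementation.

```python
# Minimal sketch of the POME step: find pr(R_i | x_j) of the exponential form
# above by fitting the Lagrange multipliers to the target moments F_k.
import numpy as np
from scipy.optimize import minimize

# Hypothetical setup: n = 4 states R_1..R_4, m = 1 feature with f_1(R_i) = i.
f = np.array([[1.0, 2.0, 3.0, 4.0]])   # shape (m, n): feature values f_k(R_i)
F = np.array([3.2])                    # target moments F_k = E[f_k] (assumed)

def distribution(lam):
    """pr(R_i) = exp(sum_k lam_k * f_k(R_i)) / Z(lam)."""
    w = np.exp(lam @ f)
    return w / w.sum()

def dual(lam):
    """Convex dual ln Z(lam) - sum_k lam_k * F_k; minimised at the maxent solution."""
    return np.log(np.exp(lam @ f).sum()) - lam @ F

def dual_grad(lam):
    """Gradient of the dual: E[f_k] - F_k under the current distribution."""
    return f @ distribution(lam) - F

res = minimize(dual, x0=np.zeros(1), jac=dual_grad, method="BFGS")
p = distribution(res.x)
print(p)        # maximum-entropy probabilities pr(R_i)
print(f @ p)    # approximately [3.2], so the moment constraint is met
```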
Above, we have introduced the definitions of STDE and SEDE, associated with $T_{site}(s)$ calculated on the STDE. We now consider formulating the congruence measurement from the perspective of the partitions on STDE and SEDE in order to calculate $T_{srte}(s)$.
It is critical to obtain a mapping $congruence: P_{STDE} \times P_{SEDE} \to [0, 1]$ (where $P$ stands for the equivalence relation on the software test data entropy (STDE) or software executed data entropy (SEDE)) indicating the degree of congruence or similarity between $P_{STDE}$ and $P_{SEDE}$.
Here, we calculate the congruence between $P_{STDE}$ and $P_{SEDE}$ using the underlying equivalence relations. We note that if, for $x \neq y$, we denote an unordered pair by $\langle x, y \rangle$, with $\langle x, y \rangle = \langle y, x \rangle$, and if $X$ has $n = Card(\{(R_i, H)\})$ elements, then we have $\binom{n}{2} = \frac{n(n-1)}{2} = {}_{n}C_{2}$ unordered pairs.
We now suggest a general congruence measure between partitions of P S T D E and P S E D E , which we express in terms of their underlying equivalence relations.
$$\overline{Cong}(P_{STDE}, P_{SEDE}) = 1 - \frac{Diff\_Val(P_{STDE}, P_{SEDE})}{{}_{n}C_{2}},$$
where $D = Diff\_Val(P_{STDE}, P_{SEDE})$ is the number of unordered pairs that have different values under $P_{STDE}$ and $P_{SEDE}$. Then, we can calculate the software runtime trustworthiness $T_{srte}(s)$ from the software initialisation trustworthiness $T_{site}(s)$:
$$T_{srte}(s) = \overline{Cong}(P_{STDE}, P_{SEDE}) \times T_{site}(s).$$
In the section above, we have introduced a general measure of similarity or congruence between two partitions on STDE and SEDE using the underlying equivalence relations. Equation (19) implies that we should traverse all of the equivalence relations from the STDE and SEDE circularly; thus, the largest complexity of Equation (19) is $O\big( (Card(R_i) \mid (R_i, H) \in X)^2 \big)$.
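Because $Diff\_Val$ counts the unordered pairs of elements on which the two equivalence relations disagree, the pair-based congruence can be computed directly from two partition labelings, as in the following sketch; the data items, labels, and the assumed $T_{site}(s)$ value are hypothetical.

```python
# Sketch of the pair-based congruence: Cong = 1 - Diff_Val / nC2, where Diff_Val
# counts unordered pairs classified differently by the two partitions.
from itertools import combinations

def pair_congruence(p_stde: dict, p_sede: dict) -> float:
    """p_stde / p_sede map each element of X to its equivalence-class label."""
    items = sorted(p_stde)                       # the test space X (same keys in both)
    pairs = list(combinations(items, 2))         # nC2 unordered pairs
    diff_val = sum(
        (p_stde[x] == p_stde[y]) != (p_sede[x] == p_sede[y])
        for x, y in pairs
    )
    return 1.0 - diff_val / len(pairs)

# Hypothetical partitions of X = {x1..x5} into equivalence classes.
p_stde = {"x1": "R_T1", "x2": "R_T1", "x3": "R_T2", "x4": "R_T2", "x5": "R_T3"}
p_sede = {"x1": "R_E1", "x2": "R_E1", "x3": "R_E2", "x4": "R_E3", "x5": "R_E3"}

cong = pair_congruence(p_stde, p_sede)
t_site = 0.9                                     # assumed initialisation trustworthiness
print(cong, cong * t_site)                       # congruence and T_srte(s)
```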
We now consider the perspective of the partitions themselves. Taking into account Equations (11) and (12), without loss of generality, we can assume that $q = p$; if $q > p$, we can augment the partition $P_{STDE}$ by adding $q - p$ empty subsets $R_{T_{p+1}} = R_{T_{p+2}} = \cdots = R_{T_q} = \emptyset$. Thus, in the following we assume that the two partitions have the same number of classes, $q$. We now introduce an operation known as a pairing of $P_{STDE}$ and $P_{SEDE}$, denoted by $g(P_{STDE}, P_{SEDE})$, which associates with each subset $R_{T_i}$ of $P_{STDE}$ a unique $R_{E_i}$ from $P_{SEDE}$, grouped according to $(R_i, H) \in X$. A pairing $g(P_{STDE}, P_{SEDE})$ is therefore a collection of $q$ pairs $g(P_{T_i}, P_{E_i})$. We now associate with each pairing a score, $Score(g(P_{STDE}, P_{SEDE}))$, defined as follows. Denoting $D_{g.j} = P_{T_j} \cap P_{E_j}$ for $j = 1$ to $q$, we obtain:
$$Score(g(P_{STDE}, P_{SEDE})) = \sum_{j=1}^{q} Card(D_{g.j}).$$
We now use this to obtain the congruence:
$$\overline{\overline{Cong}}(P_{STDE}, P_{SEDE}) = \frac{Score(g(P_{STDE}, P_{SEDE}))}{Card(X)}.$$
Then, we can calculate $T_{srte}(s)$ from $T_{site}(s)$:
$$T_{srte}(s) = \overline{\overline{Cong}}(P_{STDE}, P_{SEDE}) \times T_{site}(s).$$
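The pairing-based congruence only needs one intersection per matched pair of classes, which is where its lower cost comes from. The following minimal sketch uses hypothetical class labels as the pairing key and an assumed $T_{site}(s)$ value.

```python
# Sketch of the pairing-based congruence: Score(g) = sum_j |R_T_j ∩ R_E_j|,
# Cong = Score(g) / Card(X). Class labels are hypothetical and act as the pairing key.

def pairing_congruence(stde_blocks: dict, sede_blocks: dict, x_size: int) -> float:
    """stde_blocks / sede_blocks map a class label to the set of items in that class."""
    labels = set(stde_blocks) | set(sede_blocks)          # missing classes act as empty sets
    score = sum(
        len(stde_blocks.get(label, set()) & sede_blocks.get(label, set()))
        for label in labels
    )
    return score / x_size

# Hypothetical partitions of X = {x1..x5}, paired by a shared class label.
stde_blocks = {"c1": {"x1", "x2"}, "c2": {"x3", "x4"}, "c3": {"x5"}}
sede_blocks = {"c1": {"x1", "x2"}, "c2": {"x3"}, "c3": {"x4", "x5"}}

cong = pairing_congruence(stde_blocks, sede_blocks, x_size=5)
t_site = 0.9                                              # assumed T_site(s)
print(cong, cong * t_site)                                # congruence and T_srte(s)
```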
Through analysis, the complexity of Equation (22) is determined to be $O(Card(STDE) \times Card(SEDE))$, which is less than or equal to the complexity of $\overline{Cong}(P_{STDE}, P_{SEDE})$ in Equation (18), because $Card(STDE) \leq Card(X)$ and $Card(SEDE) \leq Card(X)$, according to Equations (6) and (7).
Can we therefore conclude that the performance of Equation (22) is far superior to that of Equation (19) simply because of their differing complexities? Indeed, with the trends of infrastructure-as-a-service (IaaS), platform-as-a-service (PaaS), and software-as-a-service (SaaS) in the cloud, an increasing number of software components encapsulated as services come from third parties, so a steady and closed STDE no longer exists. For this reason, the precondition of Equation (11), which clusters the STDE into the test space $X$, would visibly increase its complexity.

4. Conclusions

In order to address the measurement problem of WfMS trustworthiness, based on prior research [17,18,19,20,21], this paper has proposed a novel algorithm for WfMS trustworthiness built on the TWfMS framework mechanisms in an uncertain environment with incomplete software behaviour test cases, meaning that the deterministic entropy of services or their underlying software behaviour is only partial. Similar to BPM, we can consider the entire WfMS lifecycle as a group of long-term processes, which we categorise into three aspects: the ‘as-is process’ in the build-time stage, the ‘to-be process’ in the runtime stage, and the ‘agile-consistent process’ at maintenance time. This study focuses on the measurement algorithm of the first SCP4WMS mechanism of the ‘agile-consistent process’, which supports the computing infrastructure of the SOF4WMS and SHM4WMS mechanisms. In order to guarantee the agile and consistent attributes in the WfMS mechanisms, we serialise the three mechanisms SCP4WMS, SOF4WMS, and SHM4WMS in a sequence list and divide the agile and consistent WfMS problems into three self-autonomous priority grades: functionalities, non-functionalities, and transactions. Direct feedback is used to adjust the workflow engine online via the TDC, accompanied by three trustworthy priority grades (measurement, verification, and evaluation) indirectly proposed for the ACM4WMS or ADCV-PN tools by the management and monitoring tool when the LRE or GRE loops are encountered. Moreover, this means that exception events that cannot be solved by the WfMS mechanisms are transformed, together with their event parameters, into the method of the WfMS in order to pursue further solving with LRE or GRE.
In summary, our study is closely related to former works, but it differs from these in terms of two aspects on which we place emphasis simultaneously: self-autonomic computing and trust computing, including the evaluation method [22]. This indicates that Equation (13), if applied only to the $CF(R)$ of Equation (9) and not to $CF(R, H)$, does not resolve the problem of incomplete information on the set of $(R, H)$. Similarly, Equations (18) and (19) are both based on the relatively static software test environment while WfMSs are at runtime. We then compare the difference between $P_{STDE}$ and $P_{SEDE}$ by using Equation (18) and compute $T_{srte}(s)$ by using Equation (19). Indeed, in order to obtain more precise $T_{site}(s)$ and $T_{srte}(s)$, we might update Equations (18) and (19) in the same style as Equation (9) in future works. Furthermore, we plan to implement the WfMS based on the fundamental features of Internetware [23]. All of these works involve software architecture or service paradigms representing their trustworthiness concentrically, from direct or indirect viewpoints, which also indicates that $P_{SEDE}$ is influenced more frequently by the dynamic runtime environment of the WfMS than $P_{STDE}$, which is generated from the relatively static software test environment.
In future work, we plan to conduct simulations or practical industry experiments to verify and evaluate our measurement algorithm for the WfMS mechanisms. We hope that this future work can be completed in the context of reducing WfMS behaviour entropy, given that increases in WfMS behaviour entropy are unavoidable.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (Grant No. 61363001). The author would like to thank the anonymous reviewers and editors for their suggestions.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Qu, R. Case-Based Reasoning for Course Timetabling Problems. Ph.D. Thesis, University of Nottingham, Nottingham, UK, 2002. [Google Scholar]
  2. Liu, K.; Shan, Z.; Wang, J.; He, J.; Zhang, Z.; Qin, Y. Overview on Major Research Plan of Trustworthy Software. Bull. Natl. Sci. Found. China 2008, 22, 145–151. [Google Scholar]
  3. Zha, H.; Wang, J.; Wen, L.; Wang, C.; Sun, J. A workflow net similarity measure based on transition adjacency relations. Comput. Ind. 2010, 61, 463–471. [Google Scholar] [CrossRef]
  4. Zha, H.; Wang, J.; Wen, L.; Wang, C. A label-free similarity measure between workflow nets. In Proceedings of the IEEE Asia-Pacific Services Computing Conference (IEEE APSCC 2009), Singapore, 7–11 December 2009; pp. 463–469. [Google Scholar]
  5. Mühlen, M. Workflow-based process controlling-or: What you can measure you can control. In Workflow Handbook 2001, Workflow Management Coalition; Future Strategies: Lighthouse Point, FL, USA, 2001; pp. 61–77. [Google Scholar]
  6. Ma, Y.; Shi, M.; Wei, J. Cost and accuracy aware scientific workflow retrieval based on distance measure. Inf. Sci. 2015, 314, 1–13. [Google Scholar] [CrossRef]
  7. Chin, D.-M. A Definition and Measure of Workflow Modularity. Master’s Thesis, Florida International University, Miami, FL, USA, 2005. [Google Scholar]
  8. Wilson, D.; Hamptonbagshaw, K.; Jorwic, T.M.; Bishop, J.; Giustina, E. A new focus on process and measure: Raising data quality with a standard coding workflow and benchmarks. J. AHIMA 2008, 79, 54–58. [Google Scholar] [PubMed]
  9. Wang, Y.; Li, M.; Cao, J.; Lin, X.; Tang, F. Workflow similarity measure for process clustering in grid. In Proceedings of the International Conference on Fuzzy Systems and Knowledge Discovery, Haikou, China, 24–27 August 2007; pp. 629–635. [Google Scholar]
  10. Wei, L.; Handley, J.; Martin, N.; Sun, T.; Keogh, E. Clustering workflow requirements using compression dissimilarity measure. In Proceedings of the Sixth IEEE International Conference Data Mining Workshops (ICDM Workshops), Hong Kong, China, 18–22 December 2006; pp. 50–54. [Google Scholar]
  11. Bo, T.K.; Gundersen, M.; Berge, T.O. To measure workflow and waste: A concept for continuous improvement. In Proceedings of the 22nd Annual Conference of the International Group for Lean Construction (IGLC 2014), Oslo, Norway, 25–27 June 2014; pp. 835–846. [Google Scholar]
  12. Dai, Y.S.; Min, X.; Quan, L.; Szu-Hui, N. Uncertainty Analysis in Software Reliability Modeling by Bayesian Analysis with Maximum-Entropy Principle. IEEE Trans. Softw. Eng. 2007, 33, 781–795. [Google Scholar] [CrossRef]
  13. Lollai, S.A. Quality Systems. A Thermodynamics-Related Interpretive Model. Entropy 2017, 19, 418. [Google Scholar] [CrossRef]
  14. Jaynes, E.T. Information Theory and Statistical Mechanics. Stat. Phys. 1963, 106, 181–218. [Google Scholar]
  15. Kapur, J. Maximum-Entropy Models in Science and Engineering; John Wiley & Sons: Hoboken, NJ, USA, 1989. [Google Scholar]
  16. Madni, A.M.; Jackson, S. Towards a Conceptual Framework for Resilience Engineering. IEEE Syst. J. 2009, 3, 181–191. [Google Scholar] [CrossRef]
  17. Han, Q.; Yuan, Y. Research on trustworthiness measurement approaches of component based BPRAS. J. Commun. 2014, 35, 47–57. (In Chinese) [Google Scholar]
  18. Han, Q. TWfMS: A framework of trustworthy workflow management system. In Proceedings of the International Conference on Subject-Oriented Business Process Management (S-BPM) ONE 2017, Darmstadt, Germany, 30–31 March 2017; pp. 78–85. [Google Scholar]
  19. Han, Q. Resilience engineering for trustworthy workflow management system. In Proceedings of the IEEE International Conference on Software Quality, Reliability and Security Companion (QRS-C), Prague, Czech Republic, 25–29 July 2017; pp. 289–296. [Google Scholar]
  20. Yuan, Y.; Han, Q. A Software Behavior Trustworthiness Measurement Method based on Data Mining. Int. J. Comput. Intell. Syst. 2011, 4, 817–825. [Google Scholar] [CrossRef]
  21. Yuan, Y.; Han, Q. A Data Mining Based Measurement Method for Software Trustworthiness. Chin. J. Electron. 2012, 21, 293–296. [Google Scholar] [CrossRef]
  22. Rong, J. A Trustworthiness Evaluation Method for Software Architectures Based on the Principle of Maximum Entropy (POME) and the Grey Decision-Making Method (GDMM). Entropy 2014, 16, 4818–4838. [Google Scholar]
  23. Hong, M.; Jian, L. A Software Architecture Centric Engineering Approach for Internetware; Springer: Singapore, 2016; pp. 702–730. [Google Scholar]
Figure 1. Reference model for a trustworthy workflow management system (TWfMS) with resilience engineering (RE).
Figure 2. Methods and mechanisms of a TWfMS with RE.
