Next Article in Journal
The Feasibility of Using the “Artery Sign” for Pre-Procedural Planning in Navigational Bronchoscopy for Parenchymal Pulmonary Lesion Sampling
Next Article in Special Issue
Role of Diffusion-Weighted Magnetic Resonance Imaging for Characterization of Mediastinal Lymphadenopathy
Previous Article in Journal
Human Fall Detection Using 3D Multi-Stream Convolutional Neural Networks with Fusion
Previous Article in Special Issue
Radiation Exposure and Lifetime Attributable Risk of Cancer Incidence and Mortality from Low- and Standard-Dose CT Chest: Implications for COVID-19 Pneumonia Subjects
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Clinical Modelling of RVHF Using Pre-Operative Variables: A Direct and Inverse Feature Extraction Technique

1
Medical Diagnostic Imaging Department, College of Health Science, University of Sharjah, Sharjah 27272, United Arab Emirates
2
Operational Research Centre in Healthcare, Near East University, TRNC Mersin 10, Nicosia 99138, Turkey
3
Department of Cardiovascular Surgery, Near East University, TRNC Mersin 10, Nicosia 99138, Turkey
4
Statistics Department, Carlos III University of Madrid, 28903 Madrid, Spain
5
Department of Mathematics, Near East University, TRNC Mersin 10, Nicosia 99138, Turkey
6
Interdisciplinary Research Center for Membrane and Water Security, King Fahd University of Petroleum and Minerals, Dhahran 31261, Saudi Arabia
7
Department of Radiology, Brain Health Imaging Institute, Weill Cornell Medicine, New York, NY 10065, USA
8
Cardiovascular Surgery Department, Ege University, Izmir 35100, Turkey
*
Authors to whom correspondence should be addressed.
Submission received: 15 October 2022 / Revised: 23 November 2022 / Accepted: 2 December 2022 / Published: 6 December 2022
(This article belongs to the Special Issue Machine Extractable Knowledge from the Shape of Anatomical Structures)

Abstract

:
Right ventricular heart failure (RVHF) mostly occurs due to the failure of the left-side of the heart. RVHF is a serious disease that leads to swelling of the abdomen, ankles, liver, kidneys, and gastrointestinal (GI) tract. A total of 506 heart-failure subjects from the Faculty of Medicine, Cardiovascular Surgery Department, Ege University, Turkey, who suffered from a severe heart failure and are currently receiving support from a ventricular assistance device, were involved in the current study. Therefore, the current study explored the application of both the direct and inverse modelling approaches, based on the correlation analysis feature extraction performance of various pre-operative variables of the subjects, for the prediction of RVHF. The study equally employs both single and hybrid paradigms for the prediction of RVHF using different pre-operative variables. The visualized and quantitative performance of the direct and inverse modelling approach indicates the robust prediction performance of the hybrid paradigms over the single techniques in both the calibration and validation steps. Whereby, the quantitative performance of the hybrid techniques, based on the Nash–Sutcliffe coefficient (NC) metric, depicts its superiority over the single paradigms by up to 58.7%/75.5% and 80.3%/51% for the calibration/validation phases in the direct and inverse modelling approaches, respectively. Moreover, to the best knowledge of the authors, this is the first study to report the implementation of direct and inverse modelling on clinical data. The findings of the current study indicates the possibility of applying these novel hybridised paradigms for the prediction of RVHF using pre-operative variables.

1. Introduction

Right ventricular heart failure (RHVF) is generally due to problems arising from the left atrium [1]. This heart disease is a syndrome characterised by the inability of the cardiac output to match the body’s metabolic demands, stemming from structural or functional impairment of ventricular filling or ejection. There are, however, two predominant conditions where the anatomic right ventricular (RV) lies in the systemic position and the anatomic left ventricular (LV) lies in the subpulmonary position, namely d-Transposition of the great arteries (d-TGA), which is palliated with an atrial switch operation, and congenitally corrected transposition of the great arteries (cc-TGA) [2]
Most patients with symptoms and signs of heart failure have a left ventricular ejection fraction that is not markedly abnormal [3]. The recognition of the magnitude of the problem of heart failure with preserved ejection fraction in the past 20 years has spurred an explosion of clinical investigation and a growing intensity of informative outcome trials [4].
Although substantial progress has been made in understanding the various pathological conditions of RVHF, effective and accurate medical assessment of these factors, as well as the benefits of RVHF diagnosis at an earlier stage, have led to a tremendous surge in the use of detectors [5,6,7]. Furthermore, Manca et al. [8] reported that Titin (TTN)-related dilated cardiomyopathy (DCM) has a higher likelihood of left ventricular reverse remodelling compared with other genetic etiologies. Moreover, research indicates that previously there were no data regarding the evolution of right ventricular dysfunction (RVD) according to genetic background. Moreover, the findings of these results indicate that the evolution of RVD in DCM is heterogeneous in different genetic backgrounds. Whereby, TTN-related DCM is associated with a higher chance of RVD recovery compared with other genetic etiologies.
Therefore, the need for understanding this medical condition using a simple, fast, and cost-effective technique is of paramount importance. For instance, the application of artificial intelligence (AI) and machine learning (ML) in locating, analysing, interpreting, forecasting, and classifying the medical information associated with RVHF can serve as the robust integration for understanding this important health condition, especially when provided with the necessary medical information. Recently, ML and AI have been recommended as valuable methods to enhance illness prognosis, diagnosis, and prediction as well as progress management [5,9]. Several machine learning (ML) algorithms have been implemented in RVHF to improve the medical workflow and avoid the limitations of conventional methods. A recent example is the ML-based research by Jingjing et al. [10], which uses AI-assisted auscultation to detect congenital cardiac disorders at a Shanghai children’s medical center; they focused on the sensitivity, specificity, and accuracy of remote auscultation. This study found that remote auscultation could find unusual heart sounds with 98% responsiveness, 91% specificity, 97% accuracy, and a 0.87 kappa coefficient. A total of 1397 people with CHD signed up for the survey. The rest of the specimens from the 1362-patient population (mean age 2.4 3.1 years, 46% female) were evaluated.
Silvia et al. [11] reported that cardiovascular disease (CVD), despite the significant advances in its diagnosis and treatment, still represents the leading cause of morbidity and mortality worldwide. Therefore, to improve and optimise CVD outcomes, artificial intelligence techniques have the potential to radically change the way we practice cardiology, offering us novel tools to interpret data and make clinical decisions. AI techniques such as machine learning and deep learning can also improve medical knowledge due to the increase in the volume and complexity of the data, unlocking clinically relevant information.
The current study is, to our best knowledge, the first in the technical published literature that employs the applications of hybridised paradigms (ILR–ANFS, ILR–GPR, and ILR–GRNN) for the clinical prediction of RVHF using pre-operative variables and, ultimately, is based on the recent technical literature as well as a scan of the literature, as shown in Figure 1; in addition, to our best knowledge, this is the first study that reports the feasibility of applying direct and inverse modelling for clinical prediction in health- and medical-related studies.
Therefore, the current study aims at understanding the connection between pre-operative variables and RVHF by using the correlational feature extraction method based on heart failure patients’ data. Based on the correlational analysis results of the pre-operative variables and RVHF as the dependent variable, two novel approaches were developed, namely the inverse and direct approaches for modelling RVHF using single stand-alone models with improved novel hybridised paradigms.

2. Materials and Methods

2.1. Single Paradigms

2.1.1. Gaussian Process Regression (GPR)

Gaussian process regression (GPR) can be referred to as robust non-linear prediction model, which is probabilistic, nonparametric, supervised, and unsupervised learning method that generalises the non-linear and complex function mapping hidden in datasets. Recently, GPR has increasingly attracted the attention of researchers from different engineering fields [12,13]. GPR is capable of handling non-linear data due to its use of kernel functions. Moreover, one of the merits of a GPR model is that the model can provide a reliable response to input data [14].

2.1.2. Adaptive Neuro-Fuzzy Inference System (ANFIS)

When there are problems, ANFIS can figure out what they are and how to fix them. Its origin was from the feed-forward and multilayer adaptive networks. ANFIS is jointly made up of input variables and the rule of fuzzy, which has dependent and independent variables according to TSK (Takagi–Sugeno–Kan) inferences. Fuzzification and defuzzier are all contained in the database of fuzzy. By using membership function parameters, fuzzy set theory converts the information into fuzzified values.
Nodes had a vital role as member functions (MFs), which made it possible to model the correlation between the two parameters in a way that makes sense. It has a triangular, trapezoidal sigmoid and Gaussian member function [15]. Based on the theory, Equations (1) and (2) are created.
Rule   No . 1 :   if   μ x   is   A 1   and   μ y B 1 then   f 1 =   p 1 x + q 1 y + r 1
Rule   No .   2 :   if   μ x   is   A 2   and   μ y is   B 2   then   f 2 =   p 2 x + q 2 y + r 2
A 1 , B 1 , A 2 , B 2 parameters are membership functions for x and y, and inputs p 1 , q 1 , r 1 ,   p 2 , q 2 , r 2 , are output function data. The structure of ANFIS and its formulation agrees with a neural net set up with 5 tiers. More details about ANFIS are explained by Khademi et al. 2016.

2.1.3. Generalised Regression Neural Network (GRNN)

GRNN, also known as the lazy training method model, was developed by Specht [16] to behave in the manner of the regression method, by generating a relationship between the dependent manipulated variable (X) and the outcome variable (Y) with a non-linear regression estimation for a smaller group of data. The input layer is similar to that of a conventional neural network, in that its main purpose is to train the input data, and the size of the input vectors is the main determinant of the number of neurons required for training. The model training begins immediately in the pattern layer due to the Gaussian kernel’s conversion of previously input data. The smoothing parameter (σ) is used to calculate the weight of each neuron in this layer. This parameter is referred to as the “hyper-parameter of the GRNN model”, and it contributes to the GRNN model’s prediction accuracy [17]. Its general form is depicted as follows.
P i = e x p X X i T X X i 2 σ 2
where X equals the input data of the dataset to be tested, X i is the ith input of the training dataset, and σ is the smoothing parameter.

2.1.4. Interaction Linear Regression (ILR)

Generally, we have different kinds of linear regressions (LRs), including multi-linear regression (MLR), stepwise linear regression (SWLR), and interaction linear regression (ILR), as can be seen from these studies [18,19,20,21].
ILR (isometric log-ratio) is a type of regression that looks at how the relying (target) factors and one or more reaction (response) parameters are related and communicate [22]. Overall, S. I. Abba et al. [23] demonstrates that the multi-linear regression (MLR) concept is the most commonly used regression model. MLR could be read even though it has a lower prediction performance than AI-based modelling techniques [24]. Generally, LR models can be expressed as:
y = b 0 + b 1 x 1 + b 2 x 2 + b i x i
where y represents the target parameter,   x 1 equals the value of the 𝑖th predictor, b 0 denotes constant regression, and b i indicates 𝑖th predictor coefficient.

2.2. Hybrid-Based Paradigms

Various modellers have different opinions regarding the buildup of models, methodology employed in inputting data, and duration of modelling; all had a significant impact in optimal performance of the model [25,26,27,28].
The problem associated with artificial intelligence could be surmounted courtesy of evolving techniques utilised in eradicating the issues. This recent outstanding technique takes into consideration the straight line association between the input data and the predicted variable, and also there is no direct relationship between the information and variable it gives out.
These models, in artificial intelligence (AI) models such as Gaussian process regression (GPR), general regression neural networks and adaptive neuro-fuzzy inference, have been impacted positively.
According to the work of Marrero-Ponce et al. [29], ANN (artificial neural network) as well as MLR (multi-linear regression) have been excellent in making remarkable predictions; this is as a result of the combination of rectilinear and haphazard domains of these models and their applications [30,31,32,33]. The “no free lunch” theorem emphatically states that no singular model could be applied to varieties of datasets. The optimal performance of model is dependent on the kind of information the model utilises in working and data features such as measurement of linearity, size, and general wholeness, which all contribute immensely to the working principle of the model.
Several researchers have reiterated that the same information and performance index can vary once various provided variants are utilised.
Again, when data intelligence models are modified, they are wholly utilised in multiple problems.
Therefore, in this study, four single models were employed to figure out right ventricular heart failure, which are GPR (Gaussian process regression), GRNN (general regression neural network), adaptive neuro-fuzzy inference, and ILR. An algorithm named “Hybrid Data Intelligence” is then suggested. This combines both the linear ILR model and the various artificial intelligence-based prototypes (ILR, GPR, GRNN, and ANFIS) to reap the benefits of both the unique characteristics and strong points of both models, especially for predicting data trends of various types. The combination of different modelling approaches makes the whole process work efficiently. Afterwards, an ILR model learns in order to acquire the finest models with linear characteristics. The composite (hybrid) method has two categories. ILR, with learning that is associated with general best values, cannot model non-linear features of the data, thereby it returns to the sequential ILR model, which has information about non-linear dynamics and can be utilised by artificial intelligence models to represent information.
f y t = q t + r t
where q t represents the linear phase, and r t represents the non-linear phase. In order to evaluate how the two proceed, information must be utilised. Let ϵ represent the residual at time t from the linear model, therefore:
ϵ t = y t q ^ t
where q ^ t is given as forecasted time value t based on the calculated correlation, by modelling residuals using intelligence-based models. It is very vital to find non-linear connections. Using n nodes that provide input, for the residuals, the summation of artificial intelligence model will be:
ϵ t = f ϵ t 1 , ϵ t 2 , . , ϵ t n + ε t
where f is given as the function that is not linear, which was determined by the AI models that make them, GPR, L-Boost, and SVM, giving ε t as random errors. Moreover, Figure 2 depicts the methods used based on block diagram.

2.3. Grading Metrics of the Models Employed in the Current Study

The optimal performance of the model is estimated using different variables that compare estimated values to the one obtained for any category of dataset.
In this work, 2 statistical error metrics were employed, root-mean-square (RMSE) and mean-squared (MSE), plus 2 goodness-of-fit variables: Nash–Sutcliffe coefficient (NC) and Pearson coefficient (PC).
NC = 1 j = 1 N Y o b s , j Y c o m , j 2 j = 1 N Y o b s , j Y ¯ o b s , j 2
PC = i = 1 N Y o b s Y ¯ o b s Y c o m Y ¯ c o m i = 1 N Y o b s Y ¯ o b s 2 i = 1 N Y c o m Y ¯ c o m 2    
RMSE = i = 1 N Y o b s i Y c o m i 2 N
MSE = 1 N i = 1 N ( Y o b s i Y c o m i ) 2
where N, Y o b s i , Y ¯ , and Y c o m i are data quantity, data that are seen, overall average of data noted, and calculated values, respectively.

2.4. Study Description and Validation Strategy for the Models Used

This study consists of 506 heart-failure subjects from the Faculty of Medicine, Cardiovascular Surgery Department, Ege University, Turkey, who suffered from a severe heart failure and are currently receiving support from a ventricular assistance device. The patients’ group equally consists of RVHF patients, non-RVHF patients, and patients with a risk of RVHF. Whereby, the patients pre-operative features, namely mean pulmonary arterial pressure (mPAP), central venous pressure (CVP), transpulmonary gradient (tpg), preoperative alanine aminotransferase level (ALT), preoperative aspartate aminotransferase level (AST), preoperative billurubin (BUN), prothrombin time (PT time), preoperative haematocrit level (pre htc), preoperative sodium level (pre-sodium), preoperative tricuspid valve insufficiency (pre ty), extracorporeal membrane oxygenator (ECMO), preoperative mitral valve prolapse (preMV), pulmonary capillary wedge pressure (Pcw), preoperative left ventricular end systolic diameter (pre lvesd), preoperative left ventricular ejection fraction (pre lvef), preoperative mitral insufficiency (pre my), preoperative aortic valve insufficiency (pre ay), preoperative systolic pulmonary arterial pressure (pre spap), preoperative tricuspid annular plane systolic excursion (pre tapse), intraaortic balloon pump (IABP), and creatinine were recorded.
Hence, the current study employed the application of both single and hybrid paradigms using different pre-operative variables for the prediction of RVHF. In addition, prior to the modelling step, correlation-based feature extraction step was conducted in order to separate the variables based on their connections with the RVHF output variable. The direct modelling scenario involves pre-operative variables, namely mPAP, CVP, tpg, alt, ast, BUN, pre Bili, PT time, pre htc, pre-sodium, pre ty, ECMO, and preMV for the prediction of RVHF, while the inverse modelling scenario, composed of pcw, creatinine, pre INR, pre lvesd, pre lvef, pre my, pre ay, pre spap, pre tapse, and IABP, was involved in the clinical prediction of RVHF.
Furthermore, the fundamental aim of data-driven techniques is geared toward the model of a collection of datasets, with the pointer in use as a building block for a reliable prediction of unknown. Keeping in mind that several constraints such as overfitting and underfitting lead to poor training and testing results. Testing schemes include k-fold cross-validation, holdout, leave one out, and others. The 10 k-fold cross-validation procedure was used in the current study [34], for assessing and validating the datasets used. According to the k-fold cross-validation, researchers have divided the information: 75% for the calibrating (training) stage and 25% for the checking (verification) stage. Moreover, Table 1 describes the basic descriptive statistics of the study population indicating the mean (0.057312), median (0), mode (0), standard deviation (0.232668), kurtosis (12.64552), skewness (3.820412), range (1), minimum (0), maximum (0), and count (506) of the RVHF patients.
Based on Table 1, the minimum value is 0, which denotes ‘no’, meaning normal patients; the maximum value is 1, meaning ‘yes’, which indicates patients suffering from RVHF.

2.5. Model Conceptualisation

Phase 1: Data acquisition
The complete dataset was obtained from a clinical study consisting of 506 heart-failure subjects from the Faculty of Medicine, Cardiovascular Surgery Department, Ege University, Turkey, who suffered from a severe heart failure and are currently receiving support from a ventricular assistance device. Furthermore, the data points were divided into 75% for the calibration stage and 25% for the testing stage. The data were subsequently validated to check and control potential modelling problems such as overfitting and underfitting.
Phase 2: Simulation using single models
The stand-alone paradigms (GPR, GRNN, and ANFIS) together with the traditional linear regression ILR were all conducted using MATLAB 9.3 (R2020a).
Phase 3: Hybrid data-intelligence
The data-intelligence techniques involve combining the properties of the classical linear ILR technique with AI-based techniques in order to improve the performance of the single models. Hence, ILR, GPR, GRNN, and ANFIS are developed.
Furthermore, the major essence of developing these techniques is to understand the behaviour of different clinical data (consisting of the input and output variables) to assign some weight that can be used in predicting the target. For instance, in the current study, the pre-operative variables are used in determining whether a patient will suffer from RVHF or not. Moreover, developing mobile application is very possible through this approach, in order to assist clinicians, patients, and policy makers in understanding the behaviour and risk factors of RVHF.

3. Results

Recently, computational techniques have been established in the medical field for the prediction of various diseases, prevalence, and disorders. AI-based techniques and machine learning are the major dominant techniques over classical regressions, owing to their robustness in handling highly chaotic datasets. Therefore, understanding a dataset prior to the employment of AI techniques for prediction is of paramount importance. Different feature-extraction techniques such as mutual understanding, sensitivity analysis, and correlational analysis feature the extraction method. Therefore, an exploratory method based on correlation analysis is utilised in Table 2 in order to determine the input–output relation of the variables used in the current study.
An exploratory technique informed of correlation analysis is employed in Table 2 to elucidate the relation between the variables used in the current study.

Performance of the Single and Hybrid Paradigms for Modelling RVHF Using Pre-Operative Variables

The currect section demonstrates both the quantitative and visualised performance of both the single and hybrid paradigms for the direct and inverse prediction of RVHF using various pre-operative variables.

4. Discussion

The correlational analysis shown in Table 2 demonstrates both the direct and inverse relation for the input and the corresponding RVHF variable. Based on Table 2a, all the independent variables showed a weak connection with the output variable (RVHF), whereby pre sodium and PT time showed a superior relation than the others, with R-values equal to 0.16 and 0.15, respectively. Moreover, CVP showed a 0 correlational value with RVHF; this is due to the fact that most of the patients are suffering from left ventricular heart failure but not RVHF. Furthermore, Table 2b equally indicates that all the input variables showed a weak negative correlation with RVHF as the corresponding output variable. Based on the inverse results, pre my with an R-value equal to −0.24 with RVHF showed the highest relation with the output variable, in which pre spap showed a 0 relation with the corresponding RVHF output variable.
Based on the correlational analysis performance shown in Table 2, four different single-paradigm (GPR, GRNN, ANFIS, and ILR) models integrated with three different novel hybrid paradigms (ILR–GPR, ILR–GRNN, and ILR–ANFIS) were used in predicting RVHF using various pre-operative variables in two different scenarios (direct and inverse approaches).
Table 3 demonstrates the performance of the direct modelling scenario approach for the prediction of RVHF. The performance metrics, PC, DC, RMSE, and MSE, used in the current scenario indicate the superiority of the hybrid techniques over the single approaches for the clinical modelling of RVHF in both the calibration and validation stages. Various modellers reported that for any prediction model or computation paradigm to be accepted, it should show a minimum DC-value of 0.8. Therefore, all four single models (ILR, ANFIS, GRNN, and GPR) failed to fulfill this criterion, as they failed to meet this requirement in both the calibration and validation phases. In addition, ILR–GPR and ILR–ANFIS, with DC-values equal to 0.862 and 0.813, respectively, in the calibration phase fulfilled the minimum requirement. Whereby, ILR–ANFIS (0.696) failed and ILR–GPR (0.861) succeeded in the validation step. The weak performance of the paradigms can be attributed to the low correlation between the pre-operative variables and RVHF as the corresponding output variable, as shown in Table 2a. In general, for both the single and hybrid paradigms used in the direct modelling of RVHF based on the pre-operative variables, only the ILR–GPR hybrid technique was able to capture the highly non-linear and chaotic nature of the dataset.
Hence, the performance of the results obtained in the current study is in line with studies reported by Konstantinos et al. [35] regarding the current and future state of the AI-enhanced electrocardiogram in detecting heart disease for patients in at-risk population densities. Their technique has made it possible for the electrocardiogram to be interpreted quickly.
Moreover, Muni et al. used AI techniques to present the importance of passive and semi-sensors and unique approaches analysing heart failure [36]. More studies related with the implementation of ML and AI on cardiovascular diseases require elucidation.
The performance of both the single and hybrid techniques can be comparatively visualised using various graphical illustrations. For instance, the MSE- and RMSE-values are used to indicate the error performance of each model. The error performance of both the single and hybrid techniques developed using the direct modelling approach can be graphically compared in both the calibration and validation steps using column and bar charts (see Figure 3).
Moreover, the fitness of the direct modelling can be compared based on performances against the clinical RVHF values, which can be visualised using a time series plot (see Figure 4).
Furthermore, Table 4 indicates the performance of inverse modelling for the prediction of RVHF in both the calibration and validation steps. Based on the quantitative performance of the single and hybrid techniques, it can be seen that all four single models, GPR, GRNN, ANFIS, and ILR, failed to predict RVHF as the dependent variable. Whereas, for the hybrid-based inverse modelling, ILR–GPR and ILR–ANFIS were able to predict the behaviour and properties of the complex RVHF dataset. Moreover, comparative analysis of all the inverse-based techniques indicated that ILR–ANFIS outperformed all six other techniques in both the training and validation stages. Furthermore, based on the DC-metrics performance of the ILR–ANFIS technique, its ability in improving the performance prediction of the single paradigms increased up to 81% and 51% in the calibration and validation stages, respectively. Hence, the comparative performance of the techniques can be graphically compared, based on the performance-error in terms of RMSE and MSE (see Figure 5).
Moreover, the metrics PC and DC indicate the performance fitness between the predicted and experimental values. Therefore, the response plot information of the time series can be used to compare the performance of the hybrid techniques for the simulation of RVHF (see Figure 6).
Hence, the novelty of the current work can be shown in different ways: (1) This is the first study that reports the combined application of GPR, GRNN, and ANFIS AI-based techniques for the clinical prediction of RVHF using pre-operative variables. (2) It is, equally, the first study that employs the application of the ILR regression method for the clinical modelling of RVHF; in fact, this is the first study that reports the implementation of this model in any clinical/health-related study. Ultimately, to the best knowledge of the authors, based on the recent technical literature as well as a scan of the literature, as shown in Figure 1, this is the first study that reports the feasibility of applying direct and inverse modelling for clinical prediction in health- and medical-related studies. Moreover, the quantitative performance of the hybrid technique based on the Nash–Sutcliffe coefficient (NC) metric depicts its superiority over the single paradigms by up to 58.7%/75.5% and 80.3%/51% for the calibration/validation phases in the direct and inverse modelling approaches, respectively. However, one of the major limitations of the current study is the employment of a two-step technique: the single and hybrid approaches; whereby, the single approach was unable to capture the RVHF datasets owing to its complexity and chaotic nature.

5. Conclusions

Medical informatics deals with improving the management of clinical knowledge, patient data, population data, and information related to patient care. This emerging technique is regarded as a promising tool that helps policy and decision makers in making critical decisions related to patients’ care. Therefore, the current study explores the application of both direct and inverse modelling using AI-based techniques and hybrid-based paradigms for the prediction of RVHF; whereby, the hybrid techniques depict a higher performance compared with the single paradigms. Hence, the results of the current research recommend the application of various metaheuristic and computational approaches for improving the prediction ability of RVHF using various pre-operative variables. Furthermore, future work on different ways of identifying the complex behaviour of the data through the non-linear feature extraction technique, feature scaling, the normalisation of data, and standardisation are equally recommended.

Author Contributions

Conceptualization, A.G.U., O.B. and D.U.O.; methodology, A.G.U., D.I.E., S.I.A., D.U.O. and O.B.; data curation, A.G.U. and O.B.; writing—original draft preparation, A.G.U., O.B., D.I.E., S.I.A. and D.U.O.; writing—review and editing, I.O., B.U., T.Y. and C.E.; visualization, A.G.U. and S.I.A.; supervision O.B. and D.U.O.; project administration, O.B. and D.U.O.; funding acquisition, O.B. and D.U.O. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Medical Research Ethics Committee of Ege University (date/code: 2021/9T-20).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Data will be provided on requests.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Sztrymf, B.; Vuillard, C.; Boucly, A.; Artaud-Macari, E.; Sattler, C.; Amar, D.; Jaïs, X.; Sitbon, O.; Humbert, M.; Savale, L.; et al. Right heart failure. ERS Monogr. 2022, 2016, 32–47. [Google Scholar] [CrossRef]
  2. Andrade, L.; Carazo, M.; Wu, F.; Kim, Y.; Wilson, W. Mechanisms for heart failure in systemic right ventricle. Heart Fail. Rev. 2019, 25, 599–607. [Google Scholar] [CrossRef] [PubMed]
  3. Wang, J.M.; Rai, R.; Carrasco, M.; Sam-Odusina, T.; Salandy, S.; Gielecki, J.; Zurada, A.; Loukas, M. An anatomical review of the right ventricle. Transl. Res. Anat. 2020, 17, 100049. [Google Scholar] [CrossRef]
  4. Pfeffer, M.A.; Shah, A.M.; Borlaug, B. Heart Failure With Preserved Ejection Fraction In Perspective. Circ. Res. 2019, 124, 1598–1617. [Google Scholar] [CrossRef] [PubMed]
  5. Grünig, E.; Eichstaedt, C.A.; Seeger, R.; Benjamin, N. Right Heart Size and Right Ventricular Reserve in Pulmonary Hypertension: Impact on Management and Prognosis. Diagnostics 2020, 10, 1110. [Google Scholar] [CrossRef] [PubMed]
  6. Ji, M.; Wu, W.; He, L.; Gao, L.; Zhang, Y.; Lin, Y.; Qian, M.; Wang, J.; Zhang, L.; Xie, M.; et al. Right Ventricular Longitudinal Strain in Patients with Heart Failure. Diagnostics 2022, 12, 445. [Google Scholar] [CrossRef]
  7. Zhang, X.; Ruan, B.; Qiao, Z.; Yang, M.; Zhuang, Q.; Wang, J.; Wang, W.; Zheng, Y.; Zhao, H.; Shen, X.; et al. The Balance between the Left and Right Ventricular Deformation Evaluated by Speckle Tracking Echocardiography Is a Great Predictor of the Major Adverse Cardiac Event in Patients with Pulmonary Hypertension. Diagnostics 2022, 12, 2266. [Google Scholar] [CrossRef] [PubMed]
  8. Manca, P.; Cannatà, A.; Nuzzi, V.; Bromage, D.I.; Varrà, G.G.; Rossi, M.; Ferro, M.D.; Paldino, A.; Gigli, M.; Barbati, G.; et al. Prevalence and Evolution of Right Ventricular Dysfunction Among Different Genetic Backgrounds in Dilated Cardiomyopathy. Can. J. Cardiol. 2021, 37, 1743–1750. [Google Scholar] [CrossRef]
  9. Patil, S.; Albogami, S.; Hosmani, J.; Mujoo, S.; Kamil, M.A.; Mansour, M.A.; Abdul, H.N.; Bhandi, S.; Ahmed, S.S.S.J. Artificial Intelligence in the Diagnosis of Oral Diseases: Applications and Pitfalls. Diagnostics 2022, 12, 1029. [Google Scholar] [CrossRef]
  10. Lv, J.; Bin Dong, B.; Lei, H.; Shi, G.; Wang, H.; Zhu, F.; Wen, C.; Zhang, Q.; Fu, L.; Gu, X.; et al. Artificial intelligence-assisted auscultation in detecting congenital heart disease. Eur. Heart J.-Digit. Health 2021, 2, 119–124. [Google Scholar] [CrossRef]
  11. Romiti, S.; Vinciguerra, M.; Saade, W.; Cortajarena, I.A.; Greco, E. Artificial Intelligence (AI) and Cardiovascular Diseases: An Unexpected Alliance. Cardiol. Res. Pract. 2020, 2020, 4972346. [Google Scholar] [CrossRef]
  12. Omran, B.A.; Chen, Q.; Jin, R. Comparison of Data Mining Techniques for Predicting Compressive Strength of Environmentally Friendly Concrete. J. Comput. Civ. Eng. 2016, 30, 4016029. [Google Scholar] [CrossRef] [Green Version]
  13. Cheng, M.-Y.; Huang, C.-C.; Van Roy, A.F. Predicting project success in construction using an evolutionary gaussian process inference model. J. Civ. Eng. Manag. 2013, 19, S202–S211. [Google Scholar] [CrossRef] [Green Version]
  14. Pal, M.; Deswal, S. Modelling pile capacity using Gaussian process regression. Comput. Geotech. 2010, 37, 942–947. [Google Scholar] [CrossRef]
  15. Elkiran, G.; Nourani, V.; Abba, S. Multi-step ahead modelling of river water quality parameters using ensemble artificial intelligence-based approach. J. Hydrol. 2019, 577, 123962. [Google Scholar] [CrossRef]
  16. Wiangkham, A.; Ariyarit, A.; Aengchuan, P. Prediction of the influence of loading rate and sugarcane leaves concentration on fracture toughness of sugarcane leaves and epoxy composite using artificial intelligence. Theor. Appl. Fract. Mech. 2022, 117, 103188. [Google Scholar] [CrossRef]
  17. Ardejanii, F.D.; Rooki, R.; Shokri, B.J.; Kish, T.E.; Aryafar, A.; Tourani, P. Prediction of Rare Earth Elements in Neutral Alkaline Mine Drainage from Razi Coal Mine, Golestan Province, Northeast Iran, Using General Regression Neural Network. J. Environ. Eng. 2013, 139, 896–907. [Google Scholar] [CrossRef]
  18. Usman, A.G.; Işik, S.; Abba, S.I. Hybrid data-intelligence algorithms for the simulation of thymoquinone in HPLC method development. J. Iran. Chem. Soc. 2021, 18, 1537–1549. [Google Scholar] [CrossRef]
  19. Abba, S.I.; Abdulkadir, R.A.; Sammen, S.S.; Usman, A.G.; Meshram, S.G.; Malik, A.; Shahid, S. Comparative implementation between neuro-emotional genetic algorithm and novel ensemble computing techniques for modelling dissolved oxygen concentration. Hydrol. Sci. J. 2021, 66, 1584–1596. [Google Scholar] [CrossRef]
  20. Ghali, U.M.; Usman, A.G.; Chellube, Z.M.; Degm, M.A.A.; Hoti, K.; Umar, H.; Abba, S.I. Advanced chromatographic technique for performance simulation of anti-Alzheimer agent: An ensemble machine learning approach. SN Appl. Sci. 2020, 2, 1–12. [Google Scholar] [CrossRef]
  21. Usman, A.G.; Işik, S.; Abba, S.I.; Meriçli, F. Chemometrics-based models hyphenated with ensemble machine learning for retention time simulation of isoquercitrin in Coriander sativum L. using high-performance liquid chromatography. J. Sep. Sci. 2021, 44, 843–849. [Google Scholar] [CrossRef] [PubMed]
  22. Asadisaghandi, J.; Tahmasebi, P. Comparative evaluation of back-propagation neural network learning algorithms and empirical correlations for prediction of oil PVT properties in Iran oilfields. J. Pet. Sci. Eng. 2011, 78, 464–475. [Google Scholar] [CrossRef]
  23. Abba, S.; Usman, A.; Işik, S. Simulation for response surface in the HPLC optimization method development using artificial intelligence models: A data-driven approach. Chemom. Intell. Lab. Syst. 2020, 201, 104007. [Google Scholar] [CrossRef]
  24. Nemati, S.; Fazelifard, M.H.; Terzi, Ö.; Ghorbani, M.A. Estimation of dissolved oxygen using data-driven techniques in the Tai Po River, Hong Kong. Environ. Earth Sci. 2015, 74, 4065–4073. [Google Scholar] [CrossRef]
  25. Garba Usman, A.; Alhosen, M.; Alsharksi, A.; Muhammed Naibi, A. Applications of Artificial Intelligence-Based Models and Multi-Linear Regression for the Prediction of Thyroid Stimulating Hormone Level in the Human Body Artificial Intelligent Techniques for Stream Flow Modeling View Project Global Platform to Showcase Your Research View project. Available online: https://www.researchgate.net/publication/342571024 (accessed on 3 September 2022).
  26. Usman, A.G.; Işik, S.; Abba, S.I.; Meriçli, F. Artificial intelligence–based models for the qualitative and quantitative prediction of a phytochemical compound using HPLC method. Turk. J. Chem. 2020, 44, 1339–1351. [Google Scholar] [CrossRef]
  27. Modeling of Water Treatment Plant Performance Using Artificial Neural Network: Case Study Tamburawa Kano-Nigeria. Available online: https://www.researchgate.net/publication/344380629_Modeling_of_Water_Treatment_Plant_Performance_using_Artificial_Neural_Network_Case_Study_Tamburawa_Kano-Nigeria (accessed on 6 October 2022).
  28. Gaya, M.S.; Abba, S.I.; Abdu, A.M.; Tukur, A.I.; Saleh, M.A.; Esmaili, P.; Wahab, N.A. Estimation of water quality index using artificial intelligence approaches and multi-linear regression. IAES Int. J. Artif. Intell. (IJ-AI) 2020, 9, 126–134. [Google Scholar] [CrossRef] [Green Version]
  29. Marrero-Ponce, Y.; Barigye, S.J.; Jorge-Rodríguez, M.E.; Tran-Thi-Thu, T. QSRR prediction of gas chromatography retention indices of essential oil components. Chem. Pap. 2018, 72, 57–69. [Google Scholar] [CrossRef]
  30. Malami, S.I.; Musa, A.A.; Haruna, S.I.; Aliyu, U.U.; Usman, A.G.; Abdurrahman, M.I.; Bashir, A.; Abba, S.I. Implementation of soft-computing models for prediction of flexural strength of pervious concrete hybridized with rice husk ash and calcium carbide waste. Model. Earth Syst. Environ. 2021, 8, 1933–1947. [Google Scholar] [CrossRef]
  31. Alamrouni, A.; Aslanova, F.; Mati, S.; Maccido, H.S.; Jibril, A.A.; Usman, A.G.; Abba, S.I. Multi-Regional Modeling of Cumulative COVID-19 Cases Integrated with Environmental Forest Knowledge Estimation: A Deep Learning Ensemble Approach. Int. J. Environ. Res. Public Health 2022, 19, 738. [Google Scholar] [CrossRef]
  32. Khalid, G.M.; Usman, A.G. Application of data-intelligence algorithms for modeling the compaction performance of new pharmaceutical excipients. Future J. Pharm. Sci. 2021, 7, 1–11. [Google Scholar] [CrossRef]
  33. Haruna, S.I.; Malami, S.I.; Adamu, M.; Usman, A.G.; Farouk, A.; Ali, S.I.A.; Abba, S.I. Compressive Strength of Self-Compacting Concrete Modified with Rice Husk Ash and Calcium Carbide Waste Modeling: A Feasibility of Emerging Emotional Intelligent Model (EANN) Versus Traditional FFNN. Arab. J. Sci. Eng. 2021, 46, 11207–11222. [Google Scholar] [CrossRef]
  34. Usman, A.G.; Işik, S.; Abba, S.I. A Novel Multi-model Data-Driven Ensemble Technique for the Prediction of Retention Factor in HPLC Method Development. Chromatographia 2020, 83, 933–945. [Google Scholar] [CrossRef]
  35. Siontis, K.C.; Noseworthy, P.A.; Attia, Z.I.; Friedman, P.A. Artificial intelligence-enhanced electrocardiography in cardiovascular disease management. Nat. Rev. Cardiol. 2021, 18, 465–478. [Google Scholar] [CrossRef] [PubMed]
  36. Maurya, M.R.; Riyaz, N.U.S.S.; Reddy, M.S.B.; Yalcin, H.C.; Ouakad, H.M.; Bahadur, I.; Al-Maadeed, S.; Sadasivuni, K.K. A review of smart sensors coupled with Internet of Things and Artificial Intelligence approach for heart failure monitoring. Med. Biol. Eng. Comput. 2021, 59, 2185–2203. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Biblographic literature scan from 1980–2022 on keywords, titles, and abstracts related to implementation of machine learning on RVHF data (source: Scopus 2022).
Figure 1. Biblographic literature scan from 1980–2022 on keywords, titles, and abstracts related to implementation of machine learning on RVHF data (source: Scopus 2022).
Diagnostics 12 03061 g001
Figure 2. Block diagram depicting the methods used in the current study.
Figure 2. Block diagram depicting the methods used in the current study.
Diagnostics 12 03061 g002
Figure 3. Error-depicted interims of (a) MSE- and (b) RMSE-values for the direct modelling of both the single and hybrid paradigms.
Figure 3. Error-depicted interims of (a) MSE- and (b) RMSE-values for the direct modelling of both the single and hybrid paradigms.
Diagnostics 12 03061 g003
Figure 4. Response plot for the direct modelling approach of right ventricular heart failure for both the single and hybrid approaches.
Figure 4. Response plot for the direct modelling approach of right ventricular heart failure for both the single and hybrid approaches.
Diagnostics 12 03061 g004
Figure 5. Error-depicted interims of (a) MSE- and (b) RMSE-values for the direct modelling of both the single and hybrid paradigms.
Figure 5. Error-depicted interims of (a) MSE- and (b) RMSE-values for the direct modelling of both the single and hybrid paradigms.
Diagnostics 12 03061 g005
Figure 6. Response plot for the inverse modelling approach of right ventricular heart failure for both the single and hybrid approaches.
Figure 6. Response plot for the inverse modelling approach of right ventricular heart failure for both the single and hybrid approaches.
Diagnostics 12 03061 g006
Table 1. Descriptive analysis of the study population.
Table 1. Descriptive analysis of the study population.
OperationRVHF
Mean0.057312
Median0
Mode0
Standard Deviation0.232668
Kurtosis12.64552
Skewness3.820412
Range1
Minimum0
Maximum1
Count506
Table 2. (a) Direct and (b) inverse correlation analysis.
Table 2. (a) Direct and (b) inverse correlation analysis.
(a) Direct
VariablesmPAPCVPtpgaltastBUNpre BiliPT timepre htcpre sodiumpre tyECMOpreMVRVHF
mPAP1
CVP0.421.00
tpg0.510.481.00
alt−0.05−0.04−0.081.00
ast0.010.030.060.411.00
BUN−0.05−0.030.030.100.201.00
pre Bilirubin0.080.320.16−0.030.250.071.00
PT time0.060.020.090.110.000.070.041.00
pre htc0.03−0.040.100.03−0.070.05−0.120.171.00
pre sodium−0.12−0.09−0.050.09−0.010.05−0.210.200.211.00
pre ty0.060.190.13−0.020.210.07 0.560.06−0.07−0.091.00
ECMO−0.07−0.060.000.040.480.140.34−0.05−0.080.020.201.00
preMV−0.030.14−0.130.060.07−0.080.21−0.02−0.130.000.110.031.00
RVHF0.020.000.050.030.070.070.010.150.010.160.020.100.011
(b) Inverse
Variablespcwcreatininpre INRpre lvesdpre lvefpre mypre aypre spappre tapseIABPRVHF
pcw1
creatinin0.141.00
pre INR0.160.601.00
pre lvesd0.110.380.791.00
pre lvef0.070.09−0.16−0.331.00
pre my0.050.080.00−0.110.211.00
pre ay0.120.300.580.74−0.28−0.051.00
pre spap0.16−0.17−0.27−0.280.200.08−0.201.00
pre tapse0.01−0.16−0.28−0.320.240.05−0.220.251.00
IABP0.100.310.660.83−0.37−0.100.80−0.28−0.271.00
RVHF−0.03−0.04−0.04−0.03−0.03−0.24−0.020.00−0.01−0.031.00
Table 3. Performance of the direct modelling scenario approach.
Table 3. Performance of the direct modelling scenario approach.
Calibration
ModelsDCPCMSERMSE
GPR0.4190.6470.0020.050
GRNN0.4850.6970.0020.047
ANFIS0.4160.6450.0030.050
ILR0.2750.5250.0030.056
ILR–GPR0.8620.9280.0010.029
ILR–GRNN0.7770.8820.0030.041
ILR–ANFIS0.8130.9020.0010.039
Validation
GPR0.2010.4490.0580.241
GRNN0.1060.3250.0650.255
ANFIS0.1600.4000.0610.247
ILR0.3350.5780.0480.220
ILR–GPR0.8610.9280.0010.024
ILR–GRNN0.5590.7480.0020.044
ILR–ANFIS0.6960.8340.0010.036
Table 4. Performance of the inverse modelling scenario approach.
Table 4. Performance of the inverse modelling scenario approach.
Calibration
DCPCMSERMSE
GPR0.1030.3210.0650.255
GRNN0.6730.8200.0240.154
ANFIS0.0630.2510.0680.261
ILR0.1790.4230.0600.244
ILR–GPR0.8980.9470.0070.086
ILR–GRNN0.7090.8420.0210.146
ILR–ANFIS0.9060.9520.0070.082
Validation
GPR0.4190.6470.0020.050
GRNN0.6730.8200.0010.037
ANFIS0.5370.7330.0020.045
ILR0.4720.6870.0020.048
ILR–GPR0.8020.8960.0010.029
ILR–GRNN0.4370.6610.0020.049
ILR–ANFIS0.9290.9640.0000.017
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Uzun Ozsahin, D.; Balcioglu, O.; Usman, A.G.; Ikechukwu Emegano, D.; Uzun, B.; Abba, S.I.; Ozsahin, I.; Yagdi, T.; Engin, C. Clinical Modelling of RVHF Using Pre-Operative Variables: A Direct and Inverse Feature Extraction Technique. Diagnostics 2022, 12, 3061. https://0-doi-org.brum.beds.ac.uk/10.3390/diagnostics12123061

AMA Style

Uzun Ozsahin D, Balcioglu O, Usman AG, Ikechukwu Emegano D, Uzun B, Abba SI, Ozsahin I, Yagdi T, Engin C. Clinical Modelling of RVHF Using Pre-Operative Variables: A Direct and Inverse Feature Extraction Technique. Diagnostics. 2022; 12(12):3061. https://0-doi-org.brum.beds.ac.uk/10.3390/diagnostics12123061

Chicago/Turabian Style

Uzun Ozsahin, Dilber, Ozlem Balcioglu, Abdullahi Garba Usman, Declan Ikechukwu Emegano, Berna Uzun, Sani Isah Abba, Ilker Ozsahin, Tahir Yagdi, and Cagatay Engin. 2022. "Clinical Modelling of RVHF Using Pre-Operative Variables: A Direct and Inverse Feature Extraction Technique" Diagnostics 12, no. 12: 3061. https://0-doi-org.brum.beds.ac.uk/10.3390/diagnostics12123061

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop