Estimating the Dissolution of Anticancer Drugs in Supercritical Carbon Dioxide with a Stacked Machine Learning Model

Najmi, Maryam; Ayari, Mohamed Arselene; Sadeghsalehi, Hamidreza; Vaferi, Behzad; Khandakar, Amith; Chowdhury, Muhammad E. H.; Rahman, Tawsifur; Jawhar, Zanko Hassan

doi:10.3390/pharmaceutics14081632

Open AccessArticle

Estimating the Dissolution of Anticancer Drugs in Supercritical Carbon Dioxide with a Stacked Machine Learning Model

¹

Faculty of Industrial Engineering, South Tehran Branch, Islamic Azad University, Tehran 1584715414, Iran

²

Department of Civil and Architectural Engineering, Qatar University, Doha 2713, Qatar

³

Technology Innovation and Engineering Education Unit, Qatar University, Doha 2713, Qatar

⁴

Department of Neuroscience, Faculty of Advanced Technologies in Medicine, Iran University of Medical Sciences, Tehran 1449614535, Iran

⁵

Department of Chemical Engineering, Shiraz Branch, Islamic Azad University, Shiraz 7198774731, Iran

⁶

Department of Electrical Engineering, Qatar University, Doha 2713, Qatar

⁷

Department of Medical Laboratory Science, College of Health Science, Lebanese French University, Kurdistan Region 44001, Iraq

^*

Author to whom correspondence should be addressed.

Pharmaceutics 2022, 14(8), 1632; https://0-doi-org.brum.beds.ac.uk/10.3390/pharmaceutics14081632

Submission received: 25 June 2022 / Revised: 25 July 2022 / Accepted: 30 July 2022 / Published: 5 August 2022

(This article belongs to the Special Issue Recent Progress in Formulation Approaches for Improving the Solubility and Bioavailability of Poorly Soluble Drugs)

Download

Browse Figures

Versions Notes

Abstract

:

Synthesizing micro-/nano-sized pharmaceutical compounds with an appropriate size distribution is a method often followed to enhance drug delivery and reduce side effects. Supercritical CO₂ (carbon dioxide) is a well-known solvent utilized in the pharmaceutical synthesis process. Reliable knowledge of a drug’s solubility in supercritical CO₂ is necessary for feasible study, modeling, design, optimization, and control of such a process. Therefore, the current study constructs a stacked/ensemble model by combining three up-to-date machine learning tools (i.e., extra tree, gradient boosting, and random forest) to predict the solubility of twelve anticancer drugs in supercritical CO₂. An experimental databank comprising 311 phase equilibrium samples was gathered from the literature and applied to design the proposed stacked model. This model estimates the solubility of anticancer drugs in supercritical CO₂ as a function of solute and solvent properties and operating conditions. Several statistical indices, including average absolute relative deviation (AARD = 8.62%), mean absolute error (MAE = 2.86 × 10⁻⁶), relative absolute error (RAE = 2.42%), mean squared error (MSE = 1.26 × 10⁻¹⁰), and regression coefficient (R² = 0.99809) were used to validate the performance of the constructed model. The statistical, sensitivity, and trend analyses confirmed that the suggested stacked model demonstrates excellent performance for correlating and predicting the solubility of anticancer drugs in supercritical CO₂.

Keywords:

artificial intelligence technique; ensemble model; anticancer solid drugs; solubility; supercritical CO₂

1. Introduction

The low solubility of solid pharmaceutical substances in the aqueous-based media of the human body is often resolved by utilizing a higher dosage of drugs [1,2]. This increase in the dosage usually increases the cost of pharmacological treatment [3], decreases the drug’s therapeutic efficiency, and produces several side effects [2,4]. To overcome these critical limitations/drawbacks, synthesizing either micro- or nano-sized pharmaceutical substances with a uniform size distribution has been suggested by researchers [2,5]. Some researchers also used the ionically crosslinked complex [6] and self-indicating cellulose-based [7] gels to improver drug delivery. Therefore, a practical process must be established to synthesize pharmaceutical substances with these morphological characteristics.

Supercritical CO_2, which is a well-known solvent in the chemical [8], petroleum [9], polymer [10], energy [11], and food [12] industries, has also been successfully engaged in medical [13] and biomedical [14] engineering. The solubilities of stomach statin [15], malaria [16], Coronavirus [16], anti-inflammatory [17], antifungal [18], anti-hypertension [19], anticonvulsant [5,20], antibiotic [21], anti-prostatic tumor [5], antidiabetic [22], antiepileptic [23], and anti-cancer [24] drugs in supercritical CO₂ have been experimentally measured/analyzed. These experimental investigations have often monitored the effect of temperature and pressure on the drug dissolution in supercritical CO₂. Cancer is among the most deadly diseases known to humanity [25,26], but the solubility of anticancer drugs in supercritical CO₂ is often low and ranges from 10⁻⁷ to 10⁻³ mole fraction. The literature has stated that measuring this low-scale property is expensive, time-consuming, and difficult [27].

Therefore, some researchers have utilized equations of state to simulate solid drug solubility in supercritical CO₂ [28,29]. This technique relies on the solid drugs’ physio-chemical and critical properties to calculate their solubility in supercritical CO₂ [2]. Unfortunately, equations of state not only require complex mathematical operations, but the required drug’s characteristics are also often unavailable [2].

Several empirical/semiempirical correlations have also been recommended to estimate the solubility of solid drugs in supercritical CO₂ [30,31,32,33]. Although these correlations are easy to use and only require temperature, pressure, and solvent density to calculate the drug solubility value, they are often applicable for a specific system under predefined operating conditions [13]. Therefore, these methods cannot be applied to monitor the solubility of drugs in supercritical CO₂ in a wide range of domains [13].

Recently, artificial intelligence models have been considered to estimate drug dissolution in supercritical CO₂ as a function of operating conditions and solvent property [16,27,34,35]. The quantitative structure–property relationships [27], artificial neural networks [36,37], adaptive neuro-fuzzy inference systems [13,34], and support vector machines [37,38] have been applied to predict both drug and drug-like substances in supercritical CO₂.

The stacked/ensemble models that are often constructed by systematically combining several previously designed machine learning (ML) models have found great popularity in different fields of science and technology [39,40,41]. This up-to-date modeling scenario has not previously been utilized to estimate anticancer drug solubility in supercritical CO₂. Therefore, the current research combines the extra tree (ET), gradient boosting (GB), and random forest (RF) models to construct a reliable stacked model for estimating anticancer drug solubility in supercritical CO₂. The suggested stacked approach can monitor the solubility of twelve anticancer drugs in supercritical carbon dioxide in a broad range of operating conditions. We can claim that the proposed model in this study is straightforward, easy to use, generalized, and has no applicable range limitation. Moreover, such an approach is essential for designing pharmaceutical processes using supercritical CO₂.

2. Anticancer Drugs’ Solubility in Supercritical Carbon Dioxide

Laboratory investigations, empirical or semiempirical correlations, and machine learning models are often employed to measure or estimate the solubility of a specific solid drug in supercritical CO₂ versus equilibrium pressure/temperature and solvent density. Since this study aims to design a single model for simultaneously estimating the solubility of twelve anticancer drugs in supercritical CO₂, it is also necessary to include the solute property in the model development phase. Therefore, the machine learning models have been applied to extract the relationship defined by Equation (1).

y_{d r u g}^{c a l} = M L (M_{d r u g}, T_{e q}, P_{e q}, ρ_{C O_{2}})

(1)

It can be said that the machine learning methods are responsible for deducing the inherent relationship between the solubility (

y_{d r u g}^{c a l}

) and drug molecular weight (

M_{d r u g}

), equilibrium temperature (

T_{e q}

) and pressure (

P_{e q}

), and solvent density (

ρ_{C O_{2}}

).

Table 1 introduces the experimental data gathered from the literature to develop the machine learning models. Moreover, Table 2 reports the molecular weight of the considered anticancer drugs and their chemical structure.

It should be mentioned that each row of Table 1 is actually a summary of multiple data instances, and that pressure, temperature, and CO₂ density are input features. Moreover, an additional input feature is the molecular weight of drugs from Table 2, and the target feature to predict is anti-cancer drug solubility in supercritical CO₂.

3. Methods

This study first develops three different ML regressors (i.e., extra tree, gradient boosting, and random forest) to monitor the equilibrium behavior of anticancer drug– supercritical CO₂ systems. It then develops a stacked model using the three previously developed models as a base learner and linear regression as a meta learner.

3.1. Extra Tree

Geurts et al. [49] originally derived the extra tree regression (ETR) approach from the random forest (RF) algorithm [50]. According to the conventional top-down technique, the ETR develops a group of unpruned decisions (or regression trees) [49]. The ETR and RF models have two main differences. First, ETR utilizes whole cutting points and divides nodes by the random choice among these points. Second, it cultivates the trees utilizing the whole-learning samples to reduce bias as much as possible [49]. ETR controls the splitting process helping two parameters, namely,

k

and

n_{m i n}

. The former is the number of randomly chosen features in the node, and the latter represents the minimum sample size expected to separate nodes. In addition,

k

and

n_{m i n}

determine the strength of both the selection of attributes and the average output noise, respectively. These parameters have a key role in improving the ETR accuracy and decreasing the possibility of overfitting [50].

3.2. Gradient Boosting

The gradient boost model (GB) is an ensemble regressor used to enhance accuracy of function approximation, according to the boosting process [51]. This scenario gradually reduces observed error by sequentially combining several weak learners. This study employs the decision tree as a weak learner. Although the performance of GB-based models depends on the loss function, the logarithm of loss function is often applied to handle regression problems. Furthermore, adaptive components and weak learners are the key parameters of GB-based models. If a gradient boosting model has 300 n estimators, it means that 300 decision trees (weak learners) have been coupled under the boosting process, and each tree is limited to 300 max depth.

3.3. Random Forest

To perform a regression problem by the random forest (RF) method, the bootstrapping and bagging stages should be followed. The first stage generates a group of decision trees by the growth of each distinct tree that uses a random training dataset sample. The second stage breaks down the decision tree nodes after achieving the ensemble, where several random subdivisions of training samples are chosen during the initial bagging process. The decision-making is performed by choosing the best subdivision and its value [52]. In summary, the RF model can be viewed as a group of decision trees, in which

G (x, θ_{r})

is the Gth predicting tree and

θ

shows a uniform independent distribution vector assigned before the tree growth [53]. The Breiman equation (i.e., Equation (2)) is used to construct the forest (i.e., an ensemble of trees) by combining and averaging the whole trees [53].

G (x, θ_{1}, \dots \dots θ_{r}) = \frac{1}{R} \sum_{r = 1}^{R} G (x, θ_{r})

(2)

3.4. Stacked Model

The study proposed a stacking-based approach and compared the performance with conventional ML regressors. This approach consists of a two-step learner such as base- learner and meta-learner. The three best-performing ML regression models were selected as base-learner models in the stacking model and linear regression was used as a meta- learner model (

M_{f})

in the second phase of the stacking model and eventually produced the final prediction. Figure 1 shows the architecture of the proposed stacking model, which combined p numbers of best-performing regression models

M_{1}, \dots \dots, M_{p}

using an input dataset A, with features (

x_{i}

) and corresponding label (

y_{i}

). In the first step, p numbers of base-level ML regression models produced the predictions

\hat{y_{1}}, \dots \dots, \hat{y_{m}}

. The predictions of the base learners were fed into the meta learner model (

M_{f}

) for the final prediction.

Algorithm 1 provides a step-by-step procedure to explain the construction of the stacked model.

Algorithm 1. The algorithm used for developing the stacked model

Input: training data

A = {x_{i}, y_{i}}_{i = 1}^{m}

Output: a stacking regressor

M_{f}

1: Step 1: perform the training of the base-level regressors
2: for t = 1 to T do
3: Train

h_{t}

based on database of A
4: end for
5: Step 2: design new database of predictions
6: for i = 1 to m do
7:

A_{h} = \{x_{i}^{'}, y_{i}\}

, where

x_{i}^{'} = \{h_{1} (x_{i}), \dots \dots, h_{T} (x_{i})\}

8: end for
9: Step 3: perform the training of the meta-regressor
10: perform the training of

M_{f}

based on

A_{h}

11: return

M_{f}

It should be mentioned that the predictions of the level 0 model (base learner) are used as input to the level 1 model (meta learner). Moreover, the same set of training instances in the level 1 model (meta learner) and level 0 model are used (just with different features, obtained from base model predictions).

3.5. Performance Analysis

This study evaluates and compares the accuracy of the base and stacked machine learning scenarios using the AARD% (Equation (3)), R² (Equation (4)), MAE (Equation (5)), RAE% (Equation (6)), and MSE (Equation (7)) indexes [54,55,56]. These indices quantify the deviation between the experimental solubility values (

y_{d r u g}^{\exp}

) and the calculated solubility data by the machine learning models (

y_{d r u g}^{c a l}

).

A A R D % = {\sum_{n = 1}^{N} 100 \times (|y_{d r u g}^{\exp} - y_{d r u g}^{c a l}| / y_{d r u g}^{\exp})}_{n} / N

(3)

R^{2} = 1 - \sum_{n = 1}^{N} {(y_{d r u g}^{\exp} - y_{d r u g}^{c a l})}_{n}^{2} / \sum_{n = 1}^{N} {(y_{_{d r u g}}^{\exp} - y_{_{d r u g}}^{a v e})}_{n}^{2}

(4)

M A E = {\sum_{n = 1}^{N} (|y_{d r u g}^{\exp} - y_{d r u g}^{c a l}|)}_{n} / N

(5)

R A E % = 100 \times \sum_{n = 1}^{N} {(|y_{d r u g}^{\exp} - y_{d r u g}^{c a l}|)}_{n} / \sum_{n = 1}^{N} {(|y_{d r u g}^{\exp} - y_{_{d r u g}}^{a v e}|)}_{n}

(6)

M S E = \sum_{n = 1}^{N} {(y_{d r u g}^{\exp} - y_{d r u g}^{c a l})}_{n}^{2} / N

(7)

The above statistical criteria also require the average value of drug solubilities (

y_{d r u g}^{a v e}

) and the number of data (N). Equation (8) defines the average value of drug solubilities in supercritical CO₂.

y_{_{d r u g}}^{a v e} = {\sum_{n = 1}^{N} (y_{d r u g}^{\exp})}_{n} / N

(8)

Moreover, several graphical techniques (cross-plot, histogram, kernel density estimation, and Bland-Altman) and trend analyses have been applied to check the performance of the most accurate machine learning approach (i.e., the stacked model).

4. Results and Discussion

4.1. Developing Base Machine Models

The anticancer drug–supercritical CO₂ phase equilibrium measurements (311 datasets) were randomly divided into internal and external groups (4:1 ratio). The five-fold cross-validation utilized the earlier group (i.e., 248 data samples) for the training and validation phases of the base learner machines. On the other hand, the remaining 63 data samples were engaged in the testing phase of the trained base-learner machines.

In this study, we leverage the hyperparameter optimization framework Optuna [57] as follows. First, the corresponding parameter spaces for the Scikit-Learn implementations of the RF, ETR, and GB models were identified. Then, the objective function (OF) was defined as the MSE. Lastly, a pipeline was applied to minimize the OF over a predefined maximum iteration on multiple cores (i.e., 300). Some of the RF and ETR hyperparameters, including the number of trees in the forest (i.e., number of estimators) were adjusted. We checked 80–150 trees in the forest and the best accuracy was obtained with 110 trees for the RF and 100 trees for ETR. Furthermore, upon checking different accuracy criteria (i.e., squared error, absolute error, and Poisson), squared error shows the best performance for both ETR and RF algorithms. In addition, the GB model [51] was tuned by the learning rate, ranging from 0.0 to 1 with a step size of 0.1. The results show that the learning rate = 0.2 produced the best performance. The model was also tuned with diverse loss functions (i.e., squared error, absolute error, Huber, and quantile). It was found that absolute error provided the model with the best performance. All reported results in this study were obtained by using the optimized models.

Table 3 introduces the uncertainty level observed in predictions of the base learner machines. The numerical values of five statistical indices are reported for the internal and external groups, as well as their combination. This table confirms that the deviation between the experimental solubility measurements and the associated predictions by the base-learner machines was relatively high. The extra tree model prediction accuracy was better than two other developed regression machines.

4.2. Designing the Stacked Model

As mentioned before, it is possible to build a stacked model by combining the previous three base learner machines utilizing the flowchart presented in Figure 1. Table 4 summarizes the prediction accuracy of the built stacked model in the cross-validation and testing phases and for all available datasets.

It can be concluded that the stacked model provides acceptable prediction accuracy for calculating the phase equilibrium behavior of the anticancer drug–supercritical CO₂ binary system. The constructed stacked model estimated 311 data samples of anticancer drug solubilities in supercritical CO₂ with excellent accuracy, i.e., AARD = 8.62%, MAE = 2.86 × 10⁻⁶, RAE = 2.42%, MSE = 1.26 × 10⁻¹⁰, and R² = 0.99809. These values of the statistical indexes for predicting the ultra-low ranges of anticancer drug solubility in supercritical CO₂ (10⁻⁷ to 10⁻³ mole fraction based on Table 1) are sufficient for designing pharmaceutical processes.

4.3. Comparison with the Other Modeling Scenarios

The literature has estimated solubilities of Decitabine [36] and Busulfan [38] in supercritical CO₂ using adaptive neuro-fuzzy inference systems and support vector machines, respectively. These models have been developed to estimate the solubility of a single drug or two drugs in supercritical CO₂, while our stacked model covers 12 different anti-cancer drugs. Table 5 shows that the accuracy of the stacked model is comparable or even better than the previously developed intelligent techniques.

This stage suggests a simple correlation based on the partial least-squares regression (PLS-R) to linearly relate the anticancer drug solubility in supercritical CO₂ to the independent variables (Equation (9)).

y_{d r u g}^{P L S - R} = (- 1.66 \times M_{d r u g} + 2.62 \times ρ_{C O_{2}} + 21.16 \times T_{e q} + 9.62 \times P_{e q} - 9397) \times 10^{- 7}

(9)

The accuracy of the stacked model (AARD = 8.62%, MSE = 1.26 × 10⁻¹⁰ and R² = 0.99809) is considerably better than the results obtained by the PLS-R (AARD >> 100%, MSE = 1.90 × 10⁻⁸ and R² = 0.39307).

4.4. Evaluating the Performance of the Stacked Model Using Graphical Analyses

This section utilizes several graphical analyses to visually inspect the stacked model’s performance for predicting anticancer drug solubility in supercritical CO₂.

Figure 2 depicts the calculated solubility values by the stacked model versus their corresponding experimentally measured values. The diagonal line shows those situations where predicted solubilities precisely coincided with their experimental counterparts (i.e., laboratory experiments equal prediction). The accumulation of both internal and external symbols in the vicinity of the diagonal line proved that the proposed stacked model successfully learned the equilibrium behavior of anticancer drug–supercritical CO₂ systems.

Numerical values of the relative error (RE), average (

R E^{a v e}

), and standard deviation (SD) have traditionally been used to evaluate the accuracy of a built model. Equations (10)–(12) present the mathematical expressions of the RE,

R E^{a v e}

, and SD, respectively.

R E = {(y_{d r u g}^{\exp} - y_{d r u g}^{c a l})}_{n} n = 1, 2, \dots, N

(10)

R E^{a v e} = \sum_{n = 1}^{N} R E_{n} / N

(11)

S D = {(\sum_{n = 1}^{N} {(R E_{n} - R E^{a v e})}^{2} / N)}^{0.5}

(12)

The histogram of the observed relative errors illustrated in Figure 3 justifies that the major part of the solubility data (248 samples) was estimated with a relative error equal to zero. Moreover, the relative errors’ average and standard deviation values were −8.2194 × 10⁻⁷ and 1.12 × 10⁻⁵ mole fractions, respectively.

The kernel density estimation (KDE) graphs of the experimental and calculated solubility values for the internal and external groups are exhibited in Figure 4. The two graphs in the figure show that only a little deviation exists between the experimental and calculated KDEs of the external groups. This deviation is observable in the 2 × 10⁻⁹ < magnitude < 4 × 10⁻⁹ of Figure 4b.

Figure 5a,b depict the Bland-Altman plots for the internal and external anticancer drug solubility data. These figures have two horizontal dashed lines associated with the upper and lower LoA (95% limit of agreement). Equations (13) and (14) define the upper and lower LoAs indices, respectively [58].

U p p e r L o A = + 1.96 \times S D + R E^{a v e}

(13)

L o w e r L o A = - 1.96 \times S D + R E^{a v e}

(14)

The lower and upper LoAs in Figure 5a were −2.52 × 10⁻¹⁰ and 2.28 × 10⁻¹⁰, whereas Figure 5b had lower and upper LoAs of −9.51 × 10⁻¹¹ and 1.07 × 10⁻¹⁰, respectively.

Figure 5a,b demonstrated that only 9 out of 248 internal data points (3.63%) and 3 out of 63 external data points (4.76%) were located outside the feasible domains.

4.5. Trend Analyses

This section investigates the effect of equilibrium temperature/pressure and anticancer drug type on the system behavior from experimental and modeling perspectives.

The profile of Capecitabine solubility in supercritical CO₂ versus equilibrium pressure for five temperature levels was plotted in Figure 6. It can be seen that increasing the equilibrium pressure continuously improved the Capecitabine solubility in the applied supercritical solvent. It can also be concluded that the pressure effect on the solid drug solubility was linear at low temperature and became nonlinear at high temperature.

An excellent agreement between the laboratory-measured equilibrium data and the stacked model predictions is easily deduced from this figure. Indeed, the stacked model was trained so well that it precisely anticipated the effect of pressure/temperature change, linear/nonlinear behavior of the system, and all individual data points.

Figure 7 exhibits the dependency of Decitabine solubility in supercritical CO₂ on the temperature at eight pressure levels. This figure shows that the temperature had two different impacts on the solid drug solubility at low and high equilibrium pressures. Indeed, increasing the temperature at low pressures decreased Decitabine solubility in the supercritical CO₂. On the other hand, increasing the temperature at high equilibrium pressures gradually intensified the Decitabine solubility in supercritical CO₂.

The complete agreement between experimental solubility data and their associated stacked model predictions can also be justified in this figure. The stacked model correctly predicts the solubility–temperature profiles and accurately estimates all single data points.

Since the analyzed anticancer drugs have different compositions and chemical structures, their solubility is also influenced by drug type. The effect of anticancer type on the average solubility value is shown in Figure 8. As expected, the anticancer drugs show different dissolution tendencies in supercritical CO₂. Decitabine, Busulfan, and Tamoxifen have the highest dissolution ability in the applied supercritical solvent. In contrast, Paclitaxel, Sorafenib tosylate, Thymidine, and Tamsulosin show the lowest tendency for dissolving in the considered solvent.

This figure also compares the average solubility values obtained by the experimental measurement and modeling analysis (i.e., stacked model). It is clear that the observed and calculated average solubility values are equal to up to four to five decimal places.

4.6. Importance of Independent Variables

As the last analysis, the Pearson technique [58] was applied to monitor the relative importance of each individual independent anticancer on the drug’s solubility in supercritical CO₂. This technique presents a value ranging from −1 to +1 to clarify the direction and importance of the relationship between each pair of dependent and independent variables. Table 6 summarizes the results of this analysis.

5. Conclusions

The current research study applied the novel stacked model to precisely monitor phase equilibria of twelve anticancer drug–supercritical CO₂ systems. The proposed stacked model was constructed by systematically combining extra tree, gradient boosting, and random forest machine learning models (known as base learners). Performance analyses of the based leaner models and the stacked model confirmed that the latter has the best accuracy in the cross-validation and testing phases. The designed stacked model showed excellent accuracy for predicting 311 experimentally-measured data samples (i.e., AARD = 8.62%, MAE = 2.86 × 10⁻⁶, RAE = 2.42%, MSE = 1.26 × 10⁻¹⁰, and R² = 0.99809). This stacked model performance is far better than those results obtained by the base-learner model (i.e., extra tree, gradient boosting, random forest), machine learning approaches suggested in the literature (support-vector machines and Adaptive neuro-fuzzy inference systems), and partial least-squares regression (PLS-R). Moreover, the graphical accuracy monitoring techniques (cross-plot, histogram, kernel density estimation, and Bland-Altman) and trend inspections (solubility–pressure and solubility–temperature profiles) confirmed the reliability of the stacked model predictions. Finally, the experimental data and modeling results revealed that Decitabine and Thymidine have the highest and lowest tendency for dissolving in supercritical CO₂, respectively.

Author Contributions

Conceptualization, M.A.A. and M.E.H.C.; investigation, M.A.A., H.S., A.K. and Z.H.J.; methodology, M.N., M.A.A., H.S. and M.E.H.C.; resources, M.N., H.S., T.R. and Z.H.J.; supervision, B.V. and A.K.; validation, M.A.A. and A.K.; visualization, H.S. and T.R.; writing—original draft, M.N. and A.K.; writing—review and editing, M.N., M.A.A., H.S., B.V., A.K., M.E.H.C., T.R. and Z.H.J. All authors have read and agreed to the published version of the manuscript.

Funding

The publication of this article was funded by the Qatar National Library.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The study data analyzed in this article can be obtained by request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

Ahuja, N.; Katare, O.P.; Singh, B. Studies on dissolution enhancement and mathematical modeling of drug release of a poorly water-soluble drug using water-soluble carriers. Eur. J. Pharm. Biopharm. 2007, 65, 26–38. [Google Scholar] [CrossRef] [PubMed]
Saadati Ardestani, N.; Amani, M.; Yeganeh Majd, N. Determination of the solubility of anticancer drugs in supercritical carbon dioxide using empirical models and artificial neural network. J. Appl. Res. Chem. Eng. 2022, 5, 15–37. [Google Scholar]
Tomasovic, S.; Lukac, J.K.; Sremec, J.; Klepac, N.; Draganic, P.; Bielen, I. Epidemiology of pharmacological treatment of multiple sclerosis in Croatia. Psychiatr. Danub. 2021, 33, 204–208. [Google Scholar] [PubMed]
Omeragic, I.; Hasanovic, M. Efficacy of emdr treatment in generalized anxiety disorder after a long-standing pharmacological treatment—A case report. Psychiatr. Danub. 2021, 33, S77–S82. [Google Scholar]
Kalikin, N.N.; Kurskaya, M.V.; Ivlev, D.V.; Krestyaninov, M.A.; Oparin, R.D.; Kolesnikov, A.L.; Budkov, Y.A.; Idrissi, A.; Kiselev, M.G. Carbamazepine solubility in supercritical CO₂: A comprehensive study. J. Mol. Liq. 2020, 311, 113104. [Google Scholar] [CrossRef]
Lai, W.F.; Tang, R.; Wong, W.T. Ionically crosslinked complex gels loaded with oleic acid-containing vesicles for transdermal drug delivery. Pharmaceutics 2020, 12, 725. [Google Scholar] [CrossRef]
Lai, W.F.; Gui, D.; Wong, M.; Döring, A.; Rogach, A.L.; He, T.; Wong, W.T. A self-indicating cellulose-based gel with tunable performance for bioactive agent delivery. J. Drug Deliv. Sci. Technol. 2021, 63, 102428. [Google Scholar] [CrossRef]
Khallaghi, N.; Hanak, D.P.; Manovic, V. Gas-fired chemical looping combustion with supercritical CO₂ cycle. Appl. Energy 2019, 249, 237–244. [Google Scholar] [CrossRef]
Amar, M.N.; Zeraibi, N. Application of hybrid support vector regression artificial bee colony for prediction of MMP in CO₂-EOR process. Petroleum 2020, 6, 415–422. [Google Scholar] [CrossRef]
Mi, H.Y.; Jing, X.; Liu, Y.; Li, L.; Li, H.; Peng, X.F.; Zhou, H. Highly durable superhydrophobic polymer foams fabricated by extrusion and supercritical CO₂ foaming for selective oil absorption. ACS Appl. Mater. Interfaces 2019, 11, 7479–7487. [Google Scholar] [CrossRef]
Li, M.J.; Zhu, H.H.; Guo, J.Q.; Wang, K.; Tao, W.Q. The development technology and applications of supercritical CO₂ power cycle in nuclear energy, solar energy and other energy industries. Appl. Therm. Eng. 2017, 126, 255–275. [Google Scholar] [CrossRef]
Wang, W.; Rao, L.; Wu, X.; Wang, Y.; Zhao, L.; Liao, X. Supercritical Carbon Dioxide Applications in Food Processing. Food Eng. Rev. 2021, 13, 570–591. [Google Scholar] [CrossRef]
Rezaei, T.; Nazarpour, V.; Shahini, N.; Bahmani, S.; Shahkar, A.; Abdihaji, M.; Ahmadi, S.; Shahdost, F.T. A universal methodology for reliable predicting the non-steroidal anti-inflammatory drug solubility in supercritical carbon dioxide. Sci. Rep. 2022, 12, 1043. [Google Scholar] [CrossRef] [PubMed]
Tsai, W.C.; Wang, Y. Progress of supercritical fluid technology in polymerization and its applications in biomedical engineering. Prog. Polym. Sci. 2019, 98, 101161. [Google Scholar] [CrossRef]
Hojjati, M.; Yamini, Y.; Khajeh, M.; Vatanara, A. Solubility of some statin drugs in supercritical carbon dioxide and representing the solute solubility data with several density-based correlations. J. Supercrit. Fluids 2007, 41, 187–194. [Google Scholar] [CrossRef]
Cao, Y.; Khan, A.; Zabihi, S.; Albadarin, A.B. Neural simulation and experimental investigation of Chloroquine solubility in supercritical solvent. J. Mol. Liq. 2021, 333, 115942. [Google Scholar] [CrossRef]
Chen, L.; Huang, Y.; Yu, X.; Lu, J.; Jia, W.; Song, J.; Liu, L.; Wang, Y.; Huang, Y.; Xie, J.; et al. Corynoxine protects dopaminergic neurons through inducing autophagy and diminishing neuroinflammation in rotenone-induced animal models of Parkinson’s disease. Front. Pharmacol. 2021, 12, 642900. [Google Scholar] [CrossRef]
Yamini, Y.; Moradi, M. Measurement and correlation of antifungal drugs solubility in pure supercritical CO₂ using semiempirical models. J. Chem. Thermodyn. 2011, 43, 1091–1096. [Google Scholar] [CrossRef]
Wang, S.W.; Chang, S.Y.; Hsieh, C.M. Measurement and modeling of solubility of gliclazide (hypoglycemic drug) and captopril (antihypertension drug) in supercritical carbon dioxide. J. Supercrit. Fluids 2021, 174, 105244. [Google Scholar] [CrossRef]
Cuadra, I.A.; Cabañas, A.; Cheda, J.A.R.; Pando, C. Polymorphism in the co-crystallization of the anticonvulsant drug carbamazepine and saccharin using supercritical CO₂ as an anti-solvent. J. Supercrit. Fluids 2018, 136, 60–69. [Google Scholar] [CrossRef]
Asiabi, H.; Yamini, Y.; Latifeh, F.; Vatanara, A. Solubilities of four macrolide antibiotics in supercritical carbon dioxide and their correlations using semi-empirical models. J. Supercrit. Fluids 2015, 104, 62–69. [Google Scholar] [CrossRef]
Esfandiari, N.; Sajadian, S.A. Experimental and modeling investigation of Glibenclamide solubility in supercritical carbon dioxide. Fluid Phase Equilib. 2022, 556, 113408. [Google Scholar] [CrossRef]
Sodeifian, G.; Saadati Ardestani, N.; Sajadian, S.A.; Golmohammadi, M.R.; Fazlali, A. Prediction of Solubility of Sodium Valproate in Supercritical Carbon Dioxide: Experimental Study and Thermodynamic Modeling. J. Chem. Eng. Data 2020, 65, 1747–1760. [Google Scholar] [CrossRef]
Pishnamazi, M.; Zabihi, S.; Jamshidian, S.; Hezaveh, H.Z.; Hezave, A.Z.; Shirazian, S. Measuring solubility of a chemotherapy-anti cancer drug (busulfan) in supercritical carbon dioxide. J. Mol. Liq. 2020, 317, 113954. [Google Scholar] [CrossRef]
Pavelić, K. Personalized neoantigen vaccine against cancer. Psychiatr. Danub. 2021, 33, 96–100. [Google Scholar] [CrossRef]
Feng, Y.; Li, F.; Yan, J.; Guo, X.; Wang, F.; Shi, H.; Du, J.; Zhang, H.; Gao, Y.; Li, D.; et al. Pan-cancer analysis and experiments with cell lines reveal that the slightly elevated expression of DLGAP5 is involved in clear cell renal cell carcinoma progression. Life Sci. 2021, 287, 120056. [Google Scholar] [CrossRef] [PubMed]
Euldji, I.; Si-Moussa, C.; Hamadache, M.; Benkortbi, O. QSPR Modelling of the Solubility of Drug and Drug-like Compounds in Supercritical Carbon Dioxide. In Molecular Informatics; Wiley: Hoboken, NJ, USA, 2022; p. 2200026. [Google Scholar] [CrossRef]
Hazaveie, S.M.; Sodeifian, G.; Sajadian, S.A. Measurement and thermodynamic modeling of solubility of Tamsulosin drug (anti cancer and anti-prostatic tumor activity) in supercritical carbon dioxide. J. Supercrit. Fluids 2020, 163, 104875. [Google Scholar] [CrossRef]
Ardestani, N.S.; Majd, N.Y.; Amani, M. Experimental Measurement and Thermodynamic Modeling of Capecitabine (an Anticancer Drug) Solubility in Supercritical Carbon Dioxide in a Ternary System: Effect of Different Cosolvents. J. Chem. Eng. Data 2020, 65, 4762–4779. [Google Scholar] [CrossRef]
Amooey, A.A. A simple correlation to predict drug solubility in supercritical carbon dioxide. Fluid Phase Equilib. 2014, 375, 332–339. [Google Scholar] [CrossRef]
Keshmiri, K.; Vatanara, A.; Yamini, Y. Development and evaluation of a new semi-empirical model for correlation of drug solubility in supercritical CO₂. Fluid Phase Equilib. 2014, 363, 18–26. [Google Scholar] [CrossRef]
Su, C.S.; Chen, Y.P. Correlation for the solubilities of pharmaceutical compounds in supercritical carbon dioxide. Fluid Phase Equilib. 2007, 254, 167–173. [Google Scholar] [CrossRef]
Faress, F.; Yari, A.; Rajabi Kouchi, F.; Safari Nezhad, A.; Hadizadeh, A.; Sharif Bakhtiar, L.; Naserzadeh, Y.; Mahmoudi, N. Developing an accurate empirical correlation for predicting anti-cancer drugs’ dissolution in supercritical carbon dioxide. Sci. Rep. 2022, 12, 9380. [Google Scholar] [CrossRef] [PubMed]
Zhu, H.; Zhu, L.; Sun, Z.; Khan, A. Machine learning based simulation of an anti-cancer drug (busulfan) solubility in supercritical carbon dioxide: ANFIS model and experimental validation. J. Mol. Liq. 2021, 338, 116731. [Google Scholar] [CrossRef]
Baghban, A.; Sasanipour, J.; Zhang, Z. A new chemical structure-based model to estimate solid compound solubility in supercritical CO₂. J. CO₂ Util. 2018, 26, 262–270. [Google Scholar] [CrossRef]
Nguyen, H.C.; Alamray, F.; Kamal, M.; Diana, T.; Mohamed, A.; Algarni, M.; Su, C.H. Computational prediction of drug solubility in supercritical carbon dioxide: Thermodynamic and artificial intelligence modeling. J. Mol. Liq. 2022, 354, 118888. [Google Scholar] [CrossRef]
Wang, T.; Su, C.H. Medium Gaussian SVM, Wide Neural Network and stepwise linear method in estimation of Lornoxicam pharmaceutical solubility in supercritical solvent. J. Mol. Liq. 2022, 349, 118120. [Google Scholar] [CrossRef]
Sadeghi, A.; Su, C.H.; Khan, A.; Rahman, M.L.; Sarjadi, M.S.; Sarkar, S.M. Machine learning simulation of pharmaceutical solubility in supercritical carbon dioxide: Prediction and experimental validation for busulfan drug. Arab. J. Chem. 2022, 15, 103502. [Google Scholar] [CrossRef]
Gunturi, S.K.; Sarkar, D. Ensemble machine learning models for the detection of energy theft. Electr. Power Syst. Res. 2021, 192, 106904. [Google Scholar] [CrossRef]
Mosavi, A.; Hosseini, F.S.; Choubin, B.; Abdolshahnejad, M.; Gharechaee, H.; Lahijanzadeh, A.; Dineva, A.A. Susceptibility prediction of groundwater hardness using ensemble machine learning models. Water 2020, 12, 2770. [Google Scholar] [CrossRef]
Chen, J.; Zou, Q.; Li, J. DeepM6ASeq-EL: Prediction of human N6-methyladenosine (m6A) sites with LSTM and ensemble learning. Front. Comput. Sci. 2022, 16, 1–7. [Google Scholar] [CrossRef]
Sodeifian, G.; Razmimanesh, F.; Ardestani, N.S.; Sajadian, S.A. Experimental data and thermodynamic modeling of solubility of Azathioprine, as an immunosuppressive and anti-cancer drug, in supercritical carbon dioxide. J. Mol. Liq. 2020, 299, 112179. [Google Scholar] [CrossRef]
Suleiman, D.; Estévez, L.A.; Pulido, J.C.; García, J.E.; Mojica, C. Solubility of anti-inflammatory, anti-cancer, and anti-HIV drugs in supercritical carbon dioxide. J. Chem. Eng. Data 2005, 50, 1234–1241. [Google Scholar] [CrossRef]
Yamini, Y.; Hojjati, M.; Kalantarian, P.; Moradi, M.; Esrafili, A.; Vatanara, A. Solubility of capecitabine and docetaxel in supercritical carbon dioxide: Data and the best correlation. Thermochim. Acta 2012, 549, 95–101. [Google Scholar] [CrossRef]
Pishnamazi, M.; Zabihi, S.; Jamshidian, S.; Borousan, F.; Hezave, A.Z.; Marjani, A.; Shirazian, S. Experimental and thermodynamic modeling decitabine anti cancer drug solubility in supercritical carbon dioxide. Sci. Rep. 2021, 11, 1075. [Google Scholar] [CrossRef] [PubMed]
Sodeifian, G.; Sajadian, S.A. Solubility measurement and preparation of nanoparticles of an anticancer drug (Letrozole) using rapid expansion of supercritical solutions with solid cosolvent (RESS-SC). J. Supercrit. Fluids 2018, 133, 239–252. [Google Scholar] [CrossRef]
Sodeifian, G.; Razmimanesh, F.; Sajadian, S.A. Prediction of solubility of sunitinib malate (an anti-cancer drug) in supercritical carbon dioxide (SC–CO₂): Experimental correlations and thermodynamic modeling. J. Mol. Liq. 2020, 297, 105998. [Google Scholar] [CrossRef]
Pishnamazi, M.; Zabihi, S.; Jamshidian, S.; Borousan, F.; Hezave, A.Z.; Shirazian, S. Thermodynamic modelling and experimental validation of pharmaceutical solubility in supercritical solvent. J. Mol. Liq. 2020, 319, 114120. [Google Scholar] [CrossRef]
Geurts, P.; Ernst, D.; Wehenkel, L. Extremely randomized trees. Mach. Learn. 2006, 63, 3–42. [Google Scholar] [CrossRef] [Green Version]
Mishra, G.; Sehgal, D.; Valadi, J.K. Quantitative Structure Activity Relationship study of the Anti-Hepatitis Peptides employing Random Forest and Extra Tree regressors. Bioinformation 2017, 13, 60–62. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
Sharafati, A.; Asadollah, S.B.H.S.; Hosseinzadeh, M. The potential of new ensemble machine learning models for effluent quality parameters prediction and related uncertainty. Process Saf. Environ. Prot. 2020, 140, 68–78. [Google Scholar] [CrossRef]
Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
Shafiq, A.; Çolak, A.B.; Sindhu, T.N.; Lone, S.A.; Alsubie, A.; Jarad, F. Comparative study of artificial neural network versus parametric method in COVID-19 data analysis. Results Phys. 2022, 38, 105613. [Google Scholar] [CrossRef] [PubMed]
Wang, J.; Ayari, M.A.; Khandakar, A.; Chowdhury, M.E.H.; Zaman, S.M.A.U.; Rahman, T.; Vaferi, B. Estimating the Relative Crystallinity of Biodegradable Polylactic Acid and Polyglycolide Polymer Composites by Machine Learning Methodologies. Polymers 2022, 14, 527. [Google Scholar] [CrossRef]
Shafiq, A.; Çolak, A.B.; Sindhu, T.N.; Al-Mdallal, Q.M.; Abdeljawad, T. Estimation of unsteady hydromagnetic Williamson fluid flow in a radiative surface through numerical and artificial neural network modeling. Sci. Rep. 2021, 11, 14509. [Google Scholar] [CrossRef]
Akiba, T.; Sano, S.; Yanase, T.; Ohta, T.; Koyama, M. Optuna: A Next-generation Hyperparameter Optimization Framework. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Anchorage, AK, USA, 4–8 August 2019; pp. 2623–2631. [Google Scholar]
Zhu, X.; Khosravi, M.; Vaferi, B.; Nait Amar, M.; Ghriga, M.A.; Mohammed, A.H. Application of machine learning methods for estimating and comparing the sulfur dioxide absorption capacity of a variety of deep eutectic solvents. J. Clean. Prod. 2022, 363, 132465. [Google Scholar] [CrossRef]

Figure 1. The general architecture of the stacked approach.

Figure 2. Correlation between experimental and calculated solubilities of the studied anticancer drugs.

Figure 3. The histogram of residual errors provided by the stacked model (blue graph shows the normal distribution).

Figure 4. The kernel density estimation graphs for (a) internal and (b) external groups.

Figure 5. The Bland-Altman plots for (a) internal and (b) external groups.

Figure 6. Monitoring the effect of pressure on the anticancer drug (Capecitabine) solubility in supercritical CO₂ from the laboratory and modeling perspectives.

Figure 7. The experimental and modeling profiles of the effect of temperature on the solubility of the anticancer drug Decitabine in supercritical CO₂.

Figure 8. Average values of the solubility in supercritical CO₂ of the studied anticancer drugs achieved from experimental data and modeling results.

Table 1. Experimental data reported in the literature for the solubility of anticancer drugs in supercritical CO₂.

Anticancer Drug	Pressure	Temperature	CO₂ Density	Drug Solubility	No. of Data	Ref.
Anticancer Drug	bar	°C	kg/m³	Mole Fraction	No. of Data	Ref.
Sunitinib malate	120–270	35–65	388–914	5.00 × 10⁻⁶–8.56 × 10⁻⁵	24	[23]
Busulfan	120–400	35–65	383–971	3.27 × 10⁻⁵–8.65 × 10⁻⁴	32	[24]
Tamsulosin	120–270	35–65	384–914	1.80 × 10⁻⁷–1.01 × 10⁻⁵	24	[28]
Azathioprine	120–270	35–65	388–914	2.70 × 10⁻⁶–1.83 × 10⁻⁵	24	[42]
Paclitaxel	100–275	35–55	654–915	1.20 × 10⁻⁶–6.20 × 10⁻⁶	21	[43]
5-Fluorouracil	125–250	35–55	541–901	3.80 × 10⁻⁶–1.46 × 10⁻⁵	18	[43]
Thymidine	100–300	35–55	325–928	1.20 × 10⁻⁶–8.00 × 10⁻⁶	25	[43]
Capecitabine	152–354	35–75	477–955	2.70 × 10⁻⁶–1.59 × 10⁻⁴	35	[44]
Decitabine	120–400	35–65	383–971	2.84 × 10⁻⁵–1.07 × 10⁻³	32	[45]
Letrozole	120–360	45–75	319–922	1.60 × 10⁻⁶–8.51 × 10⁻⁵	20	[46]
Sorafenib tosylate	120–270	35–65	388–914	6.80 × 10⁻⁷–1.26 × 10⁻⁵	24	[47]
Tamoxifen	120–400	35–65	383–971	1.88 × 10⁻⁵–9.89 × 10⁻⁴	32	[48]

Table 2. Molecular weights and chemical structures of the investigated anticancer drugs.

Anticancer Drug	Molecular Weight	Molecular Structure
5-Fluorouracil	130
Azathioprine	277.26
Busulfan	246.3
Capecitabine	359.35
Decitabine	228.21
Letrozole	285.3
Paclitaxel	854
Sorafenib tosylate	637.03
Sunitinib malate	532.56
Tamoxifen	371.51
Tamsulosin	408.05
Thymidine	242

Table 3. Prediction accuracy of the base leaner machines.

Base Learner Model	Subgroup	AARD%	MAE	RAE%	MSE	R²
Extra tree	Internal	11.52	9.21 × 10⁻⁶	7.71	1.19 × 10⁻⁹	0.98283
	External	37.44	2.58 × 10⁻⁵	22.85	2.36 × 10⁻⁹	0.95534
	All data	16.77	1.26 × 10⁻⁵	10.63	1.43 × 10⁻⁹	0.97838
Gradient boosting	Internal	21.04	1.57 × 10⁻⁵	13.13	1.40 × 10⁻⁹	0.97898
	External	43.07	2.48 × 10⁻⁵	21.99	1.97 × 10⁻⁹	0.95756
	All data	25.50	1.75 × 10⁻⁵	14.82	1.52 × 10⁻⁹	0.97560
Random forest	Internal	20.27	1.50 × 10⁻⁵	12.53	1.54 × 10⁻⁹	0.98354
	External	44.29	2.51 × 10⁻⁵	22.20	2.51 × 10⁻⁹	0.94926
	All data	25.14	1.70 × 10⁻⁵	14.38	1.73 × 10⁻⁹	0.97844

Table 4. Prediction accuracy of the stacked model.

AI Scenario	Subgroup	AARD%	MAE	RAE%	MSE	R²
Stacked model	Internal	9.46	3.18 × 10⁻¹¹	2.66	1.51 × 10⁻²⁰	0.99791
	External	5.35	1.62 × 10⁻¹¹	1.44	2.66 × 10⁻²¹	0.99946
	All data	8.62	2.86 × 10⁻⁶	2.42	1.26 × 10⁻¹⁰	0.99809

Table 5. Prediction accuracy of the stacked model.

Drug	Model	R²	Reference
Decitabine	Adaptive neuro-fuzzy inference systems	0.99663	[36]
Decitabine	Stacked model	0.99508	This work
Busulfan	Support vector machines	0.98327	[38]
Busulfan	Stacked model	0.99054	This work

Table 6. Dependency of anticancer drug solubility in supercritical CO₂ on the independent variables.

Information	Dependent–Independent Pairs
Information	$y_{d r u g} - M_{d r u g}$	$y_{d r u g} - ρ_{C O_{2}}$	$y_{d r u g} - T_{e q}$	$y_{d r u g} - P_{e q}$
Pearson coefficient	−0.248	0.295	0.204	0.617
Direction of relationship	Indirect	Direct	Direct	Direct
Importance	Third	Second	Fourth	First

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Najmi, M.; Ayari, M.A.; Sadeghsalehi, H.; Vaferi, B.; Khandakar, A.; Chowdhury, M.E.H.; Rahman, T.; Jawhar, Z.H. Estimating the Dissolution of Anticancer Drugs in Supercritical Carbon Dioxide with a Stacked Machine Learning Model. Pharmaceutics 2022, 14, 1632. https://0-doi-org.brum.beds.ac.uk/10.3390/pharmaceutics14081632

AMA Style

Najmi M, Ayari MA, Sadeghsalehi H, Vaferi B, Khandakar A, Chowdhury MEH, Rahman T, Jawhar ZH. Estimating the Dissolution of Anticancer Drugs in Supercritical Carbon Dioxide with a Stacked Machine Learning Model. Pharmaceutics. 2022; 14(8):1632. https://0-doi-org.brum.beds.ac.uk/10.3390/pharmaceutics14081632

Chicago/Turabian Style

Najmi, Maryam, Mohamed Arselene Ayari, Hamidreza Sadeghsalehi, Behzad Vaferi, Amith Khandakar, Muhammad E. H. Chowdhury, Tawsifur Rahman, and Zanko Hassan Jawhar. 2022. "Estimating the Dissolution of Anticancer Drugs in Supercritical Carbon Dioxide with a Stacked Machine Learning Model" Pharmaceutics 14, no. 8: 1632. https://0-doi-org.brum.beds.ac.uk/10.3390/pharmaceutics14081632

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Estimating the Dissolution of Anticancer Drugs in Supercritical Carbon Dioxide with a Stacked Machine Learning Model

Abstract

1. Introduction

2. Anticancer Drugs’ Solubility in Supercritical Carbon Dioxide

3. Methods

3.1. Extra Tree

3.2. Gradient Boosting

3.3. Random Forest

3.4. Stacked Model

3.5. Performance Analysis

4. Results and Discussion

4.1. Developing Base Machine Models

4.2. Designing the Stacked Model

4.3. Comparison with the Other Modeling Scenarios

4.4. Evaluating the Performance of the Stacked Model Using Graphical Analyses

4.5. Trend Analyses

4.6. Importance of Independent Variables

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI