Artificial Neural Networks to Predict the Apparent Degree of Supersaturation in Supersaturated Lipid-Based Formulations: A Pilot Study

Bennett-Lenane, Harriet; O’Shea, Joseph P.; Murray, Jack D.; Ilie, Alexandra-Roxana; Holm, René; Kuentz, Martin; Griffin, Brendan T.

doi:10.3390/pharmaceutics13091398

Open AccessArticle

Artificial Neural Networks to Predict the Apparent Degree of Supersaturation in Supersaturated Lipid-Based Formulations: A Pilot Study

¹

School of Pharmacy, University College Cork, T12 YT20 Cork, Ireland

²

Drug Product Development, Janssen Research and Development, Johnson & Johnson, Turnhoutseweg 30, 2340 Beerse, Belgium

³

Department of Physics, Chemistry and Pharmacy, University of Southern Denmark, DK-5230 Odense, Denmark

⁴

School of Life Sciences, University of Applied Sciences and Arts Northwestern Switzerland, Hofackerstrasse 30, 4132 Muttenz, Switzerland

^*

Author to whom correspondence should be addressed.

Pharmaceutics 2021, 13(9), 1398; https://0-doi-org.brum.beds.ac.uk/10.3390/pharmaceutics13091398

Submission received: 13 August 2021 / Revised: 30 August 2021 / Accepted: 1 September 2021 / Published: 5 September 2021

(This article belongs to the Special Issue Feature Papers in Physical Pharmacy and Formulation)

Download

Browse Figures

Versions Notes

Abstract

:

In response to the increasing application of machine learning (ML) across many facets of pharmaceutical development, this pilot study investigated if ML, using artificial neural networks (ANNs), could predict the apparent degree of supersaturation (aDS) from two supersaturated LBFs (sLBFs). Accuracy was compared to partial least squares (PLS) regression models. Equilibrium solubility in Capmul MCM and Maisine CC was obtained for 21 poorly water-soluble drugs at ambient temperature and 60 °C to calculate the aDS ratio. These aDS ratios and drug descriptors were used to train the ML models. When compared, the ANNs outperformed PLS for both sLBF_Capmul^MC (r² 0.90 vs. 0.56) and sLBF_Maisine^LC (r² 0.83 vs. 0.62), displaying smaller root mean square errors (RMSEs) and residuals upon training and testing. Across all the models, the descriptors involving reactivity and electron density were most important for prediction. This pilot study showed that ML can be employed to predict the propensity for supersaturation in LBFs, but even larger datasets need to be evaluated to draw final conclusions.

Keywords:

lipid-based drug delivery; computational pharmaceutics; machine learning; supersaturated lipid-based formulations

Graphical Abstract

1. Introduction

In the face of increasing pressures for accelerated development, the work of formulation scientists could be advanced through miniaturised screening tools, computational methods, and a structured approach in preclinical testing [1,2]. Currently, more conservative “tried-and-tested” approaches to formulation design are typically employed, often leading to suboptimal formulations that may disregard influential molecular and physicochemical drug properties or compound interactions with formulation excipients. However, such classical formulation development is likely to change as different computational tools are already widely used in drug discovery and are gaining momentum in pharmaceutical development. Quantity structure–activity relationships (QSARs) have streamlined the selection of candidates with optimal binding profiles [3], physiologically based pharmacokinetic (PBPK) models have aided the simulation of pharmacokinetic parameters [4], while theory- or data-driven modelling applications have improved formulation development [5,6,7,8,9,10,11,12]. Using data-driven machine learning (ML) approaches, improved success rates are achievable by ascertaining the statistical relationships between molecular descriptors and the intended response.

The main goal of predicting an outcome using input variables is the same for both partial least squares (PLS) and artificial neural network (ANN) ML algorithms. However, the mathematical approaches used differ in terms of the dimensionality reduction in data versus the potential for non-linear data fitting. PLS is a well-established multivariate regression dimensionality reduction method. The model calculates the X- and Y-matrices to find the principal components in X (independent variables) that capture most of the variance in Y (dependent variable). These initial data are projected into a latent variable space, thereby maximising the covariance between X and Y [13]. While PLS aims to find a linear (or polynomial) relationship between X and Y, ANNs represent an emerging ML algorithm. ANNs differ in their capability to detect complex non-linear X–Y relationships while detecting possible interactions between X variables [14]. ANNs mimic basic human biological information processing methods, as the structure of the multilayer perceptron (MLP) algorithm contains some main elements: input layer, hidden layer, output layer, activation functions, and connection weights. Each neuron receives signals/inputs from other neurons in the preceding layers or directly from the independent variables. This signal has an associated weighted value, which determines the strength of this interconnection. A weighted sum of these inputs is computed and transformed using an activation function to produce an output signal, which is sent to the next neurons in subsequent layers. During training, samples are passed through the network and synaptic weights are continuously adjusted until a minimum prediction error is achieved. While an in-depth analysis of ANNs can be found in the literature [15,16], current research suggests that ANNs may provide a promising alternative tool to decode complex pharmaceutical datasets.

Over the last decade, interest regarding the use of ML algorithms across diverse disciplines in pharmaceutical design and development has grown [11,17,18,19,20,21,22,23,24,25,26]. While ML models have been produced to optimise lipid-based formulation (LBF) development [3,22,27,28,29,30,31,32,33], the application of more novel ML approaches for bio-enabling formulations currently focuses on solid dispersions (SDs) [21,34,35]. However, their application to LBFs, particularly supersaturated LBFs (sLBFs), remains unexplored. LBFs, in their most utilised form of lipid solutions, aim to solubilise poorly water-soluble drugs (PWSDs) and to improve biopharmaceutical properties by simulating endogenous lipid absorption pathways [36]. However, commercial utilisation has been declining [37], likely partly attributable to the dose loading limitations given by the inherent drug solubility in the lipid vehicle [38,39]. One delivery solution has involved the development of sLBFs. These are kinetically stable solutions containing a drug concentration above the thermodynamic solubility, where increased drug loads and exposure are achieved through thermally inducing supersaturation [38,40,41]. Previously, supersaturated solutions such as sLBFs have been characterised by the apparent degree of supersaturation (aDS) ratio [42,43,44], calculated to determine the propensity of drugs to supersaturate in specific lipid systems (i.e., fold increase in drug solubility with elevation of temperature). This has been used as an indicator of the likelihood of designing sLBFs and is critical regarding the ability to maintain drug supersaturation upon storage [43]. Therefore, we hypothesise that an in silico ML model predicting aDS from molecular properties would support streamlined screening of sLBFs.

Consequently, this pilot study sought to investigate if ANN modelling could be used to predict the aDS in sLBFs using a dataset generated for 21 PWSDs. PLS regression models produced from the same dataset facilitated a comparison of the two computational techniques for this dataset. Two medium-chain (MC) and long-chain (LC)-based mono/di-glycerides formulations were chosen as mono-/di-glyceride systems that previously facilitated improved supersaturation propensity and streamlined drug-excipient screenings [38,45,46]. PLS has been previously employed in computational modelling for LBFs [29,30]. However, this study provides, to the best of our knowledge, the first investigation into the application of ANNs to predict maximum dose loading in LBFs.

2. Materials and Methods

2.1. Chemicals and Materials

Celecoxib was purchased from Astatech Inc. (Bristol, PA, USA), while cinnarizine, JNJ-2A, ibuprofen, and itraconazole were obtained from Janssen Pharmaceutica (Beerse, Belgium). Fenofibrate and indomethacine were purchased from Sigma-Aldrich (Wicklow, Ireland). Progesterone, felodipine, sulfalazine, haloperidol, danazol, naproxen, venetoclax, carvedilol, dipyridamole, niclosamide, griseofulvin, fenofibric acid, ketoconazole, and clotrimazole were purchased from Kemprotec (Carnforth, UK), and Capmul MCM C8 was kindly donated by Abitec (Columbus, OH, USA). Maisine CC was a kind gift from Gattefossé (Lyon, France). All other chemicals and solvents were of analytical or high-performance liquid chromatography (HPLC) grade, purchased from Sigma-Aldrich (Wicklow, Ireland).

2.2. Formulations

Two prototype single-component LBFs were chosen based on their previous successful applications as sLBFs [43]. The MC system contained Capmul MCM, a blend of MC mono- and di-glycerides where caprylic acid (C8) is considered the predominant fatty acid. The LC system contained Maisine CC, a blend of LC mono- and di-glycerides where linoleic acid, C18:2, is considered the predominant fatty acid. These formulations are termed sLBF_Capmul^MC and sLBF_Masine^LC when referring to solubility testing at 60 °C.

2.3. Dataset Selection/Drug Physiochemical and Molecular Properties

Twenty-one structurally diverse PWSDs were selected (Table 1), where the criteria included the availability of physicochemical properties and potential utilisation as part of a commercial LBF, or a sLBF. The compounds were classified according to their glass-forming ability (GFA) [44], where eight drugs were Class 1, three drugs Class 2, and 10 drugs Class 3. Greater than 250 molecular descriptors were predicted from ADMET Predictor 9.5 (Simulations Plus, USA) and added to the experimental drug properties of melting point (T_m), glass transition temperature (T_g), entropy of fusion (∆S_fus), enthalpy of fusion (∆H_fus), T_m/T_g, and reduced glass transition temperature (T_rg), obtained from the literature [9,38,47,48,49]. As the molecular properties can be obtained for any drug once the structure is known, they were used as input data.

2.4. Equilibrium Solubility Determination

Equilibrium drug solubility studies were conducted in both LBFs at ambient temperature (AT) (22 °C) and at an elevated temperature (60 °C). The solubility at both temperatures for cinnarizine, celecoxib, and JNJ-2A was obtained previously [43]. Solubilities for the remaining drugs were conducted using an equivalent protocol as follows: An excess amount of drug was added to 2 mL of either Capmul MCM or Maisine CC in screw cap glass vials containing a magnetic stirrer. The resulting suspensions were stirred on a stirring plate (Mixdrive 15, 2MAG, München, Germany) at 200 rpm and incubated in temperature-controlled ovens (APT.line^TM BD (E2), Binder, GmbH, Tuttlingen, Germany) at and 60 °C. Aliquots were sampled at 24, 48, and 72 h (or further, if required) and centrifuged at 21,380× g (i.e., relative centrifugal force) (Mikro 200 R, Hettich GmbH, Tuttlingen, Germany) at 22 and 40 °C, respectively, for 15 min. Daily sampling was continued until equilibrium solubility was reached, i.e., solubility between two consecutive samples differed by less than 10%. The supernatant was centrifuged under identical conditions. To solubilise the oily excipient, the supernatant was diluted 1:10 (v/v) in acetonitrile/ethyl acetate (1:3, v/v), followed by further 1:10 (v/v) dilution with acetonitrile/ethyl acetate (3:1, v/v) and a final dilution with mobile phase. The efficiency of extraction recovery was >94%, tested using a known amount of each compound. All samples were run in triplicate, and the drug concentrations were determined using an Agilent 1200 series HPLC system. The columns and HPLC testing conditions for each drug can be found in the Supplementary Materials (Table S2).

Subsequently, to assess the short-term stability upon storage at AT, following the second centrifugation step, an aliquot of supernatant from the 60 °C samples was allowed to cool at for 2 h. Then, sampling and analysis was conducted as outlined above, with values obtained presented as aDS_2h. These short-term stability studies were conducted for the majority of the compounds.

2.5. Apparent Degree of Supersaturation (aDS)

The apparent degree of supersaturation (aDS), as previously defined [42], was determined as the ratio of the concentration of the drug in the supersaturated solution according to this experimental methodology and the concentration in the saturated solution. This theoretical aDS was calculated according to Equation (1) for both sLBFs loaded with drugs at 60 °C:

a D S = C_{s u p e r s a t u r a t i o n} / S_{e q u i l i b r i u m,}

(1)

where C_{supersaturation} is the concentration of the drug determined after heating the sLBF (to 60 °C) and S_equilibrium is the equilibrium solubility at AT.

Subsequently, to facilitate comparisons of the short-term stability of the sLBFs after 2 h, a second aDS (aDS_2h) was calculated according to Equation (2):

a D S_{2 h} = C_{s u p e r s a t u r a t i o n (2 h)} / S_{e q u i l i b r i u m,}

(2)

where, in this case, C_{supersaturation(2h)} is the drug concentration in the lipid system that was heated to 60 °C, followed by cooling to AT for 2 h. The values are reported as aDS (± standard error (SE)), with the SE calculated from Equation (3):

SE = a D S \times \sqrt{\frac{S A^{2}}{A^{2}} + \frac{S B^{2}}{B^{2}},}

(3)

where A, B, SA, and SB refer to the mean measured solubility values and standard errors for the equilibrium solubility at AT (A) and the concentration of the drug in the lipid system at 60 °C with/ without 2 h of cooling (B). The graphs were obtained using Prism (Version 5, Graphpad, San Diego, CA, USA).

2.6. Differential Scanning Calorimetry

The majority of GFA classifications and T_g values were obtained from the literature. However, for fenofibric acid, progesterone, and sulfasalazine, this information was obtained experimentally using differential scanning calorimetry (DSC) equipped with a TA Q1000 with a TA Refrigerated Cooling System 90 (TA Instruments, New Castle, DE, USA). The cell was purged with nitrogen at 50 mL/min. After the midpoint glass transition temperature (T_g,mid) had been determined, crystallisation screening experiments were conducted using the protocol by Baird et al. [47]. In brief, 2 mg of drug weighed into a T-zero pan and heated at 10 °C min⁻¹ to 10 °C above the T_m of each drug (as per Table 1, held isothermally for 3 min, cooled at a rate of 20 °C min⁻¹ to −75 °C, and reheated to 10 °C at 10 °C min^–1 above the T_m of each drug. Sample weights for each repeat sample were within 1 mg and experiments were run in triplicate. GFA was categorised, according to Baird et al., into Class I (in case of crystallisation during cooling prior to the T_g), Class II (for no crystallisation during cooling, but crystallisation was observed upon reheating above T_g), and Class III (for no crystallisation observed during cooling nor reheating to T_m) [47].

2.7. Statistical Analysis

To test the significance between paired solubility values in Capmul MCM versus Maisine CC and sLBF_Capmul^MC versus sLBF_Maisine^LC, the distribution of the differences was used to determine normality, or lack thereof. A two-sided bootstrap-paired test (5000 samples) determined the significance (p < 0.05). Simple scatter plots were produced for Capmul MCM versus Maisine CC and sLBF_Capmul^MC versus sLBF_Maisine^LC, regression coefficients fitted for interpretation, and a bootstrap test for the coefficients conducted. Statistical analysis was conducted using SPSS Statistics (Version 26, IBM Corporation, Armonk, NY, USA).

2.8. Partial Least Squares Regression (PLS)

Quantitative prediction of aDS using PLS regression was conducted using Unscrambler (Version 11, Camo Analytics, Bedford, MA, USA). PLS model development followed the standard steps described previously [30]. Molecular structures were acquired as smiles from PubChem and used as inputs for the ADMET Predictor (Version 9.5, Simulations Plus, Lancaster, CA, USA) to calculate >250 molecular descriptors, which were added to T_m, T_g, ∆H_fus, ∆S_fus, T_m/T_g, and T_rg and used as variable inputs. The individual modelling responses were aDS ratios from both sLBF_Capmul^MC and sLBF_Maisine^LC. Principal component analysis (PCA) was applied for a randomised assignment of training/test data. The training set criteria were that it covered the chemical space of the test set, along with a relatively even spread of aDS ratios. A Hotelling’s T² ellipse was applied for outlier detection (95% confidence interval). The nonlinear iterative partial least squares (NIPALs) algorithm was utilised, and all variables were mean-centred, de-identified, and standardised through scaling by standard deviation. To limit the overfitting potential, a limit of two principal components was used. Variable reduction was performed as previously described [30] using Martens’ uncertainty test [50], an important variables plot, and correlation loadings plot. Model accuracy was validated by the root mean square error (RMSE) of the training and test sets.

2.9. Artificial Neural Networks (ANNs)

Multilayer perception artificial neural networks (MLP-ANNs) were produced using SPSS Statistics (Version 26, IBM Corporation, Armonk, NY, USA) to predict aDS. A partition variable using the same training/test set split was utilised to compare PLS versus ANNs. The input properties were obtained as described above and were rescaled through standardisation, where values were converted to their z-scores. A hyperbolic tangent was chosen as the activation function for the hidden layer, while an identity output function was used in the output layer [51]. Supervised learning using the scaled conjugate gradient (SCG) algorithm was chosen for its speed, and a lack of user-critical parameters [52]. Batch training was selected due to the relatively small dataset size and the learning algorithm employed. Variable reduction was initially conducted using an independent variable importance analysis. As an arbitrary criterion, only variables with a relative importance of >70% were included in the architecture going forward. Topologies with only one hidden layer were considered to avoid overfitting. The optimum number of neurons in the hidden layer was identified following a systematic trial-and-error approach, where the number of neurons in the hidden layer were manually altered between 2 and 20, with runs performed in triplicate. The optimal network size was chosen thorough minimum RMSE in the training and test sets. The most important variables in each network were elucidated from the normalised importance chart. The PLS and ANN models produced were directly compared in terms of different performance evaluation functions, including correlation coefficient (r²), training set RMSE, test set RMSE, and residuals by predicted charts.

3. Results

3.1. Comparing the Solubility of MC- and LC-based LBFs and sLBFs

The initially solubility in both LBFs (Capmul MCM and Maisine CC) at AT and both sLBFs (sLBF_Capmul^MC and sLBF_Maisine^LC) at 60 °C was compared. Significant differences were seen at AT (* p < 0.05) and at 60 °C (* p < 0.05). The beta coefficients of the regression lines of both Maisine CC versus Capmul MCM and sLBF_Maisine^LC versus sLBF_Capmul^MC were also significant (both * p < 0.05). A relatively strong correlation was established between solubility (logS) in both blends at AT (r² = 0.84). This was stronger at 60 °C (r² = 0.90) (Figure 1). Fourteen of the 21 (66%) drugs demonstrated a higher aDS ratio in sLBF_Maisine^LC versus sLBF_Capmul^MC (Figure 2). All 21 drugs showed higher solubility in Capmul MCM when compared to Maisine CC at AT. In general, this trend was repeated at 60 °C, except for fenofibrate and cinnarizine, where the order of solubility was switched, albeit not significantly so.

3.2. Apparent Degree of Supersaturation

Increases in thermally induced solubility were seen for all drugs in both the MC and LC sLBFs (aDS ratio >1), reflecting increased dose loading relative to conventional LBFs. Drug solubility in Capmul MCM, Maisine CC, sLBF_Capmul^MC, and sLBF_Maisine^LC are presented as mean ± SD (n = 3) in the Supplementary Materials (Table S1). The extent of aDS ranged from 1.04 to 3.17 in sLBF_Campul^MC and between 1.06 and 3.4 in sLBF_Maisine^LC (Figure 2). In the rank order of supersaturation propensity, the investigational drug candidates JNJ-2A and felodipine produced the lowest aDS in sLBF_Capmul^MC and sLBF_Maisine^LC, respectively. Dipyridamole demonstrated the highest aDS using both sLBFs.

While correlations between GFA class and aDS ratios have previously been observed using solvent shift-mediated supersaturation [42], our data revealed no clear trend between aDS and GFA (Figure 2). The mean aDS for sLBF_Capmul^MC and sLBF_Maisine^LC in each GFA class was 2.04 and 2.08 (class 1), 2.22 and 2.56 (class 2), and 2.05 and 2.16 (class 3), respectively, indicating that between GFA classes, no significant differences were seen. The mean aDS for the three GFA classes also did not significantly differ according to the sLBFs’ fatty acid chain length.

Upon comparison of the aDS values obtained after cooling of the 60 °C samples for 2 h at AT (aDS_2h), average differences in aDS ratio units of 0.17 (sLBF_Capmul^MC) and 0.16 (sLBF_Maisine^LC) were observed (Supplementary Materials Table S3), corresponding to average drug solubility losses of 7.9% and 7.7% upon cooling, respectively. For this dataset, which comprised drugs of a variety of chemical structures, the range of precipitation upon removal of heating was moderate, i.e., less than 20%, with 71% of drugs displaying a less than 10% loss after 2 h.

3.3. Quantitatively Predicting aDS Using PLS and ANNs

Quantitative models predicting aDS were produced using PLS and ANNs. Unabridged versions of all the drug descriptor abbreviations in this section can be found in the Supplementary Materials (Figure S1). PLS models for both aDS sLBF_Capmul^MC and aDS sLBF_Maisine^LC of two PCs and eight and nine input variables, respectively, were developed (Table 2). The aDS sLBF_Capmul^MC model produced relatively weak predictions of r² = 0.56, and in the training and test sets, the RMSE was 0.4 and 0.79 using eight variables: VMcGowans, N_Hydrgn, EEM_Afc, EEM_AFnp, SHCH_321, SHaaCH, EEM_NFc, and Pi_FMi4 (Figure 3). Martens’ uncertainty test designated SHCH_321 and EEM_NFc as the most important variables. Comparatively, the two PCs aDS sLBF_Maisine^LC PLS model displayed a correlation coefficient of r² = 0.62 and an RMSE in the training and test sets of 0.4 and 0.45, respectively, using nine input variables: HIVI-TC, N_FrRotB, NPA_Q2, EEM_Nfc, EEM_NFnp, Pi_Aqo, Pi_AQc, Pi_FPI3, and Pi_FMi6. In this case, N_FrRotB and Pi_FMi6 were the most important variables.

Using ANNs, MLP 15-5-1 for sLBF_Capmul^MC and MLP 11-8-1 for sLBF_Maisine^LC were produced (Table 2). These equated to input layers with 15 and 11 drug properties, one hidden layer of five and eight nodes, and singular output layers, i.e., predicted aDS. A strong correlation between the predicted and observed aDS values was observed for the sLBF_Capmul^MC network (r² = 0.90) (Figure 3). This demonstrated a low RMSE upon training (0.19) and testing (0.36) (Table 2). The properties included in the network were Pi_FPl5, SolFactor, N_CYPAtoms EEM_F4, Pi_FPl3, NPA_Q6, MlogP, MolVol, NPA_Q1, S+S_Intrins, EqualEta, ∆Hfus, M_CX, Pi_MinQ, and N_Electr. The normalised importance chart signified ∆H_fus, EEM_F4, and N_Electr as the three most significant variables (Figure 4). The predicted and observed aDS values for aDS sLBF_Maisine^LC were strongly correlated (r² = 0.83), as training and testing RMSEs of 0.28 and 0.25 were observed (Figure 3). The drug properties in the final network were N_Bonds, Pi_FPl1, T_Rads, MaxQ, N_Atoms, Pi_FMi1, HBDch, F_AromB, NPA_Q2, SsssCH, and NPA_Q5. MaxQ, NPA_Q5, and NPA_Q2 were the most important variables (Figure 4).

Upon model comparison, the ANNs produced improved aDS predictions for both sLBFs, as both ANN models displayed substantially stronger correlation coefficients, lower training and testing RMSEs, and smaller residuals. The residuals for both ANN models demonstrated almost complete independence and random distribution in residuals by predicted charts (Supplementary Materials Figure S2). The relatively poor performance of the PLS models indicates that their inclusion was primarily for the purpose of comparison with the ANNs.

4. Discussion

The increasing adoption of model-based approaches across drug design and development has aided in improving efficiency in pharmaceutical research. Computational tools exist across the pharmaceutical industry in many forms. However, for LBFs, thus far, the drug property-based aspects of computational pharmaceutics have focused on solubility predictions for traditional solution or self-emulsifying drug delivery system (SEDDS) formulations [27,28,29,30]. The exploration of ANNs to support LBF development remains relatively unexplored. As a result, the main purpose of this research was to investigate if an ANN model could be developed to predict the aDS in sLBFs using drug physicochemical or molecular properties. These predictions could be used to guide whether the degree of supersaturation in lipids is sufficient to enable dosing in early development.

Accordingly, as part of this pilot study, two ANNs were developed, which predicted aDS in sLBFs from their drug properties. These ANNs produced superior predictions compared to PLS models developed using the same available dataset. These ANNs predicting aDS (sLBF_Capmul^MC and sLBF_Maisine^LC), containing one hidden layer of five and eight nodes and using fifteen and eleven drug properties, respectively, yielded strong prediction accuracy performance (r² = 0.90, 0.83) and low RMSEs upon both training (0.19, 0.28) and testing (0.36, 0.25). In comparison, when using PLS, a lower accuracy of prediction (r² = 0.56, 0.62) and higher residuals and RMSEs upon training (0.4, 0.4) and testing (0.79, 0.45) were observed using eight and nine drug properties. Accordingly, this study demonstrates that ANNs can be applied to link molecular drug properties to a predicted maximum dose loading capacity, i.e., aDS upon thermal induced supersaturation.

These modelling results suggest that aDS prediction is a complex and multifaceted phenomenon, as for this dataset numerous drug descriptors and non-linear mathematical algorithms were required for higher accuracy. One explanation for the improved performance of ANNs for this dataset may be attributed to its capability in decoding multidimensional highly non-linear relationships in datasets in the hidden layer, as opposed to linear relationships of the latent variables obtained through PLS. Consequentially, this work highlights the capability of ANNs to provide an industrially applicable alternative to the more established computational pharmaceutics modelling methods such as PLS. While PLS regression has advantages versus ANNs in terms of model transparency and decreased complexity in interpretation, in situations of interrelationships or substantial non-linearity, as seen here, ANNs may improve the accuracy of prediction. Therefore, it is hoped that this pilot study can initiate future larger-scale studies to strengthen these predictions.

Modelling indicated that drug properties hold key information about aDS. Overall, a wide range of drug descriptors, reflecting topology, reactivity, structure and size, electrostatics, and thermodynamics, were significant. Trends in important properties were revealed. The three most important properties predicting aDS for sLBF_Capmul^MC were ∆H_fus (enthalpy of fusion), EEM_F4 (fourth component of the autocorrelation vector of sigma Fukui indices), and N_Electr (total number of electrons in a molecule) (Figure 4). ∆H_fus is a thermodynamic property, involving the amount of thermal energy which must be absorbed or evolved to change 1 mole of a solid to a liquid with no temperature change [53]. ∆H_fus was shown to previously inversely correlate with the potential of a drug to supersaturate from solvent shift-induced supersaturation [42]. Fukui indices are frontier orbital indices, indicating atomic electron affinity and a molecule’s ability to become polarised upon changes to electron density [54,55]. Similar Fukui indices were previously important properties governing the intrinsic dissolution rate of PWSDs in biorelevant media [56] and in support vector machine modelling to predict GFA for compounds between 200 and 300 g/mol. In this case, a high value, which denoted increased electron reactivity, suggested a non-glass former [9]. The number of electrons in a molecule is related to its reactivity, as the electrons in the outermost atom shell determine the reactivity. Generally, polarizability increases as the volume occupied by electrons increases. To predict aDS in sLBF_Maisine^LC, MaxQ (maximal PEOE partial atomic charge), NPA_Q5, and NPA_Q2 (fifth and second components of the autocorrelation vector of estimated NPA partial atomic charges) were the most significant properties (Figure 4). Both natural population analysis (NPA) and partial equalization of orbital electronegativity (PEOE) are methods to calculate partial atomic charges. They describe the charge and electron density distributions within molecules, providing clues about chemical behaviour [57,58]. Comparatively, the PLS performance was poor in terms of correlation and residual error, and therefore, PLS is more suited here as a qualitative model. The fact that PLS and ANNs use different mathematical approaches to obtain correlations, and that ANNs can incorporate interrelationships between descriptor variables, likely explains the differences in the final model variables. Despite the observed differences, the Fukui indices, partial atomic charges, and atom-type E-state indices were significant for PLS and ANN prediction, supporting their importance for aDS.

As a lack of thermodynamic stability is a fundamental limitation of sLBFs, it is imperative that supersaturation is maintained over a sufficient period to facilitate adequate absorption. In this study, after 2 h of cooling, the sLBFs maintained relatively high levels of supersaturation across a variety of drugs, i.e., >80% of the drug remained above saturation solubility. aDS was previously suggested as a guide for the likelihood of precipitation from sLBFs [43], where drugs that generated higher aDS coupled with high T_m/T_g ratios (higher crystallisation tendency) demonstrated quick precipitation on storage at 25 °C, while drugs with low aDS and low T_m/T_g ratios resulted in good storage stability. Similarly, in this study, Dipyridamole (a Class 1 GFA drug with a high T_m/T_g ratio and ∆S_fus) produced the highest aDS in both sLBFs, while Class 3 GFA JNJ-2A and felodipine, both possessing low crystallisation tendencies produced the lowest aDS. Therefore, this could provide an extended application of these models to anticipate the precipitation potential, with reference to the indicators of crystallisation tendency (T_m/T_g, ∆S_fus) [47]. However, investigations regarding the overall accuracy of this combination were not within the scope of this current pilot study.

The influence of fatty acid chain length in terms of both aDS and drug solubility between the MC- and LC-based mono/di-glyceride blends was also observed. Similarly, to previous work involving MC and LC triglycerides [29,30], a relatively strong correlation was found between solubility in both blends at AT. Interestingly, it appeared that the common effect of heating became more influential for solubility rather than the properties of the lipids, as heating increased the strength of the correlation. While the solubility was higher in sLBF_Capmul^MC for the majority of drugs, approximately 60% demonstrated higher aDS in sLBF_Maisine^LC. This was potentially aided by the generally lower drug solvation in the long-chain formulation at AT, thereby permitting higher aDS gains upon heating.

Finally, as recent expert commentary has emphasised various shortcomings of data-driven modelling [11], we acknowledge the dataset used in model development here is limited in size (Supplementary Materials—Modelling Database). As such, this work was essentially a pilot study seeking to investigate the potential of ANNs to improve the accuracy of predictive models. Accordingly, the authors support strategies for further research using a larger dataset to confirm the correlations obtained and have provided the ANN models as predictive model markup language (PMML) in the Supplementary Materials (LC PMML and MC PMML). This will further clarify which molecular properties are significant for aDS, extending the applicability of the models. Notwithstanding this limitation, this pilot study successfully achieved the intended goal of demonstrating the robust predictive power of ANNs to LBF datasets.

5. Conclusions

This pilot study explored the application of ANNs as a computational technique to predict aDS in sLBFs. The ANN models demonstrated accuracy in the quantitative prediction of the aDS ratios versus PLS models from the same dataset. These models, while demonstrating ANNs’ ability to capture complex data relationships, also facilitated greater insight into the relationship between drug properties and supersaturation propensity. It was revealed that this complex phenomenon is related to the molecular descriptors of electron density and chemical reactivity. The study impacts support the application of ML-based computational pharmaceutics in early LBF development testing. Future research with larger datasets will be needed to confirm this pilot study’s findings. Moving forward, integration and dissemination of computational expertise and in silico tools will be vital for efficient decision-making in the development of lipid-based drug delivery systems of the future.

Supplementary Materials

The following are available online at https://0-www-mdpi-com.brum.beds.ac.uk/article/10.3390/pharmaceutics13091398/s1, Table S1: Equilibrium solubility values and aDS for the dataset of 21 drugs using Capmul MCM and Maisine CC; Table S2: RP-HPLC/UV methods utilised in this study; Table S3: Equilibrium solubility and aDS_2h in sLBF_Capmul^MC and sLBF_Maisine^LC and the aDS ratio difference from the average aDS and the average aDS_2h used to investigate the short-term stability of the sLBF after cooling at AT; Figure S1 Unabridged abbreviations of the independent input variables used in the final PLS and ANN models; Figure S2: Predicted by residual plots for sLBF_Capmul^MC and sLBF_Maisine^LC using PLS and ANN modelling. Modelling Database: Database containing the drug molecular descriptors used in model development. LC PMML and MC PMML: XML-format files containing the two ANN models produced in the study to predict aDS in PMML.

Author Contributions

Conceptualisation, H.B.-L. and B.T.G.; methodology, A.-R.I. and H.B.-L.; software, H.B.-L. and J.D.M.; writing—original draft preparation, H.B.-L.; writing—review and editing, B.T.G., M.K., J.P.O. and R.H.; supervision, B.T.G. and J.P.O. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported under funding from the Irish Research Council Post Graduate Scholarship Project, number GOIPG/2018/883. Alexandra-Roxana Ilie is part of the PEARRL European Training network, which received funding from the Horizon 2020 Marie Skłodowska-Curie Innovative Training Networks programme under grant agreement no. 674909.

Data Availability Statement

The database used for model development in this study is available in the Supplementary Materials (Modelling Database).

Acknowledgments

The authors would like to thank Marc Joosten for his assistance in the solubility analysis.

Conflicts of Interest

The authors declare no conflict of interest.

References

Kuentz, M.; Holm, R.; Elder, D.P. Methodology of oral formulation selection in the pharmaceutical industry. Eur. J. Pharm. Sci. 2016, 87, 136–163. [Google Scholar] [CrossRef] [PubMed]
Kuentz, M.; Holm, R.; Kronseder, C.; Saal, C.; Griffin, B.T. Rational Selection of Bio-Enabling Oral Drug Formulations–A PEARRL Commentary. J. Pharm. Sci. 2021, 110, 1921–1930. [Google Scholar] [CrossRef] [PubMed]
Bergstrom, C.A.S.; Charman, W.N.; Porter, C.J.H. Computational prediction of formulation strategies for beyond-rule-of-5 compounds. Adv. Drug Deliv. Rev. 2016, 101, 6–21. [Google Scholar] [CrossRef] [PubMed]
Zhuang, X.; Lu, C. PBPK modeling and simulation in drug research and development. Acta Pharm. Sin. B 2016, 6, 430–440. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Rane, S.S.; Anderson, B.D. What determines drug solubility in lipid vehicles: Is it predictable? Adv. Drug Deliv. Rev. 2008, 60, 638–656. [Google Scholar] [CrossRef] [PubMed]
Niederquell, A.; Wyttenbach, N.; Kuentz, M. New prediction methods for solubility parameters based on molecular sigma profiles using pharmaceutical materials. Int. J. Pharm. 2018, 546, 137–144. [Google Scholar] [CrossRef]
DeBoyace, K.; Wildfong, P.L. The Application of Modeling and Prediction to the Formation and Stability of Amorphous Solid Dispersions. J. Pharm. Sci. 2018, 107, 57–74. [Google Scholar] [CrossRef] [Green Version]
Birru, W.A.; Warren, D.B.; Han, S.; Benameur, H.; Porter, C.J.; Pouton, C.W.; Chalmers, D.K. Computational Models of the Gastrointestinal Environment. Phase Behavior and Drug Solubilization Capacity of a Type I Lipid-Based Drug Formulation after Digestion. Mol. Pharm. 2017, 14, 580–592. [Google Scholar] [CrossRef]
Alhalaweh, A.; Alzghoul, A.; Kaialy, W.; Mahlin, D.; Bergström, C.A.S. Computational Predictions of Glass-Forming Ability and Crystallization Tendency of Drug Molecules. Mol. Pharm. 2014, 11, 3123–3132. [Google Scholar] [CrossRef]
Hossain, S.; Kabedev, A.; Parrow, A.; Bergström, C.A.; Larsson, P. Molecular simulation as a computational pharmaceutics tool to predict drug solubility, solubilization processes and partitioning. Eur. J. Pharm. Biopharm. 2019, 137, 46–55. [Google Scholar] [CrossRef]
Kuentz, M.; Bergström, C.A. Synergistic Computational Modeling Approaches as Team Players in the Game of Solubility Predictions. J. Pharm. Sci. 2021, 110, 22–34. [Google Scholar] [CrossRef] [PubMed]
Wyttenbach, N.; Niederquell, A.; Kuentz, M. Machine Estimation of Drug Melting Properties and Influence on Solubility Prediction. Mol. Pharm. 2020, 17. [Google Scholar] [CrossRef]
Panagou, E.; Mohareb, F.; Argyri, A.; Bessant, C.; Nychas, G.-J.E. A comparison of artificial neural networks and partial least squares modelling for the rapid detection of the microbial spoilage of beef fillets based on Fourier transform infrared spectral fingerprints. Food Microbiol. 2011, 28, 782–790. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Farizawani, A.; Puteh, M.; Marina, Y.; Rivaie, A. A review of artificial neural network learning rule based on multiple variant of conjugate gradient approaches. J. Phys. Conf. Ser. 2020, 1529. [Google Scholar] [CrossRef]
Bourquin, J.; Schmidli, H.; Van Hoogevest, P.; Leuenberger, H. Basic Concepts of Artificial Neural Networks (ANN) Modeling in the Application to Pharmaceutical Development. Pharm. Dev. Technol. 1997, 2, 95–109. [Google Scholar] [CrossRef] [PubMed]
Tu, J.V. Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes. J. Clin. Epidemiol. 1996, 49, 1225–1231. [Google Scholar] [CrossRef]
Aksu, B.; Paradkar, A.; De Matas, M.; Özer, O.; Güneri, T.; York, P. Quality by Design Approach: Application of Artificial Intelligence Techniques of Tablets Manufactured by Direct Compression. AAPS PharmSciTech 2012, 13, 1138–1146. [Google Scholar] [CrossRef]
Damiati, S.A.; Martini, L.G.; Smith, N.W.; Lawrence, M.J.; Barlow, D.J. Application of machine learning in prediction of hydrotrope-enhanced solubilisation of indomethacin. Int. J. Pharm. 2017, 530, 99–106. [Google Scholar] [CrossRef] [Green Version]
Yang, Y.; Ye, Z.; Su, Y.; Zhao, Q.; Li, X.; Ouyang, D. Deep learning for in vitro prediction of pharmaceutical formulations. Acta Pharm. Sin. B 2019, 9, 177–185. [Google Scholar] [CrossRef]
Barmpalexis, P.; Karagianni, A.; Nikolakakis, I.; Kachrimanis, K. Artificial neural networks (ANNs) and partial least squares (PLS) regression in the quantitative analysis of cocrystal formulations by Raman and ATR-FTIR spectroscopy. J. Pharm. Biomed. Anal. 2018, 158, 214–224. [Google Scholar] [CrossRef]
Gao, H.; Wang, W.; Dong, J.; Ye, Z.; Ouyang, D. An integrated computational methodology with data-driven machine learning, molecular modeling and PBPK modeling to accelerate solid dispersion formulation design. Eur. J. Pharm. Biopharm. 2021, 158, 336–346. [Google Scholar] [CrossRef] [PubMed]
Brinkmann, J.; Exner, L.; Luebbert, C.; Sadowski, G. In-Silico Screening of Lipid-Based Drug Delivery Systems. Pharm. Res. 2020, 37, 1–12. [Google Scholar] [CrossRef] [PubMed]
Galata, D.L.; Farkas, A.; Könyves, Z.; Mészáros, L.A.; Szabó, E.; Csontos, I.; Pálos, A.; Marosi, G.; Nagy, Z.K. Fast, Spectroscopy-Based Prediction of In Vitro Dissolution Profile of Extended Release Tablets Using Artificial Neural Networks. Pharmaceutics 2019, 11, 400. [Google Scholar] [CrossRef] [Green Version]
Djuris, J.; Cirin-Varadjan, S.; Aleksic, I.; Djuris, M.; Cvijic, S.; Ibric, S. Application of Machine-Learning Algorithms for Better Understanding of Tableting Properties of Lactose Co-Processed with Lipid Excipients. Pharmaceutics 2021, 13, 663. [Google Scholar] [CrossRef]
Tosca, E.; Bartolucci, R.; Magni, P. Application of Artificial Neural Networks to Predict the Intrinsic Solubility of Drug-Like Molecules. Pharmaceutics 2021, 13, 1101. [Google Scholar] [CrossRef] [PubMed]
Van Hauwermeiren, D.; Stock, M.; De De Beer, T.; Nopens, I. Predicting Pharmaceutical Particle Size Distributions Using Kernel Mean Embedding. Pharmaceutics 2020, 12, 271. [Google Scholar] [CrossRef] [Green Version]
Alskar, L.C.; Porter, C.J.; Bergstrom, C.A.S. Tools for Early Prediction of Drug Loading in Lipid-Based Formulations. Mol. Pharm. 2016, 13, 251–261. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Alskar, L.C.; Keemink, J.; Johannesson, J.; Porter, C.J.H.; Bergstrom, C.A.S. Impact of Drug Physicochemical Properties on Lipolysis-Triggered Drug Supersaturation and Precipitation from Lipid-Based Formulations. Mol. Pharm. 2018, 15, 4733–4744. [Google Scholar] [CrossRef] [Green Version]
Persson, L.C.; Porter, C.; Charman, W.; Bergström, C.A.S. Computational Prediction of Drug Solubility in Lipid Based Formulation Excipients. Pharm. Res. 2013, 30, 3225–3237. [Google Scholar] [CrossRef]
Bennett-Lenane, H.; Koehl, N.J.; O’Dwyer, P.J.; Box, K.J.; O’Shea, J.P.; Griffin, B.T. Applying Computational Predictions of Biorelevant Solubility Ratio Upon Self-Emulsifying Lipid-Based Formulations Dispersion to Predict Dose Number. J. Pharm. Sci. 2020, 110, 164–175. [Google Scholar] [CrossRef] [PubMed]
Sacchetti, M.; Nejati, E. Prediction of drug solubility in lipid mixtures from the individual ingredients. AAPS PharmSciTech. 2012, 13, 1103–1109. [Google Scholar] [CrossRef] [Green Version]
Alsenz, J.; Kuentz, M. From Quantum Chemistry to Prediction of Drug Solubility in Glycerides. Mol. Pharm. 2019, 16, 4661–4669. [Google Scholar] [CrossRef]
Brinkmann, J.; Exner, L.; Verevkin, S.P.; Luebbert, C.; Sadowski, G. PC-SAFT Modeling of Phase Equilibria Relevant for Lipid-Based Drug Delivery Systems. J. Chem. Eng. Data 2021, 66, 1280–1289. [Google Scholar] [CrossRef]
Han, R.; Xiong, H.; Ye, Z.; Yang, Y.; Huang, T.; Jing, Q.; Lu, J.; Pan, H.; Ren, F.; Ouyang, D. Predicting physical stability of solid dispersions by machine learning techniques. J. Control. Release 2019, 311-312, 16–25. [Google Scholar] [CrossRef]
Mendyk, A.; Pacławski, A.; Szafraniec-Szczęsny, J.; Antosik, A.; Jamróz, W.; Paluch, M.; Jachowicz, R. Data-Driven Modeling of the Bicalutamide Dissolution from Powder Systems. AAPS PharmSciTech 2020, 21, 111–119. [Google Scholar] [CrossRef] [Green Version]
O’Driscoll, C.; Griffin, B. Biopharmaceutical challenges associated with drugs with low aqueous solubility—The potential impact of lipid-based formulations. Adv. Drug Deliv. Rev. 2008, 60, 617–624. [Google Scholar] [CrossRef]
Bennett-Lenane, H.; O’Shea, J.P.; O’Driscoll, C.M.; Griffin, B.T. A Retrospective Biopharmaceutical Analysis of >800 Approved Oral Drug Products: Are Drug Properties of Solid Dispersions and Lipid-Based Formulations Distinctive? J. Pharm. Sci. 2020, 109, 3248–3261. [Google Scholar] [CrossRef]
Koehl, N.J.; Henze, L.J.; Kuentz, M.; Holm, R.; Griffin, B.T. Supersaturated Lipid-Based Formulations to Enhance the Oral Bioavailability of Venetoclax. Pharmaceutics 2020, 12, 564. [Google Scholar] [CrossRef]
Thomas, N.; Holm, R.; Müllertz, A.; Rades, T. In vitro and in vivo performance of novel supersaturated self-nanoemulsifying drug delivery systems (super-SNEDDS). J. Control. Release 2012, 160, 25–32. [Google Scholar] [CrossRef] [PubMed]
Michaelsen, M.H.; Wasan, K.M.; Sivak, O.; Müllertz, A.; Rades, T. The Effect of Digestion and Drug Load on Halofantrine Absorption from Self-nanoemulsifying Drug Delivery System (SNEDDS). AAPS J. 2016, 18, 180–186. [Google Scholar] [CrossRef] [Green Version]
Thomas, N.; Holm, R.; Garmer, M.; Karlsson, J.J.; Müllertz, A.; Rades, T. Supersaturated Self-Nanoemulsifying Drug Delivery Systems (Super-SNEDDS) Enhance the Bioavailability of the Poorly Water-Soluble Drug Simvastatin in Dogs. AAPS J. 2013, 15, 219–227. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Blaabjerg, L.I.; Lindenberg, E.; Löbmann, K.; Grohganz, H.; Rades, T. Is there a correlation between the glass forming ability of a drug and its supersaturation propensity? Int. J. Pharm. 2018, 538, 243–249. [Google Scholar] [CrossRef]
Ilie, A.-R.; Griffin, B.T.; Kolakovic, R.; Vertzoni, M.; Kuentz, M.; Holm, R. Supersaturated lipid-based drug delivery systems–exploring impact of lipid composition type and drug properties on supersaturability and physical stability. Drug Dev. Ind. Pharm. 2020, 46, 356–364. [Google Scholar] [CrossRef] [PubMed]
Palmelund, H.; Madsen, C.M.; Plum, J.; Müllertz, A.; Rades, T. Studying the Propensity of Compounds to Supersaturate: A Practical and Broadly Applicable Approach. J. Pharm. Sci. 2016, 105, 3021–3029. [Google Scholar] [CrossRef]
Ilie, A.-R.; Griffin, B.T.; Vertzoni, M.; Kuentz, M.; Cuyckens, F.; Wuyts, K.; Kolakovic, R.; Holm, R. Toward simplified oral lipid-based drug delivery using mono-/di-glycerides as single component excipients. Drug Dev. Ind. Pharm. 2020, 46, 2051–2060. [Google Scholar] [CrossRef] [PubMed]
Holm, R. Bridging the gaps between academic research and industrial product developments of lipid-based formulations. Adv. Drug Deliv. Rev. 2019, 142, 118–127. [Google Scholar] [CrossRef]
Baird, J.A.; Van Eerdenbrugh, B.; Taylor, L. A Classification System to Assess the Crystallization Tendency of Organic Molecules from Undercooled Melts. J. Pharm. Sci. 2010, 99, 3787–3806. [Google Scholar] [CrossRef]
Baghel, S.; Cathcart, H.; Redington, W.; O’Reilly, N. An investigation into the crystallization tendency/kinetics of amorphous active pharmaceutical ingredients: A case study with dipyridamole and cinnarizine. Eur. J. Pharm. Biopharm. 2016, 104, 59–71. [Google Scholar] [CrossRef]
Alhalaweh, A.; Alzghoul, A.; Bergström, C.A. Molecular Drivers of Crystallization Kinetics for Drugs in Supersaturated Aqueous Solutions. J. Pharm. Sci. 2019, 108, 252–259. [Google Scholar] [CrossRef] [Green Version]
Forina, M.; Lanteri, S.; Oliveros, M.C.C.; Millán, C.P. Selection of useful predictors in multivariate calibration. Anal. Bioanal. Chem. 2004, 380, 397–418. [Google Scholar] [CrossRef]
Alshalif, S.A.; Ibrahim, N.; Herawan, T. (Eds.) Artificial Neural Network with Hyperbolic Tangent Activation Function to Improve the Accuracy of COCOMO II Model. Recent Advances on Soft Computing and Data Mining; Springer International Publishing: Berlin/Heidelberg, Germany, 2016. [Google Scholar] [CrossRef]
Møller, M.F. A scaled conjugate gradient algorithm for fast supervised learning. Neural Netw. 1993, 6, 525–533. [Google Scholar] [CrossRef]
Kirkham, M.B. Chapter 3-Structure and Properties of Water. In Principles of Soil and Plant Water Relations, 2nd ed.; Kirkham, M.B., Ed.; Academic Press: Cambridge, MA, USA, 2014; pp. 27–40. [Google Scholar]
Fradera, X.; Solà, M. Second-order atomic Fukui indices from the electron-pair density in the framework of the atoms in molecules theory. J. Comput. Chem. 2003, 25, 439–446. [Google Scholar] [CrossRef] [PubMed]
Fukui, K.; Yonezawa, T.; Shingu, H. A Molecular Orbital Theory of Reactivity in Aromatic Hydrocarbons. J. Chem. Phys. 1952, 20, 722–725. [Google Scholar] [CrossRef]
Teleki, A.; Nylander, O.; Bergström, C.A. Intrinsic Dissolution Rate Profiling of Poorly Water-Soluble Compounds in Biorelevant Dissolution Media. Pharmaceutics 2020, 12, 493. [Google Scholar] [CrossRef] [PubMed]
Geidl, S.; Bouchal, T.; Raček, T.; Vařeková, R.S.; Hejret, V.; Křenek, A.; Abagyan, R.; Koča, J. High-quality and universal empirical atomic charges for chemoinformatics applications. J. Chemin 2015, 7, 1–10. [Google Scholar] [CrossRef] [Green Version]
Gasteiger, J.; Marsili, M. Iterative partial equalization of orbital electronegativity—A rapid access to atomic charges. Tetrahedron 1980, 36, 3219–3228. [Google Scholar] [CrossRef]

Figure 1. Scatter plots of the solubility in Capmul MCM versus Maisine CC (a) and sLBF_Capmul^MC versus sLBF_Maisine^LC (b). Formulation abbreviations can be inferred from the main text.

Figure 2. Apparent degree of supersaturation (aDS) ratios achieved for the dataset in both sLBF_Capmul^MC and sLBF_Maisine^LC. No clear aDS trend was elucidated in terms of the glass-forming ability (GFA) classification (as grouped). Details and definitions of the abbreviations are given in the text.

Figure 3. Scatter plots illustrating the predicted versus observed aDS values obtained for aDS sLBF_Capmul^MC using PLS (r² = 0.56) and ANNs (r² = 0.90) (a,c). Scatter plots illustrating the predicted versus observed aDS values obtained for aDS sLBF_Maisine^LC using PLS (r² = 0.62) and ANNs (r² = 0.83) (b,d).

Figure 4. Normalised importance charts of the ANNs for sLBF_Capmul^MC (a) and sLBF_Maisine^LC (b) detailing the percentage importance of the input variables in predicting aDS. Details and explained abbreviations are given in the main text and Supplementary Materials (Figure S1).

Table 1. Selection of the physicochemical and molecular properties of the investigated compounds collated from the literature, predicted from ADMET Predictor 9.5 or obtained experimentally using DSC. AMPH refers to ampholyte.

Drug Compound	MW (g/mol)	clogP	logD_6.5	Acid/ Base/ Neutral	GFA Class	T_m (°C)	T_g (°C)	∆H_fus (kJ/mol)	∆S_fus × 0.01 (kJ/mol/K)	T_m/T_g	T_rg	HBA	HBD	Rotatable Bonds
Carvedilol	406.49	3.88	2.36	B	III	114.5	41.9	53.00	13.67	1.23	0.81	5	3	10
Celecoxib	381.38	3.81	3.81	A	II	163	58	34.10	7.80	1.32	0.76	4	1	2
Cinnarizine	368.53	4.92	3.98	B	II	121	8.5	37.50	9.50	1.39	0.72	2	0	5
Clotrimazole	344.85	5.08	5.06	B	III	148	30	33.34	7.97	1.39	0.72	1	0	4
Danazol	337.47	4.26	4.26	N	II	225.5	88.3	35.50	7.12	1.38	0.73	3	1	1
Dipyridamole	504.64	3.11	3.02	B	I	163	40.4	72.00	16.51	1.39	0.72	12	4	12
Felodipine	384.26	5.03	5.03	B	III	145	45	30.98	7.38	1.31	0.76	5	1	4
Fenofibrate	360.84	5.20	5.20	N	III	79	−19	33.00	9.32	1.39	0.72	4	0	5
Fenofibric acid	318.76	3.98	1.25	A	I	184	35.4	99.00	21.66	1.48	0.68	4	1	3
Griseofulvin	352.77	2.51	2.51	N	I	245	89	39.12	7.96	1.36	0.73	6	0	3
Haloperidol	375.87	3.82	2.06	B	I	148	33	54.26	12.80	1.38	0.73	3	1	5
Ibuprofen	206.29	3.64	1.69	A	III	77	−45	26.50	7.56	1.54	0.65	2	1	4
Indometdacin	357.80	4.03	1.45	A	III	161	45	37.60	8.64	1.37	0.73	4	1	3
Itraconazole	705.65	4.89	4.89	B	III	168	58	57.60	13.00	1.33	0.75	9	0	10
JNJ-2A	498.90	5.40	5.40	N	III	142	91.2	22.90	5.50	1.14	0.88	4	3	7
Ketoconazole	531.44	3.67	3.51	B	III	146	45	52.85	12.50	1.32	0.76	7	0	8
Naproxen	230.27	3.21	1.10	A	I	152	5.9	25.65	6.03	1.52	0.66	3	1	3
Niclosamide	327.13	4.03	4.02	A	I	230	86	40.70	8.01	1.40	0.71	5	2	2
Progesterone	314.47	3.94	3.94	N	I	130	55.2	23.67	5.87	1.23	0.81	2	0	1
Sulfalazine	398.40	3.15	−0.35	A	I	245	54.6	99.00	20.08	1.58	0.63	9	3	3
Venetoclax	868.46	6.68	6.54	AMPH	III	138	64	18.40	4.50	1.22	0.82	12	3	11

Table 2. Overview of the ANNs produced to predict aDS for sLBF_Capmul^MC and sLBF_Maisine^LC from their drug properties, including their architecture and various performance indicators. Tr and Te refer to the training and test sets.

Y Variable	Model Type	Architecture	Input Variables	r²	RMSE Tr	RMSE Te
aDS sLBF_Capmul^MC	PLS	2 PCs	VMcGowan, N_Hydrogn, SHCH_321, SHaaCH, EEM_Afc, EEM_Afnp, EEM_NFc, and Pi_FMi4	0.56	0.40	0.79
aDS sLBF_Capmul^MC	ANN	1 hidden layer, 5 nodes	Pi_FPl5, NPA_Q6, ∆H_fus, EEM_F4, EqualEta, M_CX, MlogP, MolVol, N_CYPAtoms, N_Electr, NPA_Q1, Pi_FPl3, Pi_MinQ, S+S_Intrins, and SolFactor	0.90	0.19	0.36
aDS sLBF_Maisine^LC	PLS	2 PCs	HIVI-TC, N_FrRotB, NPA_Q2, EEM_Nfc, EEM_NFnp, Pi_AQo, Pi_AQc, Pi_FPI3, and Pi_FMi6	0.62	0.40	0.45
aDS sLBF_Maisine^LC	ANN	1 hidden layer, 8 nodes	F_AromB, HBDch, MaxQ, N_Atoms, N_Bonds, NPA_Q2, NPA_Q5, Pi_FMi1, Pi_FPl1, SsssCH, and T_Rads	0.83	0.28	0.25

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Bennett-Lenane, H.; O’Shea, J.P.; Murray, J.D.; Ilie, A.-R.; Holm, R.; Kuentz, M.; Griffin, B.T. Artificial Neural Networks to Predict the Apparent Degree of Supersaturation in Supersaturated Lipid-Based Formulations: A Pilot Study. Pharmaceutics 2021, 13, 1398. https://0-doi-org.brum.beds.ac.uk/10.3390/pharmaceutics13091398

AMA Style

Bennett-Lenane H, O’Shea JP, Murray JD, Ilie A-R, Holm R, Kuentz M, Griffin BT. Artificial Neural Networks to Predict the Apparent Degree of Supersaturation in Supersaturated Lipid-Based Formulations: A Pilot Study. Pharmaceutics. 2021; 13(9):1398. https://0-doi-org.brum.beds.ac.uk/10.3390/pharmaceutics13091398

Chicago/Turabian Style

Bennett-Lenane, Harriet, Joseph P. O’Shea, Jack D. Murray, Alexandra-Roxana Ilie, René Holm, Martin Kuentz, and Brendan T. Griffin. 2021. "Artificial Neural Networks to Predict the Apparent Degree of Supersaturation in Supersaturated Lipid-Based Formulations: A Pilot Study" Pharmaceutics 13, no. 9: 1398. https://0-doi-org.brum.beds.ac.uk/10.3390/pharmaceutics13091398

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Artificial Neural Networks to Predict the Apparent Degree of Supersaturation in Supersaturated Lipid-Based Formulations: A Pilot Study

Abstract

1. Introduction

2. Materials and Methods

2.1. Chemicals and Materials

2.2. Formulations

2.3. Dataset Selection/Drug Physiochemical and Molecular Properties

2.4. Equilibrium Solubility Determination

2.5. Apparent Degree of Supersaturation (aDS)

2.6. Differential Scanning Calorimetry

2.7. Statistical Analysis

2.8. Partial Least Squares Regression (PLS)

2.9. Artificial Neural Networks (ANNs)

3. Results

3.1. Comparing the Solubility of MC- and LC-based LBFs and sLBFs

3.2. Apparent Degree of Supersaturation

3.3. Quantitatively Predicting aDS Using PLS and ANNs

4. Discussion

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI