Next Article in Journal
A New Adaptive Fuzzy PID Controller Based on Riccati-Like Equation with Application to Vibration Control of Vehicle Seat Suspension
Previous Article in Journal
Correlation between the Morphology of ZnO Layers and the Electroluminescence of Quantum Dot Light-Emitting Diodes
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comparing AutoDock and Vina in Ligand/Decoy Discrimination for Virtual Screening

by
Tatiana F. Vieira
and
Sérgio F. Sousa
*
UCIBIO@REQUIMTE, BioSIM—Departamento de Biomedicina, Faculdade de Medicina da Universidade do Porto, 4200-319 Porto, Portugal
*
Author to whom correspondence should be addressed.
Submission received: 3 October 2019 / Revised: 18 October 2019 / Accepted: 21 October 2019 / Published: 25 October 2019
(This article belongs to the Section Applied Biosciences and Bioengineering)

Abstract

:
AutoDock and Vina are two of the most widely used protein–ligand docking programs. The fact that these programs are free and available under an open source license, also makes them a very popular first choice for many users and a common starting point for many virtual screening campaigns, particularly in academia. Here, we evaluated the performance of AutoDock and Vina against an unbiased dataset containing 102 protein targets, 22,432 active compounds and 1,380,513 decoy molecules. In general, the results showed that the overall performance of Vina and AutoDock was comparable in discriminating between actives and decoys. However, the results varied significantly with the type of target. AutoDock was better in discriminating ligands and decoys in more hydrophobic, poorly polar and poorly charged pockets, while Vina tended to give better results for polar and charged binding pockets. For the type of ligand, the tendency was the same for both Vina and AutoDock. Bigger and more flexible ligands still presented a bigger challenge for these docking programs. A set of guidelines was formulated, based on the strengths and weaknesses of both docking program and their limits of validation.

1. Introduction

The use of computational methods is a crucial part of the drug discovery, development, and optimization process. Protein–ligand docking and virtual screening are two of the most used techniques in this field that continue to show promise in hit identification and subsequent optimization [1]. They are also helpful tools for drug repositioning [2,3,4]. These methods are effective and fast, and allow researchers to evaluate large virtual databases of molecular compounds as a first attempt to guide the selection of more limited sets of compounds for experimental testing. They do, however, possess few limitations [5,6].
Protein–ligand docking is a computational technique that predicts the conformation and orientation (pose) of a ligand when it is bound to a given protein [1,7,8,9,10,11,12]. With this method, the ligand-target interactions are modeled to achieve an optimal complementarity of steric and physicochemical properties [13]. This methodology has made possible the visualization of the potential interactions between a ligand and its target [14].
Docking, however, still faces difficulties, particularly regarding the correct modeling of ligand and protein flexibility [15,16,17,18] and of water-mediated interactions [18,19]. It is widely used for small molecules, but its use for small peptides and other larger biomolecules has only been under development in the last decade [20,21,22].
Typically, the docking software is an interplay between the search algorithm, which explores and generates different poses of the ligand, and the scoring function, which estimates the binding affinities of the poses previously created, discriminating between the best and not so good alternatives [1,5]. This estimate, which in some cases is a prediction of the free energy of binding, must be able to discriminate between molecules that bind to the target and those that do not [23]. When looking at the two enantiomers, for example, it is still not possible to identify the most active form with most of the scoring functions used by the most common docking software [24].
Even with all the significant improvements in computational power and docking software, considering all interactions that happen when a ligand binds to its target is an extremely challenging task. In order to be rigorous, the scoring functions would have to be much more complex, involve quantum calculations and, thus, these assays would turn out to be considerably expensive and time-consuming. When applying a virtual screening protocol, one wishes to screen very large databases of compounds in a relatively small period of time and, therefore, scoring functions are often simplified to improve the speed and cost of the computational screenings [6]. These simplifications come with a cost in accuracy, which might not be problematic for one ligand-target situation but takes a much more challenging scope when talking about virtual screening of thousands or millions of compounds [5].
The goal of virtual screening (VS) is to guide the selection of molecules for experimental testing. In these assays, millions of compounds are docked into one specific target and only a selection of the top scores proceeds for experimental testing. If a scoring functions fails to identify a potential strong binder, then, it remains hidden among those million compounds, despite their pharmacological potential. In fact, that is one of the main problems in VS, the false negatives, or molecules that the docking fails to identify as strong ligands. False positives are also a problem, that is, molecules that are incorrectly identified as strong binders. These molecules, however, are easily discarded in the preliminary experimental assays [25].
It is, however, difficult to compare the performance of different docking alternatives because each software handles the target and ligand in different manners. Additionally, it has been shown that the docking and VS results vary according to the type of target and ligand molecule [26,27,28,29,30]. In this study, two of the most commonly used docking tools—AutoDock (version 4.2.6) and Vina (AutoDock Vina)—were evaluated for different types of targets and ligands, using an unbiased reference validation set—Directory of Useful Decoys–Enhanced (DUD–E) [31]. Both docking programs are widely used to this day, for a large diversity of targets and problems [32,33,34,35,36,37,38,39].
AutoDock 4 is a well-known docking program developed by Morris and co-workers [40,41,42,43] at the Scripps Research Institute. Its free availability to academic users, together with the good accuracy and high versatility shown, had made it a very popular first choice for new users and have contributed to a widespread use of AutoDock, well portrayed in its impressively high number of citations. AutoDock 4 offers a variety of search algorithms and a scoring function that is based on a linear regression analysis, the Assisted Model Building with Energy Refinement (AMBER) force field, and a large set of diverse protein–ligand complexes with known inhibition constants. The program could be used with a visual interface called AutoDock Tools (ADT) which ensures an efficient analysis of the docking results.
AutoDock Vina [44,45,46] is a docking program developed by Trott and Olson also at the Scripps Research Institute, La Jolla, California, following the success of previous AutoDock versions. Vina is freely accessible to a large number of users, as it is open source. AutoDock Vina inherits some of the ideas and approaches of AutoDock 4, but it is designed in a conceptually different way. It offers significant improvements in the average accuracy of the binding mode predictions, while also being up to two orders of magnitude faster than AutoDock 4. It also features new search algorithm and a hybrid scoring function, combining empirical and knowledge-based scoring function. Its multi-core capability, high performance and enhanced accuracy, ease-of-use and free availability have contributed to an extremely fast dissemination through the docking community, well-portrayed in the high number of citations of the original paper. Its high computational efficiency and ability to use multiple CPUs or CPU cores also makes this program a competitive alternative for virtual screening.
The Directory of Useful Decoys–Enhanced (DUD–E) [31] holds a collection of decoys and ligands for benchmarking virtual screening, containing 22,432 active compounds and their affinities against 102 targets set by Huang et al. For each of the active compounds (i.e., the ligands), this database contains a set of 50 “decoys”, i.e., molecules with similar 1-D physico-chemical properties to remove bias (e.g., molecular weight, calculated LogP), but dissimilar 2-D topology to be likely non-binders, i.e., inactives. These characteristics make DUD–E a challenging dataset to test scoring functions and protein–ligand docking algorithms. Ideally, the perfect scoring function would rank the active molecules higher than the decoys, but that is not often the case.

2. Materials and Methods

The performance of AutoDock 4 and Vina was measured using the Directory of Useful Decoys–Enhanced (DUD–E). DUD–E contains a large collection of decoys and ligands that can be used for benchmarking ligand/decoys discrimination in virtual screening tests. The DUD–E dataset has been widely used to validate data from other open source such as Dock [47,48] and commercial programs such as Gold, Glide, Surflex, and FlexX [48]. It is also frequently used to validate the development of new consensus scoring functions [38,49,50,51,52,53,54].
An overview of the 102 protein-targets in DUD–E can be found in Figure 1, and Table 1 specifies the types of protein targets and the number of ligands and decoys in the dataset. DUD–E contains a wide variety of protein target types, including 26 kinases, 15 proteases, 11 nuclear receptors, 5 G protein-coupled receptor (GPCR), 2 ion channels, 2 cytochrome P450s, 36 other enzymes, and 5 miscellaneous proteins. About 18 of these proteins contain metal atoms, while the other 84 do not. Proteases, kinases, and metalloenzymes are the largest groups present in the DUD–E dataset and are the ones that were emphasized on in the discussion. We also included GPCRs, since this large class of proteins was structurally very similar and was the focus of many other studies. The results presented in this study here might guide the selection of the most adequate docking software for these specific families.
Using the DUD–E dataset, the performance of a scoring function in virtual screening could be expressed through a graphical representation of the true positive rate versus the false positive rate in terms of receiver operating characteristic (ROC) plots. In ROC plots, the true positive rate (TPR = TP/P) was plotted against the false positive rate (FPR = FP/N), where TP is the number of true positives, P is the total number of positives (actives), FP is the number of false positives, and N is the total number of negatives (decoys). A useful measure is the area under the curve (AUC). The higher the AUC value in a ROC curve, the better the discrimination between the true positive and the false positive poses.
As previously mentioned, a successful scoring function for virtual screening should rank active compounds very early on a large score list, so metrics that emphasize early recognition of ligands are normally used. One of such measures is the enrichment factor at 1% (abbreviated EF1%). This value measures the number of active ligands recovered at 1% of the ligand/decoy database, over the number of active ligands that should be expected at the same fraction of the database with random selection. Other values such as the EF20% were also used sometimes.
After an initial analysis of all the DUD–E targets, there was one (FGFR pdb:3C4F) that did not have the 1/50 proportion for active/decoys, so it was decided to exclude it from this test.
For each target in the DUD–E dataset, an initial analysis of the PDB file associated was performed. The binding pocket was studied and evaluated. Similar PDB structures with co-crystallized ligands were also inspected. Re-docking of the ligands for which there was a ligand-target structure available was performed with AutoDock and with Vina. The docking protocol for both programs was adjusted so as to reproduce the known experimental binding poses for each target, a standard protocol, when validating a docking program/protocol for a specific target [24], as presented in Figure 2. Parameters adjusted in this process with Vina included in the box size and position, number of generated binding modes and exhaustiveness. In AutoDock, the parameters optimized also included the box size and position, number of grid points and spacing, number of genetic algorithm (GA) runs, population size, maximum number of energy evaluations, and maximum number of generations. After the first optimization stage performed for each target, the box dimensions and center coordinates used for both AutoDock and Vina were the same. The exhaustiveness value used for Vina was 8. As for AutoDock, the grid spacing was set to 0.375 Å and the number of GA runs was set to 10. All this information is provided in Table S1 in the Supplementary Materials.
With the current protocol, the computational time for the virtual screening of the complete DUD–E dataset for Vina was of approximately 60 days in 24 CPUs. Calculations in AutoDock took on average 100 times more.
At the end of this stage an optimized docking protocol was selected for each target with each docking program. These protocols were used for the corresponding 101 protein targets to dock the associated ligands and decoys. For each target, ranked lists of ligands and decoys were prepared with AutoDock and Vina, based on the corresponding scores. These lists were used to determine the values of AUCs, EF1% and EF20%, allowing a comparison of the performance of the two docking programs in discriminating between ligands and decoys for each target. Average AUC, EF1% and EF20% were determined for the different families of protein targets and for the full 101 targets.
All protein targets were characterized in terms of the number of the total amino acid residues and molecular weight. The corresponding binding pockets were evaluated in terms of their percentage of hydrophobic, polar, and charged amino acid residues. Average AUC, EF1% and EF20% were determined for different classes of protein targets based on the protein’s size and type of residues at the binding pocket.
The Molecular Operating Environment (MOE) [55] program was used to calculate the chemical and structural properties for all ligands tested. Some of these properties were analyzed in more detail. Examples include the ligand’s molecular weight, volume, area, fraction of rotatable bonds, fraction of hydrophobic accessible surface area (FASA_H), fraction of polar accessible surface area (FASA_P), and fraction of positive and negative accessible surface areas (FASA+ and FASA−). Average AUC, EF(1%) and EF(20%) were also determined for the different classes of ligands based on the ligand’s size, fraction of rotatable bonds and electrostatic nature.

3. Results

3.1. Evaluation of the Performance of AutoDock and Vina

The chemical and structural properties of different proteins and enzymes can vary quite significantly, in features that include the nature, type, and range of interactions around the binding pocket, the pocket size and shape, and the exposure to solvent. Therefore, the challenges that such systems offer to docking and to virtual screening can also be quite different. Some programs and scoring functions are better able to capture some of these characteristics, while other show improved performance in targets with other features.
Table 2 compares the performance of AutoDock and Vina across the different classes of targets. The average results obtained for the set of 101 target showed that AutoDock and Vina exhibit a similar average performance in discriminating between ligands and decoys. In fact, the average EF1% values obtained were 7.6 and 8.9 for Vina and AutoDock, respectively (AUCs of 68.0 and 66.4). The EF1% values calculated for this extended data set show that these programs are able to rank in the top 1% of the total ligands (active and decoys) docked against each target, 7.6- and 8.9-times more active ligands than what would be expected from random selection, considering the relative percentage of actives and decoys available for each target.
However, the discrimination ability across different target classes could vary significantly. For GPCRs, for example, AutoDock exhibited superior discrimination ability, with an average EF1% of 16.6 against only 2.8 with VINA. AutoDock also demonstrated improved performance over Vina for Nuclear Receptors (EF1% of 18.4 versus 15.0). However, for kinases and metalloproteins the discrimination ability of Vina is on average better than that of AutoDock.
Figure 3, shows the average AUC values, calculated for the different target families. As previously mentioned, the higher the AUC, the better the discrimination ability between actives and decoys. AutoDock provided better results for GPCRs, ion channels, and nuclear receptors. Vina worked better for all the other families.
However, across large families of proteins there could be significant variations in the docking results, when looking into individual proteins. In the case of metalloenzymes, for example, Vina provided better results, on average. Analyzing each target in particular (Figure 4) it could be seen that for some targets the AutoDock performed significantly better. This might be explained by the fact that in this family there is a large variability of types of proteins as this group includes kinases, proteases, and others.
Table 3 analyzes the performance of AutoDock and VINA taking into consideration the number of amino acid residues that constitute the target. For smaller targets, the driving force for ligand-binding tends to be more concentrated in a smaller number of key specific residues. Additionally, the binding pockets tended to be smaller, or often more exposed to the solvent. On the other hand, in larger protein-targets, the range of interactions involved in ligand-binding tended to be larger and more diffused. In addition, the extra number of amino acid residues present in the larger targets could confer a more controlled environment to the corresponding binding pockets, shielding the interactions formed from the effect of the solvent. The non-specific protein environment could play a more important role for ligand-binding in these targets. Therefore, the number of amino acid residues that constituted the different targets could offer different trials for docking and virtual screening.
The results from Table 3 show that Vina was, on average, better in discriminating ligands from decoys in medium-sized targets, with 250 to 400 amino acid residues (average EF1% of 11.5, AUC 71.8). For targets with more than 400 amino acid residues, the performance of Vina was significantly lower (average EF1% of only 6.1, AUC of 65.4)
AutoDock exhibited a more uniform behavior, with average EF1% values in the range 7.9–9.4 for small (less than 250 aa) and large targets (more than 400 aa), resulting in an improved performance over Vina for the small targets (<250 aa) and the large targets (>400 aa).
Another important aspect regarding the nature of the target protein concerns the type of amino acid residues that constitute each binding pocket. For this analysis, all amino acid residues defining each binding pocket were grouped into polar, charged (negative and positive), and hydrophobic amino acid residues. Binding pockets were characterized based on the relative percentage of each of these types of residues. Average EF1% and AUC values were calculated with AutoDock and Vina for each category. The results are presented in Table 4.
The results presented in Table 4 showed that for poorly polar binding pockets (less than 25% of polar residues) AutoDock was on average better than Vina in discriminating between ligands and decoys, particularly among the top 1% of ranked solutions. For moderately polar and very polar binding pockets, Vina exhibited a better performance than AutoDock. The results also showed that both programs had more difficulty in discriminating ligands and decoys for very polar binding pockets (>35% of polar amino acid residues).
In terms of the percentage of hydrophobic residues, the results showed that Vina was significantly better than AutoDock in ligand/decoy discrimination for poorly hydrophobic binding pockets. As the percentage of hydrophobic residues at the binding pocket increased, the performance of Vina and AutoDock became increasingly similar, both in terms of EC1% and in terms of AUC values.
In terms of charge, the results showed that AutoDock was better in discriminating ligands and decoys in poorly charged binding pockets (<15%) than in moderate or highly charged ones. Vina, on the other hand, gave best results in highly charged binding pockets. These general tendencies concerning the presence of a charge at the binding pocket were also observed when particularly looking into positively charged residues or into negatively charged residues.
In general, these results showed that AutoDock was better in discriminating ligands and decoys in more hydrophobic, poorly polar, and poorly charged pockets, while Vina exhibited early recognition metrics that did not vary so significantly with the type of amino acid residues at the binding pocket. Vina tended to give better results for polar and charged binding pockets, which was particularly interesting, taking into consideration that the scoring function of Vina did not explicitly include charges, while that of AutoDock had an explicit electrostatic term.

3.2. Substrates

The type of molecule to be evaluated and its physico-chemical characteristics also offer different challenges for virtual screening, in terms of docking and its ability to discriminate between actives and decoys. For each specific target, the decoys included in the DUD–E were generated by having similar 1-D physico-chemical properties to the actives from which they originated, to remove bias [32]. Hence, to analyze how the different substrate properties affected the discriminating ability of each target, the physical properties of all actives identified in the ligands ranked as the top 1% were evaluated and compared with the other actives that were ranked the worst.
In this study, four fundamental properties of the ligands were analyzed—the size of the ligands, polarity, charge, and the number of rotatable bonds.
Figure 5 and Figure 6 present heat maps of the correlation between the substrate properties and their position in the ranking according to the type of target family (proteases and metalloenzymes, respectively). Darker red (+1) yield perfect positive correlation while darker blue (−1), yield perfect negative correlation. From Figure 5, it is clear that polarity and number of rotational bonds is important for both Vina and is even more distinct for AutoDock, since it presents a positive correlation, that is, as the ranking number increases, the polarity and number of rotational bonds also increase. This means that the molecules with more rotatable bonds and which are more polar, are ranked worst in the list. This leads to the conclusion that more polar and more flexible molecules present a bigger challenge for AutoDock, in particular. For metalloenzymes, the correlation profile is a little bit different from proteases. It is not easy to find a clear tendency because while some targets present a positive correlation for some property, others have a negative correlation for the same property. This could again be explained by the large variability of protein types in this particular family.

3.1.1. Influence of Molecular Weight

Figure 7 summarizes the variability of all molecules present in the DUD–E dataset, taking into account the molecular weight. The results showed that from the total of 22,321 active ligands considered for all 101 DUD–E targets, 7990 have a molecular weight below 400 Da, while 8833 have a molecular weight in the range of 400–500 Da, with 5498 with a molecular weight over 500 Da. The distribution of decoys across these ranges was the same, as they were generated automatically from the known ligands included.
Table 5 decomposes the number of ligands identified in the top 1% of compounds ranked, according to the molecular weight. AutoDock identified a total of 1935 actives in the top 1% of ligands, while in Vina, this number was of 2002. The results showed that Vina was, on average, better than AutoDock in identifying actives in the top 1% of small ligands (<400 MW) (536 versus 395 actives) and for large-sized ligands (>500 MW) (581 versus 497 actives). However, AutoDock was able to rank more medium-sized actives (400–500 MW) among the top 1% of the results (1043 versus 885).
Regarding each family of proteins, all exhibited the same tendency—smaller ligands were more difficult to discriminate and appeared at worst ranking positions for both Vina and AutoDock.
Figure 8 shows the influence of molecular weight on the average ranking distribution of the molecules within the full-ranked list determined for each protein target. The results showed that there was a similar tendency for both GPCR and kinase protein families, where the smaller ligands were ranked worst and the medium ligands were ranked better. For both GPCRs and kinases, AutoDock could rank smaller ligands better than Vina, even though their ranking position was relatively high. As for the medium-sized active molecules (300–400), these two families exhibited opposite results—while Vina provided better recognition for kinases, AutoDock was more effective in discriminating actives and decoys for GPCRs.

3.1.2. Influence of the Number of Rotational Bonds

Figure 9 presents the relative distribution of all active ligands in the DUD–E dataset taking into consideration the number of rotational bonds present. There is a higher prevalence in molecules with 4 to 7, and 8 to 11 rotational bonds, representing 73% of the dataset. The remaining 27% corresponds to molecules with 0 to 3 and higher than 12 rotational bonds.
Ligands with more rotatable bonds presented a higher challenge for docking because they could adopt a larger number of possible conformations. Discriminating actives with many rotatable bonds from decoys with many rotatable bonds hence became more difficult, because correctly identifying the real pose of the ligand was more challenging. Hence, ligands with a higher number of rotational bonds were placed at the worst position in the database, when comparing with the ligands with fewer rotatable bonds. In this study, this was observed for all studied families.
In Figure 10, the data for nuclear receptors and GPCRs are presented. For both families, AutoDock was able to rank more ligands early on. While in GPCRs there was a clear difference in the discrimination ability between Vina and AutoDock, for nuclear receptors, there was a similar behavior between both alternatives (exception—compounds with 4 rotatable bonds in nuclear receptors). According to our study, molecules with 5 to 10 rotational bonds ensured a better prediction with both AutoDock and Vina.

4. Discussion

AutoDock and Vina are efficient software alternatives for virtual screening, exhibiting on average similar performance when evaluating the ligand/decoy discriminating ability, across a large number of proteins. In spite of the similar average performance exhibited, both docking programs can present a marked difference when studying a particular protein target, or even when looking into proteins or enzymes from specific families, or for different types of ligands. Hence, for the common user wishing to embark in a virtual screening study, it is not easy to select a priori the alternative that should be used.
The goal of this study was to guide the selection of the docking software according to the type and characteristics of the target and its substrates. As demonstrated, the type of target, and specially the characteristics of the binding pocket could influence the outcome of the docking software. The results showed that AutoDock was clearly better in discriminating ligands and decoys in smaller targets, with more hydrophobic, poorly polar, and poorly charged pockets, while Vina tended to give better results for bigger targets with polar and charged binding pockets. According to the results presented, Vina provided better metrics for kinases, proteases, and cytochrome P450. On the other hand, ligand/decoy discrimination for GPCR, ion channels, and nuclear receptors was improved with AutoDock.
For the substrates, however, this analysis across 22,432 active compounds and 1,380,513 decoy molecules showed that AutoDock and Vina exhibited comparable trends with the ligands size, charge, and the number of rotatable bonds. Bigger, more flexible, and more polar ligands were more difficult to discriminate from decoys for both docking programs but the performance of Vina and AutoDock was quite similar.

5. Conclusions

While the present study offered useful guidelines that could help researchers to choose between AutoDock or Vina before starting a new virtual screening, according to the characteristics of their specific target, it also highlighted another important aspect. The performance of both programs could in some cases vary significantly, even for very similar proteins. Therefore, for very specific systems, it is recommended that researchers test both alternatives wisely, before starting a large virtual screening study.

Supplementary Materials

The following are available online at https://0-www-mdpi-com.brum.beds.ac.uk/2076-3417/9/21/4538/s1. Table S1: Docking parameters used for Vina and AutoDock; Figure S2: Comparison between the crystallographic (green) and “docked” (purple) poses for Vina and AutoDock to evaluate the influence of the number of rotational bonds in pose prediction. (a) Ligands with the lowest number of rotational bonds. (a1) Ligands with the highest number of rotational bonds.

Author Contributions

All the authors contributed equally for the realization of this work.

Funding

This work was supported by national funds from Fundação para a Ciência e a Tecnologia (SFRH/BD/137844/2018, and IF/00052/2014, and UID/Multi/04378/2019).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Kitchen, D.B.; Decornez, H.; Furr, J.R.; Bajorath, J. Docking and Scoring in Virtual Screening for drug discovery: Methods and applications. Nat. Rev. Drug Discov. 2004, 3, 935–949. [Google Scholar] [CrossRef] [PubMed]
  2. Kinnings, S.L.; Liu, N.; Buchmeier, N.; Tonge, P.J.; Xie, L.; Bourne, P.E. Drug discovery using chemical systems biology: Repositioning the safe medicine Comtan to treat multi-drug and extensively drug resistant tuberculosis. PLoS Comput. Biol. 2009, 5, e1000423. [Google Scholar] [CrossRef]
  3. Ma, D.L.; Chan, D.S.H.; Leung, C.H. Drug repositioning by structure-based virtual screening. Chem. Soc. Rev. 2013, 42, 2130–2141. [Google Scholar] [CrossRef] [PubMed]
  4. Govindaraj, R.G.; Naderi, M.; Singha, M.; Lemoine, J.; Brylinski, M. Large-scale computational drug repositioning to find treatments for rare diseases. NPJ Syst. Biol. Appl. 2018, 4, 1–10. [Google Scholar] [CrossRef] [PubMed]
  5. Sousa, F.S.; Fernandes, P.A.; Ramos, M.J. Protein—Ligand Docking: Current Status and Future Challenges. Proteins Struct. Funct. Bioinforma 2006, 26, 15–26. [Google Scholar] [CrossRef]
  6. Sousa, S.F.; Cerqueira, N.M.F.S.A.; Fernandes, P.A.; Ramos, M.J. Virtual screening in drug design and development. Comb. Chem. High Throughput Screen. 2010, 13, 442–453. [Google Scholar] [CrossRef]
  7. Lohning, A.E.; Levonis, S.M.; Williams-Noonan, B.; Schweiker, S.S. A Practical Guide to Molecular Docking and Homology Modelling for Medicinal Chemists. Curr. Top. Med. Chem. 2017, 17, 2023–2040. [Google Scholar] [CrossRef] [Green Version]
  8. Sousa, S.F.; Ribeiro Antonio, J.M.; Coimbra, J.T.S.; Neves, R.P.P.; Martins, S.A.; Moorthy, N.S.H.N.; Fernandes, P.A.; Ramos, M.J. Protein-Ligand Docking in the New Millennium—A Retrospective of 10 Years in the Field. Curr. Med. Chem. 2013, 20, 2296–2314. [Google Scholar] [CrossRef]
  9. Ferreira, L.G.; Dos Santos, R.N.; Oliva, G.; Andricopulo, A.D. Molecular docking and structure-based drug design strategies. Molecules 2015, 20, 13384–13421. [Google Scholar] [CrossRef]
  10. Kroemer, R.T. Structure-based drug design: Docking and scoring. Curr. Protein Pept. Sci. 2007, 8, 312–328. [Google Scholar] [CrossRef]
  11. Taylor, R.D.; Jewsbury, P.J.; Essex, J.W. A review of protein-small molecule docking methods. J. Comput. Aided Mol. Des. 2002, 16, 151–166. [Google Scholar] [CrossRef] [PubMed]
  12. Huang, S.; Zou, X. Advances and challenges in Protein-ligand docking. Int. J. Mol. Sci. 2010, 11, 3016–3034. [Google Scholar] [CrossRef] [PubMed]
  13. Zhou, Z.; Cheng, T.; Li, Q.; Bryant, S.H.; Wang, Y. Structure-Based Virtual Screening for Drug Discovery: A Problem-Centric Review. AAPS J. 2012, 14, 133–141. [Google Scholar]
  14. Ban, T.; Ohue, M.; Akiyama, Y. Multiple grid arrangement improves ligand docking with unknown binding sites: Application to the inverse docking problem. Comput. Biol. Chem. 2018, 73, 139–146. [Google Scholar] [CrossRef] [PubMed]
  15. Totrov, M.; Abagyan, R. Flexible ligand docking to multiple receptor conformations: A practical alternative. Curr. Opin. Struct. Biol. 2008, 18, 178–184. [Google Scholar] [CrossRef]
  16. Huang, S.-Y. Comprehensive assessment of flexible-ligand docking algorithms: Current effectiveness and challenges. Brief. Bioinform. 2018, 19, 982–994. [Google Scholar] [CrossRef]
  17. Kong, X.; Sun, H.; Pan, P.; Zhu, F.; Chang, S.; Xu, L.; Li, Y.; Hou, T. Importance of protein flexibility in molecular recognition: A case study on Type-I1/2 inhibitors of ALK. Phys. Chem. Chem. Phys. 2018, 20, 4851–4863. [Google Scholar] [CrossRef]
  18. Sahai, M.A.; Biggin, P.C. Quantifying water-mediated protein-ligand interactions in a glutamate receptor: A DFT study. J. Phys. Chem. B 2011, 115, 7085–7096. [Google Scholar] [CrossRef]
  19. Munawar, S.; Vandenberg, J.I.; Jabeen, I. Molecular Docking Guided Grid-Independent Descriptor Analysis to Probe the Impact of Water Molecules on Conformational Changes of hERG Inhibitors in Drug Trapping Phenomenon. Int. J. Mol. Sci. 2019, 20, 3385. [Google Scholar] [CrossRef]
  20. Rentzsch, R.; Renard, B.Y. Docking small peptides remains a great challenge: An assessment using AutoDock Vina. Brief. Bioinform. 2015, 16, 1045–1056. [Google Scholar] [CrossRef]
  21. Hauser, A.S.; Windshügel, B. LEADS-PEP: A Benchmark Data Set for Assessment of Peptide Docking Performance. J. Chem. Inf. Model. 2016, 56, 188–200. [Google Scholar] [CrossRef] [PubMed]
  22. Isa, D.M.; Chin, S.P.; Chong, W.L.; Zain, S.M.; Rahman, N.A.; Lee, V.S. Dynamics and binding interactions of peptide inhibitors of dengue virus entry. J. Biol. Phys. 2019, 45, 63–76. [Google Scholar] [CrossRef] [PubMed]
  23. Cerqueira, N.M.; Gesto, D.; Oliveira, E.F.; Santos-Martins, D.; Brás, N.F.; Sousa, S.F.; Fernandes, P.A.; Ramos, M.J. Receptor-based virtual screening protocol for drug discovery. Arch. Biochem. Biophys. 2015, 582, 56–67. [Google Scholar] [CrossRef] [PubMed]
  24. Ramírez, D.; Caballero, J. Is it reliable to use common molecular docking methods for comparing the binding affinities of enantiomer pairs for their protein target? Int. J. Mol. Sci. 2016, 17, 525. [Google Scholar] [CrossRef]
  25. Muegge, I.; Rarey, M. Small Molecule Docking and Scoring. In Reviews in Computational Chemistry; Kenny Lipkowitz, K.B., Boyd, D.B., Eds.; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2001; Volume 17, pp. 1–60. [Google Scholar]
  26. Cole, J.C.; Murray, C.W.; Nissink, J.W.M.; Taylor, R.D.; Taylor, R. Comparing protein-ligand docking programs is difficult. Proteins Struct. Funct. Bioinform. 2005, 60, 325–332. [Google Scholar] [CrossRef]
  27. Kontoyianni, M.; McClellan, L.M.; Sokol, G.S. Evaluation of Docking Performance: Comparative Data on Docking Algorithms. J. Med. Chem. 2004, 47, 558–565. [Google Scholar] [CrossRef]
  28. Xu, W.; Lucke, A.J.; Fairlie, D.P. Comparing sixteen scoring functions for predicting biological activities of ligands for protein targets. J. Mol. Graph. Model. 2015, 57, 76–88. [Google Scholar] [CrossRef] [Green Version]
  29. Wang, Z.; Sun, H.; Yao, X.; Li, D.; Xu, L.; Li, Y.; Tian, S.; Hou, T. Comprehensive evaluation of ten docking programs on a diverse set of protein-ligand complexes: The prediction accuracy of sampling power and scoring power. Phys. Chem. Chem. Phys. 2016, 18, 12964–12975. [Google Scholar] [CrossRef]
  30. Vieira, T.F.; Magalhaes, R.; Sousa, S.F. Tailoring specialized scoring functions for more efficient virtual screening. Front. Drug, Chem. Clin. Res. 2019, 2, 1–4. [Google Scholar]
  31. Mysinger, M.M.; Carchia, M.; Irwin, J.J.; Shoichet, B.K. Directory of Useful Decoys, Enhanced (DUD-E): Better Ligands and Decoys for Better Benchmarking. J. Med. Chem. 2012, 55, 6582–6594. [Google Scholar] [CrossRef]
  32. Bartuzi, D.; Kaczor, A.; Targowska-Duda, K.; Matosiuk, D. Recent Advances and Applications of Molecular Docking to G Protein-Coupled Receptors. Molecules 2017, 22, 340. [Google Scholar] [CrossRef] [PubMed]
  33. Seong, S.H.; Ali, M.Y.; Kim, H.-R.; Jung, H.A.; Choi, J.S. BACE1 inhibitory activity and molecular docking analysis of meroterpenoids from Sargassum serratifolium. Bioorg. Med. Chem. 2017, 25, 3964–3970. [Google Scholar] [CrossRef] [PubMed]
  34. Nisha, C.M.; Kumar, A.; Vimal, A.; Bai, B.M.; Pal, D.; Kumar, A. Docking and ADMET prediction of few GSK-3 inhibitors divulges 6-bromoindirubin-3-oxime as a potential inhibitor. J. Mol. Graph. Model. 2016, 65, 100–107. [Google Scholar] [CrossRef] [PubMed]
  35. Ravindranath, P.A.; Forli, S.; Goodsell, D.S.; Olson, A.J.; Sanner, M.F. AutoDockFR: Advances in Protein-Ligand Docking with Explicitly Specified Binding Site Flexibility. PLoS Comput. Biol. 2015, 11, e1004586. [Google Scholar] [CrossRef] [PubMed]
  36. Labbé, C.M.; Rey, J.; Lagorce, D.; Vavrusa, M.; Becot, J.; Sperandio, O.; Villoutreix, B.O.; Tufféry, P.; Miteva, M.A. MTiOpenScreen: A web server for structure-based virtual screening. Nucleic Acids Res. 2015, 43, W448–W454. [Google Scholar] [CrossRef] [PubMed]
  37. Koebel, M.R.; Schmadeke, G.; Posner, R.G.; Sirimulla, S. AutoDock VinaXB: Implementation of XBSF, new empirical halogen bond scoring function, into AutoDock Vina. J. Cheminform. 2016, 8, 27. [Google Scholar] [CrossRef]
  38. Quiroga, R.; Villarreal, M.A. Vinardo: A scoring function based on autodock vina improves scoring, docking, and virtual screening. PLoS ONE 2016, 11, e0155183. [Google Scholar] [CrossRef]
  39. Di Muzio, E.; Toti, D.; Polticelli, F. DockingApp: A user-friendly interface for facilitated docking simulations with AutoDock Vina. J. Comput. Aided Mol. Des. 2017, 31, 213–218. [Google Scholar] [CrossRef]
  40. Morris, G.M.; Huey, R.; Lindstrom, W.; Sanner, M.F.; Belew, R.K.; Goodsell, D.S.; Olson, A.J. Software news and updates AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. J. Comput. Chem. 2009, 30, 2785–2791. [Google Scholar] [CrossRef]
  41. Morris, G.M.; Goodsell, D.S.; Huey, R.; Olson, A.J. Distributed automated docking of flexible ligands to proteins: Parallel applications of AutoDock 2.4. J. Comput. Aided Mol. Des. 1996, 10, 293–304. [Google Scholar] [CrossRef]
  42. Goodsell, D.S.; Morris, G.M.; Olson, A.J. Automated docking of flexible ligands: Applications of AutoDock. J. Mol. Recognit. 1996, 9, 1–5. [Google Scholar] [CrossRef]
  43. Morris, G.M.; Goodsell, D.S.; Halliday, R.S.; Huey, R.; Hart, W.E.; Belew, R.K.; Olson, A.J. Automated Docking Using a Lamarckian Genetic Algorithm and an Empirical Binding Free Energy Function. J. Comput. Chem. 1998, 19, 1639–1662. [Google Scholar] [CrossRef]
  44. Trott, O.; Olson, A.J. AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization and multithreading. J. Comput. Chem. 2010, 31, 455–461. [Google Scholar] [CrossRef] [PubMed]
  45. Jaghoori, M.M.; Bleijlevens, B.; Olabarriaga, S.D. 1001 Ways to run AutoDock Vina for virtual screening. J. Comput. Aided Mol. Des. 2016, 30, 237–249. [Google Scholar] [CrossRef] [PubMed]
  46. Coleman, R.G.; Carchia, M.; Sterling, T.; Irwin, J.J.; Shoichet, B.K. Ligand Pose and Orientational Sampling in Molecular Docking. PLoS ONE 2013, 10, e75992. [Google Scholar] [CrossRef]
  47. Allen, W.J.; Balius, T.E.; Mukherjee, S.; Brozell, S.R.; Moustakas, D.T.; Lang, P.T.; Case, D.A.; Kuntz, I.D.; Rizzo, R.C. DOCK 6: Impact of new features and current docking performance. J. Comput. Chem. 2015, 36, 1132–1156. [Google Scholar] [CrossRef] [Green Version]
  48. Chaput, L.; Martinez-Sanz, J.; Saettel, N.; Mouawad, L. Benchmark of four popular virtual screening programs: Construction of the active/decoy dataset remains a major determinant of measured performance. J. Cheminform. 2016, 8, 1–17. [Google Scholar] [CrossRef]
  49. Uehara, S.; Tanaka, S. AutoDock-GIST: Incorporating Thermodynamics of Active-Site Water into Scoring Function for Accurate Protein-Ligand Docking. Molecules 2016, 21, 1604. [Google Scholar] [CrossRef]
  50. Ericksen, S.S.; Wu, H.; Zhang, H.; Michael, L.A.; Newton, M.A.; Hoffmann, F.M.; Wildman, S.A. Machine Learning Consensus Scoring Improves Performance Across Targets in Structure-Based Virtual Screening. J. Chem. Inf. Model. 2017, 57, 1579–1590. [Google Scholar] [CrossRef] [Green Version]
  51. Zhou, H.; Cao, H.; Skolnick, J. FINDSITE comb2.0: A New Approach for Virtual Ligand Screening of Proteins and Virtual Target Screening of Biomolecules. J. Chem. Inf. Model. 2018, 58, 2343–2354. [Google Scholar] [CrossRef]
  52. Ebejer, J.P.; Finn, P.W.; Wong, W.K.; Deane, C.M.; Morris, G.M. Ligity: A Non-Superpositional, Knowledge-Based Approach to Virtual Screening. J. Chem. Inf. Model. 2019, 59, 2600–2616. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  53. Wang, D.; Cui, C.; Ding, X.; Xiong, Z.; Zheng, M.; Luo, X.; Jiang, H.; Chen, K. Improving the Virtual Screening Ability of Target-Specific Scoring Functions Using Deep Learning Methods. Front. Pharmacol. 2019, 10. [Google Scholar] [CrossRef] [PubMed]
  54. Chen, P.; Ke, Y.; Lu, Y.; Du, Y.; Li, J.; Yan, H.; Zhao, H.; Zhou, Y.; Yang, Y. DLIGAND2: An improved knowledge-based energy function for protein–ligand interactions using the distance-scaled, finite, ideal-gas reference state. J. Cheminform. 2019, 11, 52. [Google Scholar] [CrossRef] [PubMed]
  55. Molecular Operating Environment (MOE), 2013.08; Chemical Computing Group ULC, Montreal, Canada. 2019. Available online: https://www.chemcomp.com/index.htm (accessed on 18 October 2019).
Figure 1. Overview of the protein target families of the Directory of Useful Decoys–Enhanced (DUD–E) dataset.
Figure 1. Overview of the protein target families of the Directory of Useful Decoys–Enhanced (DUD–E) dataset.
Applsci 09 04538 g001
Figure 2. Comparison between the crystallographic (green) and “docked” (purple) poses of (a) glucocorticoid receptor (GCR) Autodock, (a1) GCR Vina, (b) CAH2 AutoDock, (b1) CAH2 Vina, (c) TYSY AutoDock, and (c1) TYSY Vina.
Figure 2. Comparison between the crystallographic (green) and “docked” (purple) poses of (a) glucocorticoid receptor (GCR) Autodock, (a1) GCR Vina, (b) CAH2 AutoDock, (b1) CAH2 Vina, (c) TYSY AutoDock, and (c1) TYSY Vina.
Applsci 09 04538 g002
Figure 3. Average AUC calculated for the different family targets, obtained with Vina and AutoDock.
Figure 3. Average AUC calculated for the different family targets, obtained with Vina and AutoDock.
Applsci 09 04538 g003
Figure 4. Average AUC calculated for the various metalloenzymes, obtained with Vina and AutoDock.
Figure 4. Average AUC calculated for the various metalloenzymes, obtained with Vina and AutoDock.
Applsci 09 04538 g004
Figure 5. Heat map correlation for proteases between Vina and AutoDock scores and ligand properties, such as fraction of hydrophobic accessible surface area (FASA_H), fraction of polar accessible surface area (FASA_P), fraction of positive and negative accessible surface areas (FASA+ and FASA−), molecular weight (MW), and number of rotational bonds (Rot. B.).
Figure 5. Heat map correlation for proteases between Vina and AutoDock scores and ligand properties, such as fraction of hydrophobic accessible surface area (FASA_H), fraction of polar accessible surface area (FASA_P), fraction of positive and negative accessible surface areas (FASA+ and FASA−), molecular weight (MW), and number of rotational bonds (Rot. B.).
Applsci 09 04538 g005
Figure 6. Heat map correlation for metalloenzymes between Vina and AutoDock scores and ligand properties, such as fraction of hydrophobic accessible surface area (FASA_H), fraction of polar accessible surface area (FASA_P), fraction of positive and negative accessible surface areas (FASA+ and FASA−), molecular weight (MW), and number of rotational bonds (Rot. B.).
Figure 6. Heat map correlation for metalloenzymes between Vina and AutoDock scores and ligand properties, such as fraction of hydrophobic accessible surface area (FASA_H), fraction of polar accessible surface area (FASA_P), fraction of positive and negative accessible surface areas (FASA+ and FASA−), molecular weight (MW), and number of rotational bonds (Rot. B.).
Applsci 09 04538 g006
Figure 7. Molecular weight of all active molecules present in the DUD–E database.
Figure 7. Molecular weight of all active molecules present in the DUD–E database.
Applsci 09 04538 g007
Figure 8. Influence of MW on (a) GPCR and (b) Kinase ligands ranking position with Vina and AutoDock.
Figure 8. Influence of MW on (a) GPCR and (b) Kinase ligands ranking position with Vina and AutoDock.
Applsci 09 04538 g008
Figure 9. Relative distribution of active molecules present in the DUD–E database by the number of rotational bonds.
Figure 9. Relative distribution of active molecules present in the DUD–E database by the number of rotational bonds.
Applsci 09 04538 g009
Figure 10. Influence of the number of rotational bonds of the active molecules in the ranking position for nuclear receptors and GPCRs.
Figure 10. Influence of the number of rotational bonds of the active molecules in the ranking position for nuclear receptors and GPCRs.
Applsci 09 04538 g010
Table 1. Protein targets evaluated in this study as part of DUD–E with indication of the different classes of targets considered.
Table 1. Protein targets evaluated in this study as part of DUD–E with indication of the different classes of targets considered.
Target ClassDUD–E CodePDB CodeProtein NameLigandsDecoysMetal
cytochrome P450CP2C91R9OCytochrome P450 2C91207446
CP3A43NXUCytochrome P450 3A417011,796
G protein-coupled receptorAA2AR3EMLAdenosine A2a receptor48231,498
ADRB12VT4Beta-1 adrenergic receptor24715,843
ADRB23NY8Beta-2 adrenergic receptor23114,994
CXCR43ODUC-X-C chemokine receptor type 4403406
DRD33PBLDopamine D3 receptor48034,022
Ion channelGRIA23KGCglutamate receptor ionotropic AMPA215811,832
GRIK11VSOglutamate receptor ionotropic kainate 11016547
KinasesABL12HZITyrosine-protein kinase ABL18210,750
AKT13CQWSerine/threonine-protein kinase AKT29316,426Mn2+
AKT23D0ESerine/threonine-protein kinase AKT21176893
BRAF3D4QSerine/threonine-protein kinase B-raf1529942
CDK21H00Cyclin-dependent kinase 247427,830
CSF1R3KRJMacrophage colony stimulating factor receptor16612,144
EGFR2RGPEpidermal growth factor receptor erbB154535,020
FAK13BZ3Focal adhesion kinase 11005350
IGF1R2OJ9Insulin-like growth factor I receptor1489291
JAK23LPBTyrosine-protein kinase JAK21076495
KIT3G0EStem cell growth factor receptor16610,447
KITH2B8TThymidine kinase572850
KPCB2I0EProtein kinase C beta1358692
LCK2OF2Tyrosine-protein kinase LCK42027,374
MAPK23M2WMAP kinase-activated protein kinase 21016147
MET3LQ8Hepatocyte growth factor receptor16611,240
MK012OJGMAP kinase ERK2794548
MK102ZDTc-Jun N-terminal kinase 31046599
MK142QD9MAP kinase p38 alpha57835,810
MP2K13EQHDual specificity mitogen-activated protein kinase 11218147Mg2+
PLK12OWBSerine/threonine-protein kinase 1076797Zn2+
ROCK12ETRRho-associated protein kinase 11006297
SRC3EL8tyrosine-protein kinase SRC52434,454
TGFR13HMMTGF-beta receptor type I1338498
VGFR22P2IVascular endothelial growth factor receptor 240924,927
WEE13BIZSerine/threonine-protein kinase 1026148
Nuclear receptorANDR2AM9Androgen Receptor26914,344
ESR11SJ0Estrogen receptor alpha38320,663
ESR22FSZEstrogen receptor beta36720,182
GCR3BQDglucocorticoid receptor25814,987
MCR2AA2Mineralocorticoid receptor945146
PPARA2P54Peroxisome proliferator-activated receptor alpha37319,356
PPARD2ZNPPeroxisome proliferator-activated receptor delta24012,223
PPARG2GTKPeroxisome proliferator-activated receptor gamma48425,256
PRGR3KBAProgesterone receptor29315,642
RXRA1MV9retinoid X receptor alpha1316935
THB1Q4XThyroid hormone receptor beta-I1037441
ProteasesACE3BKLAngiotensin-converting enzyme28216,864Zn2+
ADA172OI0protease53235,809Zn2+
BACE13L5DBeta-secretase 128318,082
CASP32CNKCaspase-319910,692
DPP42I78Dipeptidyl peptidase IV53340,916
FA103Kl6Coagulation factor X53720,023
FA71W7XCoagulation factor VII1146245
HIVPR1XL2human immunodeficiency virus type 1 protease53635,688
LKHA43CHPLeukotriene A4 hydrolase1719448Zn2+
MMP13830CMatrix metalloproteinase 1357237,126Zn2+
RENI3G6ZRenin1046956
THRB1YPEThrombin46126,948
TRY12AYWTrypsin I44925,914
TRYB12ZECTryptase beta-I1487643
UROK1SQTUrokinase-type plasminogen activator1629841
MiscellaneousAMPC1L2SBeta-lactamase482832
HIVRT3NF7human immunodeficiency virus type 1 integrase1006644
KIF113CJOKinesin-like protein 11166848Mg2+
OtherACES1_e66Acetylcholinesterase45326,234
ADA2E1WAdenosine deaminase935449Zn2+
ALDR2HV5Aldose reductase1598995
AOFB1S3BMonoamine oxidase B1226900
CAH21BCDCarbonic anhydrase II49231,132Zn2+
COMT3BWMCatechol O-methyltransferase413848Mg2+
DEF1LRUPeptide deformylase1025696Zn2+
DHI13FRJ11-beta-hydroxysteroid dehydrogenase 133019,340
FGFR13C4FFibroblast growth factor receptor 11394206
DYR3NXODihydrofolate reductase23117,170
FABP42NNQFatty acid binding protein adipocyte472749
FKB1A1J4HFK506-binding protein 1A1115800
FNTA3E37protein farnesyltransferase/geranyl genaryltransferase type I alpha subunit59251,430Zn2+
FPPS1ZW5Farnesyl diphosphate synthase858822Mg2+
GLCM2VF3beta glucocerebrosidade543799
HDAC23MAXhistone deacetylase 218510,300Zn2+
HDAC83F07histone deacetylase 817010,448Zn2+
HIVINT3NF7human immunodeficiency virus type 1 integrase1006644
HMDH3CCWHMG-CoA reductase1708743
HS90A1UYGheat shock protein HSP 90-alpha884848
HXK43F0Mhexokinase type IV924696
INHA2H7LEnoyl-[acyl-carrier-protein] reductase442300
ITAL2ICALeukocyte adhesion glycoprotein LFA-1 alpha1388487
NOS11QW6Nitric-oxide synthase, brain1008050Zn2+
NRAM1B9VNeuraminidase986199
PA2GA1KVOPhospholipase A2 group IIA995146
PARP13L3MPoly [ADP-ribose] polymerase-150830,035
PDE5A1UDTPhosphodiesterase 5A39827,521
PGH12OYUCyclooxygenase-119510,797
PGH23LN1Cyclooxygenase-243523,135
PNPH3BGSPurine nucleoside phosphorylase1036950
PTN12AZRProtein-tyrosine phosphatase 1B1307243
PUR21NJSGAR transformylase502694
PYGM1C8KMuscle glycogen phosphorylase773940
PYRD1D3GDihydroorotate dehydrogenase1116446
SAHH1LI4adenosylhomocysteinase633450
TYSY1SYNThymidylate synthase1096738
XIAP3HL5Inhibitor of apoptosis protein 31005145Zn2+
Metallo-enzymesMP2K13EQHDual specificity mitogen-activated protein kinase 11218147Mg2+
ACE3BKLAngiotensin-converting enzyme28216,864Zn2+
AKT13CQWSerine/threonine-protein kinase AKT29316,426Mn2+
ADA172OI0protease53235,809Zn2+
MMP13830CMatrix metalloproteinase 1357237,126Zn2+
PLK12OWBSerine/threonine-protein kinase 1076797Zn2+
CAH21BCDCarbonic anhydrase II49231,132Zn2+
LKHA43CHPLeukotriene A4 hydrolase1719448Zn2+
FNTA3_e37protein farnesyltransferase/geranyl genaryltransferase type I alpha subunit59251,430Zn2+
KIF113CJOKinesin-like protein 11166848Mg2+
ADA2E1WAdenosine deaminase935449Zn2+
COMT3BWMCatechol O-methyltransferase413848Mg2+
NOS11QW6Nitric-oxide synthase, brain1008050Zn2+
DEF1LRUPeptide deformylase1025696Zn2+
FPPS1ZW5Farnesyl diphosphate synthase858822Mg2+
HDAC23MAXhistone deacetylase 218510,300Zn2+
HDAC83F07histone deacetylase 817010,448Zn2+
XIAP3HL5Inhibitor of apoptosis protein 31005145Zn2+
Table 2. Performance of Vina and AutoDock in the discrimination between ligands and decoys for different classes of targets.
Table 2. Performance of Vina and AutoDock in the discrimination between ligands and decoys for different classes of targets.
VinaAutoDock
Target ClassTargetsActivesDecoysEF1%EF20%AUCEF1%EF20%AUC
cytochrome P450229019,2423.1 ± 0.81.4 ± 0.357.6 ± 0.62.9 ± 1.81.0 ± 0.150.4 ± 0.4
G protein-coupled receptor5148099,7632.8 ± 2.81.9 ± 0.964.9 ± 6.616.6 ± 13.73.2 ± 1.378.5 ± 15.4
ion channel225918,3794.2 ± 3.12.6 ± 0.966.7 ± 10.34.7 ± 3.12.8 ± 0.175.3 ± 4.4
kinases235065317,74613.3 ± 12.02.8 ± 0.875.2 ± 9.65.7 ± 10.21.7 ± 0.862.1 ± 13.0
nuclear receptor112995162,17515.0 ± 8.63.0 ± 0.773.5 ± 10.418.4 ± 13.13.3 ± 0.879.0 ± 8.8
Proteases 113526208,9485.3 ± 4.92.6 ± 0.772.1 ± 8.86.9 ± 7.31.7 ± 0.959.6 ± 10.0
other294663276,4759.3 ± 8.92.3 ± 1.069.6 ± 12.96.8 ± 8.32.1 ± 1.265.1 ± 16.0
metalloenzymes184154277,7858.2 ± 10.12.2 ± 1.164.6 ± 15.18.8 ± 10.72.2 ± 1.361.2 ± 21.9
non-metalloenzymes8318,2781,102,72810.0 ± 9.62.6 ± 0.971.4 ± 10.98.51 ± 10.52.2 ± 1.166.1 ± 14.7
Total/Average 10122,4321,380,5137.6 ± 4.72.4 ± 0.568.0 ± 5.88.9 ± 5.6 2.3 ± 0.8 66.4 ± 10.2
Table 3. Performance of Vina and AutoDock in the discrimination between ligands and decoys for the targets of different sizes based on the number of amino acid residues of each target (number of aa).
Table 3. Performance of Vina and AutoDock in the discrimination between ligands and decoys for the targets of different sizes based on the number of amino acid residues of each target (number of aa).
Target Size (Number of aa)VinaAutoDock
EF1%AUCEF1%AUC
Small (0–250 aa)8.6 ± 10.170.7 ± 11.09.4 ± 12.366.6 ± 16.0
Medium (250–400 aa)11.5 ± 10.071.8 ± 12.57.9 ± 9.764.3 ± 15.8
Large (>400 aa)6.1 ± 7.065.4 ± 11.08.9 ± 10.665.8 ± 18.2
Table 4. Performance of Vina and AutoDock in the discrimination between ligands and decoys for targets with binding pockets with different percentages of polar, charged, and hydrophobic amino acid residues.
Table 4. Performance of Vina and AutoDock in the discrimination between ligands and decoys for targets with binding pockets with different percentages of polar, charged, and hydrophobic amino acid residues.
VinaAutoDock
PolarityNumber of TargetsEF1%AUCEF1%AUC
Poorly Polar (0–25%)259.9 ± 8.8668.1 ± 12.911.7 ± 12.568.7 ± 15.7
Moderately Polar (25–35%)3610.7 ± 9.3271.7 ± 11.78.0 ± 9.563.7 ± 16.4
Very Polar (>35%)408.6 ± 10.670.0 ± 13.17.0 ± 9.864.2 ± 16.5
Hydrophobicity
Poorly Hydrophobic (0–30%)3811.1 ± 11.572.4 ± 13.48.5 ± 10.166.7 ± 17.2
Moderately Hydrophobic (30–40%)258.3 ± 8.568.4 ± 13.38.2 ± 13.362.9 ± 18.1
Very Hydrophobic (>40%)389.1 ± 8.469.0 ± 9.38.8 ± 9.365.0 ± 14.2
Charge
Poorly Charged (0–15%)409.9 ± 9.169.2 ± 11.410.2 ± 10.968.8 ± 15.5
Moderately Charged (15–20%)247.7 ± 8.668.2 ± 11.95.8 ± 8.755.1 ± 17.5
Very Charged (>20%)3710.7 ± 10.972.4 ± 12.78.5 ± 10.967.5 ± 13.8
Positive Charge
Poorly Positive (0–5%)2410.2 ± 8.567.9 ± 11.612.6 ± 11.668.8 ± 15.6
Moderately Positive (5–10%)348.5 ± 10.869.6 ± 14.26.9 ± 11.162.1 ± 17.5
Very Positive (>10%)4310.5 ± 9.472.6 ± 9.77.9 ± 8.865.4 ± 15.3
Negative Charge
Poorly Negative (0–5%)2211.0 ± 9.570.8 ± 9.811.7 ± 10.773.5 ± 11.9
Moderately Negative (5–10%)448.1 ± 8.370.2 ± 11.87.2 ± 10.262.2 ± 17.4
Very Negative (>10%)3510.9 ± 11.269.7 ± 13.88.2 ± 10.763.5 ± 10.7
Table 5. Influence of the molecular weight for actives found in the top 1% of the database.
Table 5. Influence of the molecular weight for actives found in the top 1% of the database.
VinaAutoDock
No. of Ligands within Top 1%No. of Ligands within Top 1%
Total2002 (8.9%)1935 (8.6%)
<400 Da536 (2.4%) 395 (1.8%)
400–500 Da885 (3.9%) 1043 (4.7%)
>500 Da581 (2.6%) 497 (2.2%)

Share and Cite

MDPI and ACS Style

Vieira, T.F.; Sousa, S.F. Comparing AutoDock and Vina in Ligand/Decoy Discrimination for Virtual Screening. Appl. Sci. 2019, 9, 4538. https://0-doi-org.brum.beds.ac.uk/10.3390/app9214538

AMA Style

Vieira TF, Sousa SF. Comparing AutoDock and Vina in Ligand/Decoy Discrimination for Virtual Screening. Applied Sciences. 2019; 9(21):4538. https://0-doi-org.brum.beds.ac.uk/10.3390/app9214538

Chicago/Turabian Style

Vieira, Tatiana F., and Sérgio F. Sousa. 2019. "Comparing AutoDock and Vina in Ligand/Decoy Discrimination for Virtual Screening" Applied Sciences 9, no. 21: 4538. https://0-doi-org.brum.beds.ac.uk/10.3390/app9214538

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop