Next Article in Journal
The Potential Role of the Piwi Gene in the Development and Reproduction of Plutella xylostella
Next Article in Special Issue
Honey Bee Larval Hemolymph as a Source of Key Nutrients and Proteins Offers a Promising Medium for Varroa destructor Artificial Rearing
Previous Article in Journal
Functional Evidence of CCDC186 as a New Disease-Associated Gene with Endocrine and Central Nervous System Alterations
Previous Article in Special Issue
The Proteome and Lipidome of Extracellular Vesicles from Haemonchus contortus to Underpin Explorations of Host–Parasite Cross–Talk
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genome-Wide Analysis of Haemonchus contortus Proteases and Protease Inhibitors Using Advanced Informatics Provides Insights into Parasite Biology and Host–Parasite Interactions

1
Melbourne Veterinary School, Faculty of Science, The University of Melbourne, Parkville, VIC 3010, Australia
2
Department of Data Science and AI, Faculty of IT, Monash University, Melbourne, VIC 3800, Australia
3
Department of Biochemistry and Molecular Biology, Biomedicine Discovery Institute, Monash University, Melbourne, VIC 3800, Australia
4
Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
*
Authors to whom correspondence should be addressed.
Int. J. Mol. Sci. 2023, 24(15), 12320; https://0-doi-org.brum.beds.ac.uk/10.3390/ijms241512320
Submission received: 20 June 2023 / Revised: 24 July 2023 / Accepted: 24 July 2023 / Published: 1 August 2023
(This article belongs to the Special Issue Parasite Biology and Host-Parasite Interactions)

Abstract

:
Biodiversity within the animal kingdom is associated with extensive molecular diversity. The expansion of genomic, transcriptomic and proteomic data sets for invertebrate groups and species with unique biological traits necessitates reliable in silico tools for the accurate identification and annotation of molecules and molecular groups. However, conventional tools are inadequate for lesser-known organismal groups, such as eukaryotic pathogens (parasites), so that improved approaches are urgently needed. Here, we established a combined sequence- and structure-based workflow system to harness well-curated publicly available data sets and resources to identify, classify and annotate proteases and protease inhibitors of a highly pathogenic parasitic roundworm (nematode) of global relevance, called Haemonchus contortus (barber’s pole worm). This workflow performed markedly better than conventional, sequence-based classification and annotation alone and allowed the first genome-wide characterisation of protease and protease inhibitor genes and gene products in this worm. In total, we identified 790 genes encoding 860 proteases and protease inhibitors representing 83 gene families. The proteins inferred included 280 metallo-, 145 cysteine, 142 serine, 121 aspartic and 81 “mixed” proteases as well as 91 protease inhibitors, all of which had marked physicochemical diversity and inferred involvements in >400 biological processes or pathways. A detailed investigation revealed a remarkable expansion of some protease or inhibitor gene families, which are likely linked to parasitism (e.g., host–parasite interactions, immunomodulation and blood-feeding) and exhibit stage- or sex-specific transcription profiles. This investigation provides a solid foundation for detailed explorations of the structures and functions of proteases and protease inhibitors of H. contortus and related nematodes, and it could assist in the discovery of new drug or vaccine targets against infections or diseases.

1. Introduction

Proteases are enzymes that play vital roles in breaking down proteins into peptides or amino acids through the cleavage of peptide bonds and/or the catalysation of hydrolysis reactions [1]. They are actively involved in many crucial biological processes in organisms, such as immune responses [2,3], the digestion of food proteins [4] and programmed cell death (apoptosis) [5,6]. Moreover, the molecular and functional variations in proteases (and their inhibitors) in different species are inextricably linked to the evolution and biodiversity of organisms [1,7].
Proteases can be divided into the following main classes: asparagine, aspartic, cysteine, glutamic, metallo-, serine, threonine and ‘mixed’ proteases [8]. The online MEROPS database (https://www.ebi.ac.uk/merops/index.shtml; accessed on 15 February 2023); ref. [9]) is a comprehensive repository of information on proteases, such as classifications (classes), identifiers, structures, cleavage sites and protease inhibitors, enabling the study of many aspects of proteases [8]. Investigations of proteases of eukaryotic parasites are particularly important because of the critical roles that these enzymes play in pathogen invasion and tissue penetration [10], parasite growth, development and reproduction [11], as well as the acquisition/digestion of nutrients by parasites [12] and the modulation of immune responses and disease by parasites in host animals [13,14].
Research on parasite-derived proteases has been an active area for more than 70 years (reviewed by [15,16,17,18,19,20,21]). Recent advances in this field have expanded our understanding of the diversity and specificity of parasite-derived proteases, with the identification of unique proteases in various species of parasitic helminths, such as Angiostrongylus cantonensis [22], Ascaris suum [23] and Meloidogyne incognita [24]. These studies have highlighted the potential of proteases as potential drug and vaccine targets for the development of novel interventions against parasites (e.g., [18,21]). The use of advanced molecular technologies and informatics [25,26] is now providing improved insights into parasite-derived proteases, such as their classification and involvement in biological processes, as well as host–parasite interactions, with potential implications for the design of protease inhibitors.
Some of our research focuses on parasitic helminths with significant veterinary economic impact, particularly nematodes such as Haemonchus contortus (e.g., [27,28,29,30,31,32]). This parasite infects particularly small ruminants worldwide, and it causes substantial disease (haemonchosis) and production losses to the farming and animal industries [33,34,35]. Current control of haemonchosis is heavily reliant on anthelmintics, and the available vaccine does not protect animals for extended periods [36]. In addition, drug resistance in H. contortus poses a significant threat and demands the discovery of new interventions [37,38], which might be guided by molecular investigations.
There have been major advances in the genomics, transcriptomics and proteomics of H. contortus [39,40,41,42,43,44,45,46,47,48], allowing detailed explorations of this parasite at the molecular level and revealing distinct biological mechanisms and/or critical pathways [32,49]. Some recently developed informatic pipelines have demonstrated utility for analyses of molecular data sets [25,50] as well as protein annotation [51] and the prediction of essential genes [52]. In this context, exploring the proteases and associated inhibitors in H. contortus using abundant omics data sets and advanced informatics could contribute to finding parasite-specific targets.
In contrast to model organisms (e.g., Caenorhabditis elegans and Drosophila melanogaster), which possess a vast array of proteases [53,54,55], there has been no profound, systematic classification of proteases of H. contortus at the whole-genome level, despite previous studies of selected groups of proteases using traditional molecular techniques (e.g., [56,57,58,59,60]). There has been limited information on the transcription/expression of protease genes in H. contortus across different developmental stages and both sexes as well as the structures of proteases and their inhibitors, and conventional informatic methods [61] have not been sufficiently reliable for genome-wide identification and annotation of proteases and their inhibitors. In this study, we aimed to systematically identify, classify and annotate proteases and protease inhibitors encoded in the H. contortus genome using an advanced informatic workflow (employing a sequence- and structure-based strategy), paving the way for a better understanding of these molecules and providing potential for the discovery of new intervention (drug or vaccine) targets.

2. Results

2.1. Proteases and Protease Inhibitors Encoded in the H. contortus Genome

Using a combined sequence- and structure-based approach (Figure 1), we identified 790 genes in the H. contortus genome encoding 860 proteases and protease inhibitors (including isoforms), which were assigned to 83 distinct families (Table 1). These molecules included 280 metallo-, 145 cysteine, 142 serine, 121 aspartic and 81 ‘mixed’ proteases as well as 91 protease inhibitors (Figure 1). Of these 860 proteins, 346 (40.2%) were identified using both sequence-based and structure-based methods, whereas 55 (6.4%) proteins were identified using a structure-based and 459 (53.4%) employing a sequence-based method alone (Figure 1). Of all 860 protease and protease inhibitors inferred, 256 proteins had been identified in previous LC–MS/MS-based studies of the somatic and excretory/secretory proteomes of H. contortus [43,44], and 53 that were phosphorylated (Table S1) might play a role in the regulation of protease activity.
The density of genes encoding proteases or protease inhibitors was highest on chromosome chr1 (n = 196) and lowest on the sex chromosome chrX (n = 55) (Figure S1). Metallo-protease genes (n > 50) were shown to be abundant on individual chromosomes, except chr1, in which cysteine protease genes predominated (n = 77) (Figure S1).

2.2. Proteases and Protease Inhibitors Represent at Least 83 Gene Families

In H. contortus, proteases are represented by a diverse range of gene families. Metallo-proteases (n = 280) are encoded by 21 gene families. The top three gene families account for more than half of all metallo-proteases, with the M12 family being the largest (130; 46.4%), followed by families M13 and M1. In contrast, aspartic proteases are represented by four gene families, with family A1 comprising 81% (98) of them (Figure 1). Cysteine proteases (n = 145) were encoded by 20 gene families; more than half of these proteases belong to family C1, and the remainder are in 11 other families—with one protease per family. Serine proteases were relatively evenly distributed among the 12 gene families, except for family S9 with most serine proteases (n = 51; 35.9%) (Figure 1). Serine and mixed proteases both represent 12 gene families; most of the latter proteases (n = 81) belong to families S1 and T1. Protease inhibitors represent 14 gene families, with I2, I4 and I25 containing most protease inhibitors (i.e., 40.7%, 12.1% and 8.8%, respectively) (Figure 1).
Figure 1. Identification and classification of proteases and protease inhibitors encoded in the Haemonchus contortus genome employing a combined sequence- and structure-based approach. Using curated sequence data from the MEROPS database, we identified 805 encoded proteins based on sequence homology (left). Based on structural information available in the MEROPS database, we downloaded protein structures of proteases and protease inhibitors from the PDB database and used US-align to identify 401 proteins (right). Protein structures were predicted using the program AlphaFold2; only structures with a pLDDT value of >70 were retained. By merging the results obtained via sequence and structure searches, we identified a total of 860 proteins, 346 of which were identified based on both sequence and structure. These 860 proteins represented five distinct classes of proteases (pie charts: 280 metallo-, 145 cysteine, 142 serine, 121 aspartic and 81 ‘mixed’ proteases) as well as 91 protease inhibitors (green). The pie charts in the figure show the different gene families within individual classes; the top 10 gene families are designated in black.
Figure 1. Identification and classification of proteases and protease inhibitors encoded in the Haemonchus contortus genome employing a combined sequence- and structure-based approach. Using curated sequence data from the MEROPS database, we identified 805 encoded proteins based on sequence homology (left). Based on structural information available in the MEROPS database, we downloaded protein structures of proteases and protease inhibitors from the PDB database and used US-align to identify 401 proteins (right). Protein structures were predicted using the program AlphaFold2; only structures with a pLDDT value of >70 were retained. By merging the results obtained via sequence and structure searches, we identified a total of 860 proteins, 346 of which were identified based on both sequence and structure. These 860 proteins represented five distinct classes of proteases (pie charts: 280 metallo-, 145 cysteine, 142 serine, 121 aspartic and 81 ‘mixed’ proteases) as well as 91 protease inhibitors (green). The pie charts in the figure show the different gene families within individual classes; the top 10 gene families are designated in black.
Ijms 24 12320 g001

2.3. Protease Groups Have Divergent Biophysical Properties

Individual biophysical properties predicted for H. contortus proteases vary significantly, depending on their classification. Aspartic proteases had the lowest average molecular weight (46.75 kDa) and pI (5.78), whereas metallo-proteases had the highest pI (7.11) and the lowest mean charge (−1.63) at pH 7.0. Cysteine proteases had the lowest average GRAVY (−0.40) and the broadest range of charge values (−79.06 to 36.3) at pH 7.0. Serine proteases had an average pI (6.89) and mean GRAVY (−0.22), and mixed proteases had a slightly lower mean GRAVY (−0.25) than cysteine proteases. Moreover, protease inhibitors shared similar molecular weights and pIs to proteases, but exhibited a broader range of charges (−96.51 to 29.66) at pH 7.0 (Table S1).

2.4. Marked Variability in the Presence/Absence of a Signal Peptide

The number of proteins with signal peptides varied among the different groups of proteases. Almost half of all metallo-proteases (i.e., 134 of 280) had signal peptides, followed by 66.1% aspartic proteases (80 of 121). Cysteine and serine proteases had fewer proteins with signal peptides—i.e., 50.3% (73 of 145) and 28.9% (41 of 142), respectively. The mixed proteases had the least proteins with signal peptides (24 of 81) (Figure 2). These findings suggest that the presence/absence of signal peptides could relate to different biological roles and/or transport of distinct protein groups, and signal peptides were more prevalent in some groups than others (e.g., 66.1% for aspartic vs. 28.9% for serine proteases). The percentage of protein inhibitors with signal peptides (56.0%; 51 of 91) was similar to that of cysteine proteases (50.3%; 73 of 145) (Figure 2).

2.5. GO Annotations and Key Pathways

In total, 790 H. contortus genes encoding proteases or protease inhibitors were assigned 5560 GO terms (level 2) linked to 19 biological processes (BPs), three cellular components (CCs) and 14 molecular functions (MFs) (Table S2). Metabolic processes (GO:0008152) and cellular anatomical entity (GO:0110165) were the largest sub-categories in BP and CC, respectively. For MF, catalytic activity (GO:0003824) and binding (GO:0005488) were the predominant annotated functions (Figure 2).
Of the 790 H. contortus genes encoding proteases and protease inhibitors, 340 have C. elegans orthologues encoding 11 (9.6%) aspartic, 48 (36.1%) serine, 111 (42.9%) metallo-, 58 (80.6%) mixed, 70 (55.12%) serine proteases and 42 (50.0%) protease inhibitors (Table S1). An analysis revealed that these orthologous genes are involved in 409 distinct pathways or processes, including a key subset (~15%) involving E3 ubiquitin ligase ubiquitinate target proteins (R-HSA-8866654), protein ubiquitination (R-HSA-8852135), post-translational protein modification (R-HSA-597592) and protein metabolism (R-HSA-392499) (Table S3).

2.6. Diverse Transcription and Stage- and Sex-Specific Patterns

The transcriptional profiles of genes encoding proteases and protease inhibitors of H. contortus were investigated across all key developmental stages/sexes, including egg, L1, L2 and L3, L4f, L4m, Af and Am. Metallo-protease genes were represented in four main clusters, based on markedly distinct transcription patterns. Specifically, 39 genes were highly transcribed in parasitic stages (i.e., L4f, L4m, Af and Am), 28 genes displayed male-specific transcription (in L4m and Am), 11 genes exhibited relatively high transcription levels in the egg stage and 28 genes had a high transcription level in the L2 stage. Notably, most genes transcribed in parasitic stages (L4f, L4m, Af and Am) encoded excretory/secretory (ES) proteins, whereas those with high transcription in free-living stages (egg, L1, L2 and L3) encoded proteins that are not ES proteins (Figure 3).
Almost half of all cysteine protease genes (n = 61) exhibited high transcription in the parasitic stages (i.e., L4f, L4m, Af and Am), 90% of which encode ES proteins. Two gene clusters were identified, with five genes exhibiting transcriptional activity in the egg, and five genes with high transcription in the L1 stage (Figure 3).
In contrast to metallo-protease and cysteine protease genes, there were no serine protease gene clusters exhibiting high transcription in the parasitic stages of H. contortus. However, three distinct clusters of six genes each exhibiting high, male-specific transcription in L4m and Am, 17 genes exhibiting high transcription in L1, and 31 genes displaying high transcription in the L2 and L3 stages were identified. Most (76.5%) of these 17 protease genes with high transcription levels in the L1 stage encoded ES proteins, whereas most such genes with high expression in the L2 and L3 stages encoded non-ES proteins (Figure 3).
The transcription profiles of aspartic protease genes were diverse among the different developmental stages/sexes of H. contortus. Twelve and 38 genes exhibited high transcription in the parasitic female (i.e., L4f and Af) and male (i.e., L4m and Am) stages. Additionally, 17 genes exhibited high transcription in both sexes of the parasitic stages (i.e., L4f, L4m, Af and Am), nine genes in L1, and seven genes in the L3 stage (Figure 3).
For the ‘mixed’ protease genes, clusters were not as clearly defined as for other protease genes. However, 27 and 7 genes were highly transcribed in the egg and L1 stages, respectively. The former group represented predominantly ES proteins, whereas the latter represented mainly non-ES proteins. Intriguingly, most (83.3%) mixed protease genes were transcribed in free-living stages, which contrasts the findings for other protease gene groups. These findings suggest that mixed protease genes might play key roles, particularly in free-living stages of H. contortus (Figure 3).
Protease inhibitor genes also exhibited conspicuous transcription profile differences among developmental stages, with three distinctive gene clusters observed. Specifically, 10 inhibitor genes displayed high transcription in male (L4m and Am) parasitic stages, most of which coded for ES proteins; 13 genes were highly transcribed in the egg, half of which code for ES proteins; and five genes were highly transcribed in the L1 stage (Figure 3).

3. Discussion

Proteases have consistently been a focus in biological research due to their potential as vaccine and therapeutic targets [62,63,64], particularly in the context of controlling parasitic infections or diseases [18,65,66]. Although numerous studies have employed molecular techniques to identify and investigate particular proteases of H. contortus, such as Hc-CBL (HC58), Hc-CPL-1 and Hc-CBP-1 [59,67,68], there had been no comprehensive, systematic analysis of these proteins. Recent advances in genomics, transcriptomics, proteomics [43,44,45,46,47,48] and informatics, including artificial intelligence-based structural modelling [25,69,70,71], have created new avenues for the characterisation of proteases encoded within the H. contortus genome. Thus, using the curated data available in MEROPS database, we employed a combination of sequence- and structure-based methods to identify 790 genes encoding 860 proteases or protease inhibitors (including distinct isoforms) in the H. contortus genome. The proteases were shown to belong to 83 distinct gene families and displayed considerable variation in biophysical properties and in the presence of signal peptides. These findings offer valuable insights into the diversity of H. contortus proteases and their inhibitors, establishing a foundation for functional investigations of these molecules and for discovering potential novel drug or vaccine targets.

3.1. Combining Structure- and Sequence-Based Methods Enhances the Identification of Proteases and Their Inhibitors

As distinct from conventional methods that rely solely on sequence-based identification [9,24], the present study integrates sequence homology and structural similarity to accurately identify proteases and their inhibitors in H. contortus. Using this approach, we were able to detect 805 homologous proteins based on sequence similarity alone, and 401 proteins using structural similarity. Using both techniques, 860 proteins were identified, markedly enhancing the overall number (cf. Section 2.1).
Of the 860 proteins identified, 346 exhibited homologous relationships with proteins in the MEROPS database at both the sequence and structure levels [8]. For instance, protein HCON_00016690-00002 had a high sequence similarity (E-value < 10−109) with the metallo-protease MER0001185 in this database, and is an orthologue of C. elegans WBGene00009865 [72,73]. Additionally, it exhibits a structural similarity of TM = 0.96 with human endoplasmic reticulum aminopeptidase 2YD0 in the PDB database [74]. This information provides confidence about the identity of this molecule.
We discovered 55 proteins based on structural similarity alone. These proteins lacked homologues at the sequence level, but matched proteins with known structures [75,76,77] in MEROPS. For instance, protein HCON_00150390-00001 did not share Diamond/BLASTP homology to a protein in the MEROPS sequence search, but its predicted three-dimensional structure matched protease inhibitor BIRC-7 (TM = 0.93), clearly indicating that it is a protease inhibitor [78]. This result shows that a structure-based method can sometimes overcome limitations inherent in a sequence-based identification method [79,80,81].
The identification of 459 proteins exclusively using the sequence-based method highlights a limitation of structure-based identification alone [82,83,84], which may relate to the incompleteness of protein structure databases [85,86,87,88]. On the one hand, not all proteases in the MEROPS database are linked to structural information. On the other hand, only 66% of structures of H. contortus proteins predicted by AlphaFold2 were of an acceptable quality (pLDDT of > 70). Thus, we propose that a combined sequence- and structure-based approach achieves superior protein identification. Nonetheless, it is crucial that databases and informatic approaches are continually updated to allow high-quality identification and subsequent in silico classification and functional annotation.

3.2. Key Protease Gene Families Associate with Key Biological Functions in Parasite Invasion and Feeding as Well as Host–Parasite Interactions, Immunomodulation and Parasitism

The distribution of H. contortus proteases (according to families) was similar to that observed in the MEROPS database for C. elegans and other model metazoans [8], which provides evidence in support of the hypothesis that the ‘protease system’ has remained relatively constant throughout evolution (cf. [24,89,90]). However, we also noted some differences/discrepancies, particularly with respect to particular gene families that possess many members (e.g., A01, C01 and M12). This may relate to gene family expansions during the course of evolution [91,92,93], which would/could have facilitated adaptation to the unique parasitic lifestyle of H. contortus. Therefore, members of these families may be potential intervention targets, warranting future investigation.
The aspartic protease family A01 of H. contortus consists of a remarkably large number of proteins—a total of 98 (see Section 2.2). Such proteases possess the ability to cleave peptide bonds using two aspartic acid residues located in their active sites [94]. Previous studies of model species (e.g., C. elegans and D. melanogaster) revealed that this family includes pepsin, renin and cathepsin D [95,96,97]. In parasites, aspartic proteases of family A01 can play critical roles in the processing of surface proteins required for host cell invasion and/or immune evasion [98,99]. Additionally, these proteases contribute to the degradation of host proteins, which helps the parasites obtain nutrients from their host environment [100]. In helminths, these proteases are vital in breaking down host proteins, such as antibodies, facilitating a parasite’s invasion and establishment in the host animal [101,102,103]. This information indicates that these A01 proteases are critical for the H. contortus parasitic life. In addition, the transcriptional data also support this observation, where many H. contortus A01 protease genes encode ES proteins and are highly transcribed/expressed in the L4 and adult stages (i.e., Af and Am; Figure 2), indicating a role in host invasion and survival within the host.
Of the cysteine proteases identified in H. contortus, family C01 contains the largest number of proteins—a total of 84 (see Section 2.2). C01 proteases are known for their diverse functions in protein-turnover, antigen processing, apoptosis, inflammation and pathogenesis [104,105,106]. One of the significant characteristics of C01 proteases in parasites is their high degree of diversity [107,108]. For example, blood flukes, such as Schistosoma mansoni, have large numbers of C01 proteases that differ markedly in their sequence, structure and substrate specificity (e.g., [20,109]). This diversity allows parasites to efficiently degrade and utilise a wide range of host proteins and tissues [110,111]. Interestingly, almost all C01 family proteases identified in H. contortus are ES proteins, whose genes are highly transcribed in the parasitic stage (Figure 2). This finding indicates that H. contortus secretes C01 family proteases, which could act as virulence factors by degrading host proteins, disrupting host tissues and modulating host immune responses in the host animal (cf. [19,67]).
Metallo-protease family M12 contains the largest number of proteases in H. contortus, with a total of 130 members (cf. Section 2.2). This family of proteases is characterised by the presence of a conserved zinc-binding motif in their catalytic domains and plays crucial roles in various biological processes, including tissue remodelling and/or the regulation of signalling pathways [112,113]. In helminths, a variety of M12 proteases have been inferred or shown to be involved in the degradation of host tissues and in immune evasion [114]. In H. contortus, genes encoding M12 proteases were shown to be highly transcribed in all developmental stages, indicating important roles throughout the life cycle of this parasite (Figure 2).
Interestingly, some proteases belonging to the families A01, C01 and M12 are known to play a critical role in the blood-feeding pathways of parasites, as they participate in the proteolytic cascade of haemoglobin degradation [115,116]. It has been postulated that proteases of the family A01 represent the initial step in host haemoglobin degradation [18], such that host haemoglobin is first targeted by aspartic proteases and then degraded into smaller peptides by cysteine proteases (including those of family C01) [117]. This process is followed by further degradation into smaller peptides by metallo-proteases, including those of family M12 [12]. As haemoglobin degradation serves as the primary source of nutrition for H. contortus and is vital for its survival in the parasitic phase within the host, the expanded number of M12 protease genes might be explained by the parasite’s need for a diverse array of proteases to maintain a ‘stability’ in blood-feeding pathway and to suppress immune attack by the host animal (cf. [18,115]). The disruption or interruption of the functions of such proteases markedly affects a haematophagous parasite’s ability to acquire critical nutrients from blood, which is why previous efforts have targeted selected proteases as vaccine molecules or candidates (e.g., [118,119,120,121,122,123,124]). For example, the metallo-protease (family M01) represented by HCON_00156280 [125] is a membrane-bound glycoprotein H11 and a known vaccine molecule. When assessed as a vaccine, H11 reduced total worm numbers of adult H. contortus in vaccinated sheep by 70–80%, and associated egg numbers in faeces by >90% [119,125,126,127,128]. H11 is also a component of Barbervax®—the first licensed, native anti-H. contortus vaccine, launched in Australia (https://barbervax.com/; refs. [125,129,130]). Thus, there is clear merit in pursuing further research on metallo-proteases of H. contortus to identify well-defined immunogens, recognising that, sometimes, immunogenicity might be linked to post-translational (e.g., carbohydrate) moieties [131].

3.3. The Biological Relevance and Meaning of Diverse Transcriptional Profiles for Protease and Protease Inhibitor Genes—Significance and Implications

The genes that encode proteases and protease inhibitors in H. contortus displayed varied transcription patterns among developmental stages (Section 2.6). In free-living (i.e., egg, L1, L2 and L3) stages, transcription levels of genes encoding, for example, M12, C12, C13, A22, and T01 proteases, were high, particularly in the egg stage, suggesting key roles in embryonic development (cf. [132]). In the L1, L2 and L3 stages, genes encoding, for example, M12 proteases, have been proposed to be linked to larval growth and development and/or moulting [133,134].
In parasitic stages (i.e., L4f, L4m, Af and Am), more than 100 genes encoding metallo-, cysteine, serine and aspartic proteases (Section 2.6) were highly transcribed, contrasting the distinctly different profiles of orthologues in the free-living nematode C. elegans (see [135,136]), and are thought to be linked to parasite invasion, establishment and/or survival. These genes encoded predominantly metallo-proteases (particularly family M12), cysteine proteases (mostly family C01) and aspartic proteases (largely family A01) (Table S1), many of which are likely involved in immunomodulation [59,60] or in the blood-feeding pathway, which has been the subject of significant previous investigation in H. contortus (Section 3.2; [67,137,138]). Moreover, metallo-, cysteine and aspartic proteases, such as MEP1, HMCP-1, HMCP-4, HMCP-6, pep1 and H11, have been recognised as key participants in the haemoglobin degradation process [56,115,120,121,139,140,141], which makes many of them possible vaccine and drug target candidates.
In female parasitic stages (L4f and Af) of H. contortus, highly transcribed genes encoding aspartic proteases of family A01 and cysteine proteases of family C19 (Table S1), such as the ubiquitin-specific proteases (USPs), highly likely play a key role in regulating the ubiquitin-proteasome system (UPS), which are critically involved in protein turnover [32,142]. These proteases appear to be implicated in processes linked to sexual development, maturation and/or reproduction [143], which is why they may represent targets for the disruption of these genes/proteases (cf. [144]). In male parasitic stages (L4m and Am) of H. contortus, highly transcribed protease genes encoded predominantly aspartic proteases (family A01), serine proteases (families S09 and S12) and metallo-proteases (family M18) (Section 2.6). Given their apparent male-specificity, we propose that these molecules play one or more critical role(s) in male reproductive development or sperm production in this worm, supported by published evidence of the involvement of similar proteases in sperm maturation and function in various other eukaryotic organisms [145,146,147]. This information on sex-enriched transcription/expression of particular proteases provides a clear stimulus to explore the roles of these proteins in the reproductive processes in H. contortus, and also to assess their potential as intervention targets.

4. Materials and Methods

4.1. Identification, Classification and Annotation of Proteases and Protease Inhibitors of H. contortus

The latest genome and proteome of H. contortus (ISE strain; ref. [48]) were downloaded from WormBase ParaSite (version WBPS17; accession: PRJEB506; https://parasite.wormbase.org/; accessed on 15 February 2023; ref. [148]). Relevant information (including identifiers, classes and families) of proteases (including aspartic, cysteine, glutamic, “mixed”, metallo-, asparagine, serine and threonine representatives) was retrieved from the MEROPS database (https://www.ebi.ac.uk/merops/; accessed on 15 February 2023; ref. [8]) to assist the identification, classification and annotation of proteases and their inhibitors in H. contortus. Both sequence- and structure-based search methods were utilised for this purpose.
For the sequence-based strategy, the protease and protease inhibitor protein sequences from the MEROPS database were used as queries to identify homologues in H. contortus using the program BLAST v.2.12.0 [149], employing a sequence identity threshold (E-value) for BLAST of <10−5. InterProScan v.5.55-88.0 [150] was used to detect relevant Pfam [151] and PANTHER [152] domains using default settings. For the structure-based strategy, known protease and protease inhibitor structures—collected from, and documented in, MEROPS, and downloaded from PDB [153]—were used as queries to search against our own, in-house H. contortus protein structure database to establish structural homology using US-align v.20220904 [154]. US-align’s threshold (TM-score) was set at >0.5. The structures of mature proteases were predicted using AlphaFold2 [70]. Individual proteins and their gene models were manually inspected and curated [49]. Functional annotation was achieved employing a recently established pipeline, with all proteins being assigned gene ontology (GO) terms [51]. Excretory/secretory (ES) proteins were inferred according to previous studies [43,51].

4.2. Prediction of Biophysical Properties and Sequence Features

The physicochemical properties of identified proteases and protease inhibitors, including molecular weight (in kDa), isoelectric point (pI), charge (at pH 7.0), grand average of hydropathy (GRAVY) and instability index (II), were estimated using EXPASy (https://www.expasy.org/; refs. [155,156]) and Biopython (https://biopython.org/) [157]. Signal peptides were predicted using SignalP v.6.0 [158]. Conserved domains in proteins were investigated using InterProScan v.5.55-88.0 [150]. Protease-encoding genes identified were mapped to the H. contortus chromosome (ISE strain; accession: PRJEB506; ref. [48]) and displayed using TBtools v.1.09876 [159].

4.3. Detection of Orthologues and Pathway Analysis

OrthoFinder v.2.5.4 [160] was used to find orthologues of H. contortus proteases and protease inhibitors in C. elegans. Based on the information from C. elegans orthologues, we inferred pathway information linked to orthologues using the Reactome Pathway Knowledgebase (https://reactome.org/; ref. [161]).

4.4. Gene Transcription and Protein Expression

Transcription profiles for key developmental stages/sexes—i.e., egg, first-stage larva (L1), second-stage larva (L2), third-stage larva (L3), female (L4f) and male (L4m) fourth-stage larva (L4) and female (Af) and male (Am) adult—were available from previous studies [40,162]. RNA-seq data were processed and analysed using trim_galore v.0.6.5 (https://zenodo.org/badge/latestdoi/62039322, accessed on 15 February 2023), SAMtools v.1.13 [163], MultiQC v.1.10 [164], HISAT2 v.2.2.0 [165] and StringTie v.2.0.5 [165], and the transcript abundances of identified protease genes, expressed as fragments per kilobase per million (FPKM) [32], were calculated. Custom Python scripts were used to establish normalised transcription levels of individual genes.
The expression levels of proteases and protease inhibitors in H. contortus across different developmental stages were available from a previous study [43,44]. Proteome discoverer software [166] was used to search mass spectrometric data for the egg, L3, L4 and adult stages. Peptides were identified using a false discovery rate (FDR) cut-off of <1%, and peptide intensities were calculated with Spectronaut software. At least two matching peptides were required for expression to be recorded, and protein expression levels were expressed as ‘intensities’. Protein phosphorylation was explored with reference to published data sets [45,49].

5. Conclusions

Genetic and genomic variability both within and among species in the Tree of Life present major challenges for the accurate identification, characterisation, classification and functional annotation of molecules in different taxonomic groups of animals, particularly invertebrates. In order to overcome such challenges associated with proteases and protease inhibitors, we established here an effective bioinformatic workflow that utilises a combination of both sequence- and structure-based in silico methods. We employed curated genomic, transcriptomic and proteomic data sets for the well-characterised model parasitic nematode, H. contortus, to effectively identify, classify and functionally annotate proteases and protease inhibitors of this species on a genome-wide scale. Although established and critically assessed here for H. contortus, this workflow should now find broad applicability to any eukaryotic pathogen, enabling both fundamental biodiscovery and the search for new intervention targets in socio-economically important pathogens.

Supplementary Materials

The following supporting information can be downloaded at: https://0-www-mdpi-com.brum.beds.ac.uk/article/10.3390/ijms241512320/s1.

Author Contributions

Y.Z. contributed to the concept framework, design and implementation, collection of data, analyses and the interpretation of findings. He wrote the initial draft of the paper and edited it based on suggestions and comments from co-authors. N.D.Y. played a supervisory role and contributed to planning and conceptual framework, provided guidance and suggestions, critically appraised findings and interpretation. J.S. played a supervisory role and provided suggestions and comments on the final draft manuscript. R.B.G. played a supervisory role, contributed to the planning and conceptualising of the framework and the drafting, editing and revising of the manuscript, and oversaw and funded the project. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by grants from the Australian Research Council (ARC) to N.D.Y. (LP180101334) and R.B.G. (LP180101085 and LP190101209). Y.Z. received support from both a Melbourne Research Scholarship and a Rowden White Award.

Institutional Review Board Statement

This study was conducted in accordance with the Declaration of Helsinki, and was approved by the Institutional Review Board (Animal Ethics Committee) of The University of Melbourne (permit no. 1714374).

Informed Consent Statement

Not applicable.

Data Availability Statement

The predicted protein structure data are accessible via https://figshare.com/s/f8cff0e89caa31ff98e9 (accessed on 30 May 2023).

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  1. Bond, J.S. Proteases: History, discovery, and roles in health and disease. J. Biol. Chem. 2019, 294, 1643–1651. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Colbert, J.D.; Matthews, S.P.; Miller, G.; Watts, C. Diverse regulatory roles for lysosomal proteases in the immune response. Eur. J. Immunol. 2009, 39, 2955–2965. [Google Scholar] [CrossRef] [PubMed]
  3. Seth, S.; Batra, J.; Srinivasan, S. COVID-19: Targeting proteases in viral invasion and host immune response. Front. Mol. Biosci. 2020, 9, 215. [Google Scholar]
  4. Reyes, J.; Ayala-Chavez, C.; Sharma, A.; Pham, M.; Nuss, A.B.; Gulia-Nuss, M. Blood digestion by trypsin-like serine proteases in the replete lyme disease vector tick, Ixodes scapularis. Insects 2020, 11, 201. [Google Scholar] [CrossRef] [Green Version]
  5. Salvesen, G.S.; Hempel, A.; Coll, N.S. Protease signaling in animal and plant-regulated cell death. FEBS J. 2016, 283, 2577–2598. [Google Scholar] [CrossRef] [Green Version]
  6. Stael, S.; Van Breusegem, F.; Gevaert, K.; Nowack, M.K. Plant proteases and programmed cell death. J. Exp. Bot. 2019, 70, 1991–1995. [Google Scholar] [CrossRef] [Green Version]
  7. López-Otín, C.; Bond, J.S. Proteases: Multifunctional enzymes in life and disease. J. Biol. Chem. 2008, 283, 30433–30437. [Google Scholar] [CrossRef] [Green Version]
  8. Rawlings, N.D.; Barrett, A.J.; Thomas, P.D.; Huang, X.; Bateman, A.; Finn, R.D. The MEROPS database of proteolytic enzymes, their substrates and inhibitors in 2017 and a comparison with peptidases in the PANTHER database. Nucleic Acids Res. 2018, 46, 624–632. [Google Scholar] [CrossRef] [Green Version]
  9. Rawlings, N.D.; Bateman, A. How to use the MEROPS database and website to help understand peptidase specificity. Protein Sci. 2021, 30, 83–92. [Google Scholar] [CrossRef]
  10. Blackman, M.J. Proteases in host cell invasion by the malaria parasite. Cell. Microbiol. 2004, 6, 893–903. [Google Scholar] [CrossRef]
  11. Alves, C.R.; de Souza, R.S.; dos Santos Charret, K.; de Castro Côrtes, L.M.; de Sá-Silva, M.P.; Barral-Veloso, L.; Oliveira, L.F.G.; da Silva, F.S. Understanding serine proteases implications on Leishmania spp lifecycle. Exp. Parasitol. 2018, 184, 67–81. [Google Scholar] [CrossRef]
  12. Williamson, A.L.; Lecchi, P.; Turk, B.E.; Choe, Y.; Hotez, P.J.; McKerrow, J.H.; Cantley, L.C.; Sajid, M.; Craik, C.S.; Loukas, A. A multi-enzyme cascade of hemoglobin proteolysis in the intestine of blood-feeding hookworms. J. Biol. Chem. 2004, 279, 35950–35957. [Google Scholar] [CrossRef] [Green Version]
  13. Hewitson, J.P.; Grainger, J.R.; Maizels, R.M. Helminth immunoregulation: The role of parasite secreted proteins in modulating host immunity. Mol. Biochem. Parasitol. 2009, 167, 1–11. [Google Scholar] [CrossRef] [PubMed]
  14. Donnelly, S.; Dalton, J.P.; Robinson, M.W. How pathogen-derived cysteine proteases modulate host immune responses. Adv. Exp. Med. Biol. 2011, 712, 192–207. [Google Scholar] [PubMed]
  15. McKerrow, J.H. Parasite proteases. Exp. Parasitol. 1989, 68, 111–115. [Google Scholar] [CrossRef]
  16. Trap, C.; Boireau, P. Proteases in helminthic parasites. Vet. Res. 2000, 31, 461–471. [Google Scholar] [CrossRef] [Green Version]
  17. Yatsuda, A.P.; Bakker, N.; Krijgsveld, J.; Knox, D.P.; Heck, A.J.R.; De Vries, E. Identification of secreted cysteine proteases from the parasitic nematode Haemonchus contortus detected by biotinylated inhibitors. Infect. Immun. 2006, 74, 1989–1993. [Google Scholar] [CrossRef] [Green Version]
  18. Knox, D. Proteases in blood-feeding nematodes and their potential as vaccine candidates. Adv. Exp. Med. Biol. 2011, 712, 155–176. [Google Scholar] [PubMed]
  19. Caffrey, C.R.; Goupil, L.; Rebello, K.M.; Dalton, J.P.; Smith, D. Cysteine proteases as digestive enzymes in parasitic helminths. PLoS Negl. Trop. Dis. 2018, 12, e0005840. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  20. Grote, A.; Caffrey, C.R.; Rebello, K.M.; Smith, D.; Dalton, J.P.; Lustigman, S. Cysteine proteases during larval migration and development of helminths in their final host. PLoS Negl. Trop. Dis. 2018, 12, e0005919. [Google Scholar] [CrossRef] [Green Version]
  21. Alama-Bermejo, G.; Bartošová-Sojková, P.; Atkinson, S.D.; Holzer, A.S.; Bartholomew, J.L. Proteases as therapeutic targets against the parasitic cnidarian Ceratonova shasta: Characterization of molecules key to parasite virulence in salmonid hosts. Front. Cell. Infect. Microbiol. 2022, 11, 804864. [Google Scholar] [CrossRef]
  22. Cheng, M.; Yang, X.; Li, Z.; He, H.; Qu, Z.; He, A.; Wu, Z.; Zhan, X. Cloning and characterization of a novel cathepsin B-like cysteine proteinase from Angiostrongylus cantonensis. Parasitol. Res. 2012, 110, 2413–2422. [Google Scholar] [CrossRef]
  23. Jasmer, D.P.; Rosa, B.A.; Mitreva, M. Peptidases compartmentalized to the Ascaris suum intestinal lumen and apical intestinal membrane. PLoS Negl. Trop. Dis. 2015, 9, e3375. [Google Scholar] [CrossRef]
  24. Castagnone-Sereno, P.; Deleury, E.; Danchin, E.G.J.; Perfus-Barbeoch, L.; Abad, P. Data-mining of the Meloidogyne incognita degradome and comparative analysis of proteases in nematodes. Genomics 2011, 97, 29–36. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Korhonen, P.K.; Young, N.D.; Gasser, R.B. Making sense of genomes of parasitic worms: Tackling bioinformatic challenges. Biotechnol. Adv. 2016, 34, 663–686. [Google Scholar] [CrossRef] [PubMed]
  26. Patel, S.; Homaei, A.; El-Seedi, H.R.; Akhtar, N. Cathepsins: Proteases that are vital for survival but can also be fatal. Biomed. Pharmacother. 2018, 105, 526–532. [Google Scholar] [CrossRef] [PubMed]
  27. Gasser, R.B.; Schwarz, E.M.; Korhonen, P.K.; Young, N.D. Understanding Haemonchus contortus better through genomics and transcriptomics. Adv. Parasitol. 2016, 93, 519–567. [Google Scholar]
  28. Wang, T.; Nie, S.; Ma, G.; Korhonen, P.K.; Koehler, A.V.; Ang, C.S.; Reid, G.E.; Williamson, N.A.; Gasser, R.B. The developmental lipidome of Haemonchus contortus. Int. J. Parasitol. 2018, 48, 887–895. [Google Scholar] [CrossRef]
  29. Ma, G.; Gasser, R.B.; Wang, T.; Korhonen, P.K.; Young, N.D. Toward integrative ‘omics of the barber’s pole worm and related parasitic nematodes. Infect. Genet. Evol. 2020, 85, 104500. [Google Scholar] [CrossRef] [PubMed]
  30. Adderley, J.; Wang, T.; Ma, G.; Zheng, Y.; Young, N.D.; Doerig, C.; Gasser, R.B. Repurposing of a human antbody-based microarray to explore conserved components of the signalome of the parasitic nematode Haemonchus contortus. Parasit. Vectors 2022, 15, 273. [Google Scholar] [CrossRef]
  31. Taki, A.C.; Wang, T.; Nguyen, N.N.; Ang, C.-S.; Leeming, M.G.; Nie, S.; Byrne, J.J.; Young, N.D.; Zheng, Y.; Ma, G.; et al. Thermal proteome profiling reveals Haemonchus orphan protein HCO_011565 as a target of the nematocidal small molecule UMW-868. Front. Pharmacol. 2022, 13, 1014804. [Google Scholar] [CrossRef] [PubMed]
  32. Zheng, Y.; Ma, G.; Wang, T.; Hofmann, A.; Song, J.; Gasser, R.B.; Young, N.D. Ubiquitination pathway model for the barber’s pole worm—Haemonchus contortus. Int. J. Parasitol. 2022, 52, 581–590. [Google Scholar] [CrossRef] [PubMed]
  33. Emery, D.L.; Hunt, P.W.; Le Jambre, L.F. Haemonchus contortus: The then and now, and where to from here? Int. J. Parasitol. 2016, 46, 755–769. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Gasser, R.B.; von Samson-Himmelstjerna, G. Haemonchus contortus and Haemonchosis—Past, Present and Future Trends; Academic Press: Cambridge, MA, USA, 2016; ISBN 9780128103968. [Google Scholar]
  35. Carson, A.; Reichel, R.; Bell, S.; Collins, R.; Smith, J.; Bartley, D. Haemonchus contortus: An overview. Vet. Rec. 2023, 192, 755–769. [Google Scholar] [CrossRef]
  36. Ehsan, M.; Hu, R.S.; Liang, Q.L.; Hou, J.L.; Song, X.; Yan, R.; Zhu, X.Q.; Li, X. Advances in the development of anti-Haemonchus contortus vaccines: Challenges, opportunities, and perspectives. Vaccines 2020, 8, 555. [Google Scholar] [CrossRef]
  37. Chaparro, J.J.; Villar, D.; Zapata, J.D.; López, S.; Howell, S.B.; López, A.; Storey, B.E. Multi-drug resistant Haemonchus contortus in a sheep flock in Antioquia, Colombia. Vet. Parasitol. Reg. Stud. Rep. 2017, 10, 29–34. [Google Scholar] [CrossRef]
  38. Baltrušis, P.; Doyle, S.R.; Halvarsson, P.; Höglund, J. Genome-wide analysis of the response to ivermectin treatment by a Swedish field population of Haemonchus contortus. Int. J. Parasitol. Drugs Drug Resist. 2022, 18, 12–19. [Google Scholar] [CrossRef] [PubMed]
  39. Laing, R.; Kikuchi, T.; Martinelli, A.; Tsai, I.J.; Beech, R.N.; Redman, E.; Holroyd, N.; Bartley, D.J.; Beasley, H.; Britton, C.; et al. The genome and transcriptome of Haemonchus contortus, a key model parasite for drug and vaccine discovery. Genome Biol. 2013, 14, R88. [Google Scholar] [CrossRef] [Green Version]
  40. Schwarz, E.M.; Korhonen, P.K.; Campbell, B.E.; Young, N.D.; Jex, A.R.; Jabbar, A.; Hall, R.S.; Mondal, A.; Howe, A.C.; Pell, J.; et al. The genome and developmental transcriptome of the strongylid nematode Haemonchus contortus. Genome Biol. 2013, 14, R89. [Google Scholar] [CrossRef] [Green Version]
  41. Ma, G.; Wang, T.; Korhonen, P.K.; Ang, C.S.; Williamson, N.A.; Young, N.D.; Stroehlein, A.J.; Hall, R.S.; Koehler, A.V.; Hofmann, A.; et al. Molecular alterations during larval development of Haemonchus contortus in vitro are under tight post-transcriptional control. Int. J. Parasitol. 2018, 48, 763–772. [Google Scholar] [CrossRef] [Green Version]
  42. Ma, G.; Wang, T.; Korhonen, P.K.; Hofmann, A.; Sternberg, P.W.; Young, N.D.; Gasser, R.B. Elucidating the molecular and developmental biology of parasitic nematodes: Moving to a multiomics paradigm. Adv. Parasitol. 2020, 108, 175–229. [Google Scholar] [PubMed]
  43. Wang, T.; Ma, G.; Ang, C.S.; Korhonen, P.K.; Koehler, A.V.; Young, N.D.; Nie, S.; Williamson, N.A.; Gasser, R.B. High throughput LC-MS/MS-based proteomic analysis of excretory-secretory products from short-term in vitro culture of Haemonchus contortus. J. Proteom. 2019, 204, 103375. [Google Scholar] [CrossRef] [PubMed]
  44. Wang, T.; Ma, G.; Ang, C.S.; Korhonen, P.K.; Xu, R.; Nie, S.; Koehler, A.V.; Simpson, R.J.; Greening, D.W.; Reid, G.E.; et al. Somatic proteome of Haemonchus contortus. Int. J. Parasitol. 2019, 49, 311–320. [Google Scholar] [CrossRef] [PubMed]
  45. Wang, T.; Ma, G.; Ang, C.S.; Korhonen, P.K.; Stroehlein, A.J.; Young, N.D.; Hofmann, A.; Chang, B.C.H.; Williamson, N.A.; Gasser, R.B. The developmental phosphoproteome of Haemonchus contortus. J. Proteom. 2020, 213, 103615. [Google Scholar] [CrossRef]
  46. Wang, T.; Ma, G.; Nie, S.; Williamson, N.A.; Reid, G.E.; Gasser, R.B. Lipid composition and abundance in the reproductive and alimentary tracts of female Haemonchus contortus. Parasit. Vectors 2020, 13, 338. [Google Scholar] [CrossRef]
  47. Doyle, S.R.; Laing, R.; Bartley, D.J.; Britton, C.; Chaudhry, U.; Gilleard, J.S.; Holroyd, N.; Mable, B.K.; Maitland, K.; Morrison, A.A.; et al. A genome resequencing-based genetic map reveals the recombination landscape of an outbred parasitic nematode in the presence of polyploidy and polyandry. Genome Biol. Evol. 2018, 10, 396–409. [Google Scholar] [CrossRef] [Green Version]
  48. Doyle, S.R.; Tracey, A.; Laing, R.; Holroyd, N.; Bartley, D.; Bazant, W.; Beasley, H.; Beech, R.; Britton, C.; Brooks, K.; et al. Genomic and transcriptomic variation defines the chromosome-scale assembly of Haemonchus contortus, a model gastrointestinal worm. Commun. Biol. 2020, 3, 656. [Google Scholar] [CrossRef]
  49. Ma, G.; Wang, T.; Korhonen, P.K.; Stroehlein, A.J.; Young, N.D.; Gasser, R.B. Dauer signalling pathway model for Haemonchus contortus. Parasit. Vectors 2019, 12, 187. [Google Scholar] [CrossRef] [Green Version]
  50. Wang, D.; Korhonen, P.K.; Gasser, R.B.; Young, N.D. Improved genomic resources and new bioinformatic workflow for the carcinogenic parasite Clonorchis sinensis: Biotechnological implications. Biotechnol. Adv. 2018, 36, 894–904. [Google Scholar] [CrossRef]
  51. Zheng, Y.; Young, N.D.; Song, J.; Chang, B.C.H.; Gasser, R.B. An informatic workflow for the enhanced annotation of excretory/secretory proteins of Haemonchus contortus. Comput. Struct. Biotechnol. J. 2023, 21, 2696–2704. [Google Scholar] [CrossRef]
  52. Campos, T.L.; Korhonen, P.K.; Hofmann, A.; Gasser, R.B.; Young, N.D. Harnessing model organism genomics to underpin the machine learning-based prediction of essential genes in eukaryotes—Biotechnological implications. Biotechnol. Adv. 2022, 54, 107822. [Google Scholar] [CrossRef] [PubMed]
  53. Sarkis, G.J.; Kurpiewski, M.R.; Ashcom, J.D.; Jen-Jacobson, L.; Jacobson, L.A. Proteases of the nematode Caenorhabditis elegans. Arch. Biochem. Biophys. 1988, 261, 80–90. [Google Scholar] [CrossRef]
  54. LaFlamme, B.A.; Ravi Ram, K.; Wolfner, M.F. The Drosophila melanogaster seminal fluid protease ‘Seminase’ regulates proteolytic and post-mating reproductive processes. PLoS Genet. 2012, 8, e1002435. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  55. Kr, P.; Lee, J.; Mortimer, N.T. The S1A protease family members CG10764 and CG4793 regulate cellular immunity in Drosophila. MicroPubl. Biol. 2021, 10, 17912. [Google Scholar]
  56. Pratt, D.; Cox, G.N.; Milhausen, M.J.; Boisvenue, R.J. A developmentally regulated cysteine protease gene family in Haemonchus contortus. Mol. Biochem. Parasitol. 1990, 43, 181–191. [Google Scholar] [CrossRef] [PubMed]
  57. Shompole, S.; Jasmer, D.P. Cathepsin B-like cysteine proteases confer intestinal cysteine protease activity in Haemonchus contortus. J. Biol. Chem. 2001, 276, 2928–2934. [Google Scholar] [CrossRef] [Green Version]
  58. Ruiz, A.; Molina, J.M.; González, J.F.; Conde, M.M.; Martín, S.; Hernández, Y.I. Immunoprotection in goats against Haemonchus contortus after immunization with cysteine protease enriched protein fractions. Vet. Res. 2004, 35, 565–572. [Google Scholar] [CrossRef] [Green Version]
  59. Murray, L.; Geldhof, P.; Clark, D.; Knox, D.P.; Britton, C. Expression and purification of an active cysteine protease of Haemonchus contortus using Caenorhabditis elegans. Int. J. Parasitol. 2007, 37, 117–125. [Google Scholar] [CrossRef] [Green Version]
  60. Li, B.; Gadahi, J.A.; Gao, W.; Zhang, Z.; Ehsan, M.; Xu, L.; Song, X.; Li, X.; Yan, R. Characterization of a novel aspartyl protease inhibitor from Haemonchus contortus. Parasit. Vectors 2017, 10, 191. [Google Scholar] [CrossRef] [Green Version]
  61. Wolfsberg, T.G.; Madden, T.L. Sequence similarity searching using the BLAST family of programs. Curr. Protoc. Mol. Biol. 2001, 27, 6–8. [Google Scholar]
  62. Verhelst, S.H.L. Intramembrane proteases as drug targets. FEBS J. 2017, 284, 1489–1502. [Google Scholar] [CrossRef] [Green Version]
  63. Al-Awadhi, F.H.; Luesch, H. Targeting eukaryotic proteases for natural products-based drug development. Nat. Prod. Rep. 2020, 37, 827–860. [Google Scholar] [CrossRef] [PubMed]
  64. Jin, Z.; Du, X.; Xu, Y.; Deng, Y.; Liu, M.; Zhao, Y.; Zhang, B.; Li, X.; Zhang, L.; Peng, C.; et al. Structure of Mpro from SARS-CoV-2 and discovery of its inhibitors. Nature 2020, 582, 289–293. [Google Scholar] [CrossRef] [Green Version]
  65. Mashiyama, S.T.; Koupparis, K.; Caffrey, C.R.; McKerrow, J.H.; Babbitt, P.C. A global comparison of the human and T. brucei degradomes gives insights about possible parasite drug targets. PLoS Negl. Trop. Dis. 2012, 6, e1942. [Google Scholar] [CrossRef] [PubMed]
  66. Rascon, A.; McKerrow, J. Synthetic and natural protease inhibitors provide insights into parasite development, virulence and pathogenesis. Curr. Med. Chem. 2013, 20, 3078–3102. [Google Scholar] [CrossRef]
  67. Muleke, C.I.; Ruofeng, Y.; Lixin, X.; Yanming, S.; Xiangrui, L. Characterization of HC58cDNA, a putative cysteine protease from the parasite Haemonchus contortus. J. Vet. Sci. 2006, 7, 249–255. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  68. Bakshi, M.; Tuo, W.; Aroian, R.V.; Zarlenga, D. Immune reactivity and host modulatory roles of two novel Haemonchus contortus cathepsin B-like proteases. Parasit. Vectors 2021, 14, 580. [Google Scholar] [CrossRef] [PubMed]
  69. Zou, J.; Huss, M.; Abid, A.; Mohammadi, P.; Torkamani, A.; Telenti, A. A primer on deep learning in genomics. Nat. Genet. 2019, 51, 12–18. [Google Scholar] [CrossRef]
  70. Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef]
  71. Varadi, M.; Anyango, S.; Deshpande, M.; Nair, S.; Natassia, C.; Yordanova, G.; Yuan, D.; Stroe, O.; Wood, G.; Laydon, A.; et al. AlphaFold protein structure database: Massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res. 2022, 50, 439–444. [Google Scholar] [CrossRef]
  72. Shaye, D.D.; Greenwald, I. Ortholist: A compendium of C. elegans genes with human orthologs. PLoS ONE 2011, 6, e20085. [Google Scholar] [CrossRef] [PubMed]
  73. Kim, W.; Underwood, R.S.; Greenwald, I.; Shaye, D.D. Ortholist 2: A new comparative genomic analysis of human and Caenorhabditis elegans genes. Genetics 2018, 210, 445–461. [Google Scholar] [CrossRef] [Green Version]
  74. Kochan, G.; Krojer, T.; Harvey, D.; Fischer, R.; Chen, L.; Vollmar, M.; Von Delft, F.; Kavanagh, K.L.; Brown, M.A.; Bowness, P.; et al. Crystal structures of the endoplasmic reticulum aminopeptidase-1 (ERAP1) reveal the molecular basis for N-terminal peptide trimming. Proc. Natl. Acad. Sci. USA 2011, 108, 7745–7750. [Google Scholar] [CrossRef] [PubMed]
  75. Chothia, C.; Gough, J. Genomic and structural aspects of protein evolution. Biochem. J. 2009, 429, 15–28. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  76. Madej, T.; Lanczycki, C.J.; Zhang, D.; Thiessen, P.A.; Geer, R.C.; Marchler-Bauer, A.; Bryant, S.H. MMDB and VAST+: Tracking structural similarities between macromolecular complexes. Nucleic Acids Res. 2014, 42, 297–303. [Google Scholar] [CrossRef] [Green Version]
  77. Malik, A.J.; Poole, A.M.; Allison, J.R. Structural phylogenetics with confidence. Mol. Biol. Evol. 2020, 37, 2711–2726. [Google Scholar] [CrossRef] [PubMed]
  78. Zheng, C.; Kabaleeswaran, V.; Wang, Y.; Cheng, G.; Wu, H. Crystal structures of the TRAF2: cIAP2 and the TRAF1: TRAF2: cIAP2 complexes: Affinity, specificity, and regulation. Mol. Cell 2010, 38, 101–113. [Google Scholar] [CrossRef] [Green Version]
  79. Lobb, B.; Doxey, A.C. Novel function discovery through sequence and structural data mining. Curr. Opin. Struct. Biol. 2016, 38, 53–61. [Google Scholar] [CrossRef]
  80. Levine, T.P. Structural bioinformatics predicts that the Retinitis Pigmentosa-28 protein of unknown function FAM161A is a homologue of the microtubule nucleation factor Tpx2. F1000Research 2020, 9, 1052. [Google Scholar] [CrossRef]
  81. Sanchez-Pulido, L.; Ponting, C.P. Extending the horizon of homology detection with coevolution-based structure prediction. J. Mol. Biol. 2021, 433, 167106. [Google Scholar] [CrossRef] [PubMed]
  82. Godzik, A. The structural alignment between two proteins: Is there a unique answer? Protein Sci. 1996, 5, 1325–1338. [Google Scholar] [CrossRef] [PubMed]
  83. Kolodny, R.; Petrey, D.; Honig, B. Protein structure comparison: Implications for the nature of ‘fold space’, and structure and function prediction. Curr. Opin. Struct. Biol. 2006, 16, 393–398. [Google Scholar] [CrossRef]
  84. Li, Z.; Jaroszewski, L.; Iyer, M.; Sedova, M.; Godzik, A. FATCAT 2.0: Towards a better understanding of the structural diversity of proteins. Nucleic Acids Res. 2020, 48, 60–64. [Google Scholar] [CrossRef] [PubMed]
  85. Sillitoe, I.; Lewis, T.E.; Cuff, A.; Das, S.; Ashford, P.; Dawson, N.L.; Furnham, N.; Laskowski, R.A.; Lee, D.; Lees, J.G.; et al. CATH: Comprehensive structural and functional annotations for genome sequences. Nucleic Acids Res. 2015, 43, 376–381. [Google Scholar] [CrossRef] [PubMed]
  86. Laskowski, R.A. Protein structure databases. Methods Mol. Biol. 2016, 1415, 31–53. [Google Scholar] [PubMed]
  87. Burley, S.K.; Bhikadiya, C.; Bi, C.; Bittrich, S.; Chen, L.; Crichlow, G.V.; Christie, C.H.; Dalenberg, K.; Di Costanzo, L.; Duarte, J.M.; et al. RCSB protein data bank: Powerful new tools for exploring 3D structures of biological macromolecules for basic and applied research and education in fundamental biology, biomedicine, biotechnology, bioengineering and energy sciences. Nucleic Acids Res. 2021, 49, 437–451. [Google Scholar] [CrossRef]
  88. Service, R.F. Huge protein structure database could transform biology. Science 2021, 373, 478. [Google Scholar] [CrossRef]
  89. Sajid, M.; McKerrow, J.H. Cysteine proteases of parasitic organisms. Mol. Biochem. Parasitol. 2002, 120, 1–21. [Google Scholar] [CrossRef]
  90. Wu, Y.; Wang, X.; Liu, X.; Wang, Y. Data-mining approaches reveal hidden families of proteases in the genome of malaria parasite. Genome Res. 2003, 13, 601–616. [Google Scholar] [CrossRef] [Green Version]
  91. Zhao, P.; Wang, G.-H.; Dong, Z.-M.; Duan, J.; Xu, P.-Z.; Cheng, T.-C.; Xiang, Z.-H.; Xia, Q.-Y. Genome-wide identification and expression analysis of serine proteases and homologs in the silkworm Bombyx mori. BMC Genom. 2010, 11, 405. [Google Scholar] [CrossRef] [Green Version]
  92. Sharma, R.; Suresh, C.G. Genome-wide identification and structure-function studies of proteases and protease inhibitors in Cicer arietinum (chickpea). Comput. Biol. Med. 2015, 56, 67–81. [Google Scholar] [CrossRef]
  93. Henriques, B.S.; Gomes, B.; da Costa, S.G.; da Silva Moraes, C.; Mesquita, R.D.; Dillon, V.M.; de Souza Garcia, E.; Azambuja, P.; Dillon, R.J.; Genta, F.A. Genome wide mapping of peptidases in Rhodnius prolixus: Identification of protease gene duplications, horizontally transferred proteases and analysis of peptidase A1 structures, with considerations on their role in the evolution of hematophagy in Triatominae. Front. Physiol. 2017, 8, 1051. [Google Scholar] [PubMed] [Green Version]
  94. Gurumallesh, P.; Alagu, K.; Ramakrishnan, B.; Muthusamy, S. A systematic reconsideration on proteases. Int. J. Biol. Macromol. 2019, 128, 254–267. [Google Scholar] [CrossRef] [PubMed]
  95. Jacobson, L.A.; Jen-Jacobson, L.; Hawdon, J.M.; Owens, G.P.; Bolanowski, M.A.; Emmons, S.W.; Shah, M.V.; Pollock, R.A.; Conklin, D.S. Identification of a putative structural gene for cathepsin D in Caenorhabditis elegans. Genetics 1988, 119, 355–363. [Google Scholar] [CrossRef] [PubMed]
  96. Dunn, B.M. Structure and mechanism of the pepsin-like family of aspartic peptidases. Chem. Rev. 2002, 102, 4431–4458. [Google Scholar] [CrossRef]
  97. Egan, B.M.; Scharf, A.; Pohl, F.; Kornfeld, K. Control of aging by the renin-angiotensin system: A review of C. elegans, Drosophila, and mammals. Front. Pharmacol. 2022, 13, 938650. [Google Scholar] [CrossRef] [PubMed]
  98. Kay, J.; Meijer, H.J.G.; ten Have, A.; van Kan, J.A.L. The aspartic proteinase family of three Phytophthora species. BMC Genomics 2011, 12, 254. [Google Scholar] [CrossRef] [Green Version]
  99. Wang, S.; Wei, W.; Luo, X.; Wang, S.; Hu, S.; Cai, X. Comparative genomic analysis of aspartic proteases in eight parasitic platyhelminths: Insights into functions and evolution. Gene 2015, 559, 52–61. [Google Scholar] [CrossRef]
  100. Xu, L.; Xu, M.; Sun, X.; Xu, J.; Zeng, X.; Shan, D.; Yuan, D.; He, P.; He, W.; Yang, Y.; et al. The genetic basis of adaptive evolution in parasitic environment from the Angiostrongylus cantonensis genome. PLoS Negl. Trop. Dis. 2019, 13, e0007846. [Google Scholar] [CrossRef] [Green Version]
  101. Koehler, J.W.; Morales, M.E.; Shelby, B.D.; Brindley, P.J. Aspartic protease activities of schistosomes cleave mammalian hemoglobins in a host-specific manner. Mem. Inst. Oswaldo Cruz 2007, 102, 83–85. [Google Scholar] [CrossRef] [Green Version]
  102. Hwang, K.-P.; Chang, S.-H.; Wang, L.-C. Alterations in the expression level of a putative aspartic protease in the development of Angiostrongylus cantonensis. Acta Trop. 2010, 113, 289–294. [Google Scholar] [CrossRef]
  103. Xu, J.; Yue, W.W.; Xu, Y.X.Y.; Hao, H.N.; Liu, R.D.; Long, S.R.; Wang, Z.Q.; Cui, J. Molecular characterization of a novel aspartyl protease-1 from Trichinella spiralis. Res. Vet. Sci. 2021, 134, 1–11. [Google Scholar] [CrossRef] [PubMed]
  104. Rana, S.; Dikhit, M.R.; Rani, M.; Moharana, K.C.; Sahoo, G.C.; Das, P. CPDB: Cysteine protease annotation database in Leishmania species. Integr. Biol. 2012, 4, 1351–1357. [Google Scholar] [CrossRef] [PubMed]
  105. Liu, J.; Ma’ayeh, S.; Peirasmaki, D.; Lundström-Stadelmann, B.; Hellman, L.; Svärd, S.G. Secreted Giardia intestinalis cysteine proteases disrupt intestinal epithelial cell junctional complexes and degrade chemokines. Virulence 2018, 9, 879–894. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  106. Solberg, R.; Lunde, N.N.; Forbord, K.M.; Okla, M.; Kassem, M.; Jafari, A. The mammalian cysteine protease legumain in health and disease. Int. J. Mol. Sci. 2022, 23, 15983. [Google Scholar] [CrossRef] [PubMed]
  107. Siqueira-Neto, J.L.; Debnath, A.; McCall, L.-I.; Bernatchez, J.A.; Ndao, M.; Reed, S.L.; Rosenthal, P.J. Cysteine proteases in protozoan parasites. PLoS Negl. Trop. Dis. 2018, 12, e0006512. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  108. Rosenthal, P.J. Falcipain cysteine proteases of malaria parasites: An update. Biochim. Biophys. Acta Proteins Proteom. 2020, 1868, 140362. [Google Scholar] [CrossRef]
  109. Bos, D.H.; Mayfield, C.; Minchella, D.J. Analysis of regulatory protease sequences identified through bioinformatic data mining of the Schistosoma mansoni genome. BMC Genom. 2009, 10, 488. [Google Scholar] [CrossRef] [Green Version]
  110. McKerrow, J.H. The diverse roles of cysteine proteases in parasites and their suitability as drug targets. PLoS Negl. Trop. Dis. 2018, 12, e0005639. [Google Scholar] [CrossRef]
  111. Allain, T.; Fekete, E.; Buret, A.G. Giardia cysteine proteases: The teeth behind the smile. Trends Parasitol. 2019, 35, 636–648. [Google Scholar] [CrossRef]
  112. Freitas-Rodríguez, S.; Folgueras, A.R.; López-Otín, C. The role of matrix metalloproteinases in aging: Tissue remodeling and beyond. Biochim. Biophys. Acta Mol. Cell Res. 2017, 1864, 2015–2025. [Google Scholar] [CrossRef] [PubMed]
  113. Rivera, S.; García-González, L.; Khrestchatisky, M.; Baranger, K. Metalloproteinases and their tissue inhibitors in Alzheimer’s disease and other neurodegenerative disorders. Cell. Mol. Life Sci. 2019, 76, 3167–3191. [Google Scholar] [CrossRef] [PubMed]
  114. Martín-Galiano, A.J.; Sotillo, J. Insights into the functional expansion of the astacin peptidase family in parasitic helminths. Int. J. Parasitol. 2022, 52, 243–251. [Google Scholar] [CrossRef] [PubMed]
  115. Williamson, A.L.; Brindley, P.J.; Knox, D.P.; Hotez, P.J.; Loukas, A. Digestive proteases of blood-feeding nematodes. Trends Parasitol. 2003, 19, 417–423. [Google Scholar] [CrossRef]
  116. Pearson, M.S.; Ranjit, N.; Loukas, A. Blunting the knife: Development of vaccines targeting digestive proteases of blood-feeding helminth parasites. Biol. Chem. 2010, 391, 901–911. [Google Scholar] [CrossRef]
  117. Ranjit, N.; Zhan, B.; Hamilton, B.; Stenzel, D.; Lowther, J.; Pearson, M.; Gorman, J.; Hotez, P.; Loukas, A. Proteolytic degradation of hemoglobin in the intestine of the human hookworm Necator americanus. J. Infect. Dis. 2009, 199, 904–912. [Google Scholar] [CrossRef] [Green Version]
  118. Boisvenue, R.J.; Stiff, M.I.; Tonkinson, L.V.; Cox, G.N.; Hageman, R. Fibrinogen-degrading proteins from Haemonchus contortus used to vaccinate sheep. Am. J. Vet. Res. 1992, 53, 1263–1265. [Google Scholar]
  119. Smith, W.D.; Smith, S.K.; Murray, J.M. Protection studies with integral membrane fractions of Haemonchus contortus. Parasite Immunol. 1994, 16, 231–241. [Google Scholar] [CrossRef]
  120. Longbottom, D.; Redmond, D.L.; Russell, M.; Liddell, S.; Smith, W.D.; Knox, D.P. Molecular cloning and characterisation of a putative aspartate proteinase associated with a gut membrane protein complex from adult Haemonchus contortus. Mol. Biochem. Parasitol. 1997, 88, 63–72. [Google Scholar] [CrossRef]
  121. Redmond, D.L.; Knox, D.P.; Newlands, G.; Smith, W.D. Molecular cloning and characterisation of a developmentally regulated putative metallopeptidase present in a host protective extract of Haemonchus contortus. Mol. Biochem. Parasitol. 1997, 85, 77–87. [Google Scholar] [CrossRef]
  122. Tak, I.R.; Dar, J.S.; Dar, S.A.; Ganai, B.A.; Chishti, M.Z.; Ahmad, F. A comparative analysis of various antigenic proteins found in Haemonchus contortus—A review. Mol. Biol. 2015, 49, 883–890. [Google Scholar] [CrossRef]
  123. Scarff, C.A.; Thompson, R.F.; Newlands, G.F.J.; Jamson, A.H.; Kennaway, C.; da Silva, V.J.; Rabelo, E.M.; Song, C.F.; Trinick, J.; Smith, W.D.; et al. Structure of the protective nematode protease complex H-gal-GP and its conservation across roundworm parasites. PLoS Pathog. 2020, 16, e1008465. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  124. Camas-Pereyra, R.; Bautista-García, G.A.; Avila, G.; Alcala-Canto, Y.; Maza-Lopez, J.; Reyes-Guerrero, D.E.; Higuera-Piedrahita, R.I.; López-Arellano, M.E. In silico analysis of two Haemonchus spp. serine protease peptides (S28) and their immunomodulatory activity in vitro. Mol. Biochem. Parasitol. 2023, 253, 111545. [Google Scholar] [CrossRef] [PubMed]
  125. Smith, T.S.; Munn, E.A.; Graham, M.; Tavernor, A.S.; Greenwood, C.A. Purification and evaluation of the integral membrane protein H11 as a protective antigen against Haemonchus contortus. Int. J. Parasitol. 1993, 23, 271–280. [Google Scholar] [CrossRef]
  126. Smith, T.S.; Graham, M.; Munn, E.A.; Newton, S.E.; Knox, D.P.; Coadwell, W.J.; McMichael-Phillips, D.; Smith, H.; Smith, W.D.; Oliver, J.J. Cloning and characterization of a microsomal aminopeptidase from the intestine of the nematode Haemonchus contortus. Biochim. Biophys. Acta Protein Struct. Mol. Enzymol. 1997, 1338, 295–306. [Google Scholar] [CrossRef]
  127. Smith, S.K.; Pettit, D.; Newlands, G.F.; Redmond, D.L.; Skuce, P.J.; Knox, D.P.; Smith, W.D. Further immunization and biochemical studies with a protective antigen complex from the microvillar membrane of the intestine of Haemonchus contortus. Parasite Immunol. 1999, 21, 187–199. [Google Scholar] [CrossRef]
  128. Munn, E.A.; Smith, T.S.; Smith, H.; James, F.M.; Smith, F.C.; Andrews, S.J. Vaccination against Haemonchus contortus with denatured forms of the protective antigen H11. Parasite Immunol. 1997, 19, 243–248. [Google Scholar] [CrossRef]
  129. LeJambre, L.F.; Windon, R.G.; Smith, W.D. Vaccination against Haemonchus contortus: Performance of native parasite gut membrane glycoproteins in Merino lambs grazing contaminated pasture. Vet. Parasitol. 2008, 153, 302–312. [Google Scholar] [CrossRef]
  130. Nisbet, A.J.; Meeusen, E.N.; González, J.F.; Piedrafita, D.M. Immunity to Haemonchus contortus and vaccine development. Adv. Parasitol. 2016, 93, 353–396. [Google Scholar]
  131. Wang, C.; Liu, L.; Wang, T.; Liu, X.; Peng, W.; Srivastav, R.K.; Zhu, X.-Q.; Gupta, N.; Gasser, R.B.; Hu, M. H11-induced immunoprotection is predominantly linked to N-glycan moieties during Haemonchus contortus infection. Front. Immunol. 2022, 13, 1034820. [Google Scholar] [CrossRef]
  132. Hashmi, S.; Britton, C.; Liu, J.; Guiliano, D.B.; Oksov, Y.; Lustigman, S. Cathepsin L is essential for embryogenesis and development of Caenorhabditis elegans. J. Biol. Chem. 2002, 277, 3477–3486. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  133. Stepek, G.; McCormack, G.; Birnie, A.J.; Page, A.P. The astacin metalloprotease moulting enzyme NAS-36 is required for normal cuticle ecdysis in free-living and parasitic nematodes. Parasitology 2011, 138, 237–248. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  134. Huang, Y.; Wu, J.; Chen, X.; Tong, D.; Zhou, J.; Wu, F.; Zhang, H.; Yang, Y.; Ma, G.; Du, A. A zinc metalloprotease NAS-33 is required for molting and survival in parasitic nematode Haemonchus contortus. Front. Cell Dev. Biol. 2021, 9, 695003. [Google Scholar] [CrossRef] [PubMed]
  135. Lu, M.R.; Lai, C.-K.; Liao, B.-Y.; Tsai, I.J. Comparative transcriptomics across nematode life cycles reveal gene expression conservation and correlated evolution in adjacent developmental stages. Genome Biol. Evol. 2020, 12, 1019–1030. [Google Scholar] [CrossRef] [PubMed]
  136. Wit, J.; Dilks, C.M.; Andersen, E.C. Complementary approaches with free-living and parasitic nematodes to understanding anthelmintic resistance. Trends Parasitol. 2021, 37, 240–250. [Google Scholar] [CrossRef] [PubMed]
  137. Karanu, F.N.; Rurangirwa, F.R.; McGuire, T.C.; Jasmer, D.P. Haemonchus contortus: Identification of proteases with diverse characteristics in adult worm excretory-secretory products. Exp. Parasitol. 1993, 77, 362–371. [Google Scholar] [CrossRef]
  138. Newlands, G.F.; Skuce, P.J.; Knox, D.P.; Smith, W.D. Cloning and expression of cystatin, a potent cysteine protease inhibitor from the gut of Haemonchus contortus. Parasitology 2001, 122, 371–378. [Google Scholar] [CrossRef]
  139. Munn, E.A.; Smith, T.S.; Graham, M.; Tavernor, A.S.; Greenwood, C.A. The potential value of integral membrane proteins in the vaccination of lambs against Haemonchus contortus. Int. J. Parasitol. 1993, 23, 261–269. [Google Scholar] [CrossRef]
  140. Skuce, P.J.; Redmond, D.L.; Liddell, S.; Stewart, E.M.; Newlands, G.F.; Smith, W.D.; Knox, D.P. Molecular cloning and characterization of gut-derived cysteine proteinases associated with a host protective extract from Haemonchus contortus. Parasitology 1999, 119, 405–412. [Google Scholar] [CrossRef]
  141. Jasmer, D.P.; Roth, J.; Myler, P.J. Cathepsin B-like cysteine proteases and Caenorhabditis elegans homologues dominate gene products expressed in adult Haemonchus contortus intestine. Mol. Biochem. Parasitol. 2001, 116, 159–169. [Google Scholar] [CrossRef]
  142. Atkinson, H.J.; Babbitt, P.C.; Sajid, M. The global cysteine peptidase landscape in parasites. Trends Parasitol. 2009, 25, 573–581. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  143. Rhoads, M.L.; Fetterer, R.H. Developmentally regulated secretion of cathepsin L-like cysteine proteases by Haemonchus contortus. J. Parasitol. 1995, 81, 505–512. [Google Scholar] [CrossRef] [PubMed]
  144. Edelmann, M.J.; Kessler, B.M. Ubiquitin and ubiquitin-like specific proteases targeted by infectious pathogens: Emerging patterns and molecular principles. Biochim. Biophys. Acta 2008, 1782, 809–816. [Google Scholar] [CrossRef]
  145. Smith, J.R.; Stanfield, G.M. TRY-5 is a sperm-activating protease in Caenorhabditis elegans seminal fluid. PLoS Genet. 2011, 7, e1002375. [Google Scholar] [CrossRef]
  146. Signorelli, J.; Diaz, E.S.; Morales, P. Kinases, phosphatases and proteases during sperm capacitation. Cell Tissue Res. 2012, 349, 765–782. [Google Scholar] [CrossRef] [PubMed]
  147. Holcomb, R.J.; Oura, S.; Nozawa, K.; Kent, K.; Yu, Z.; Robertson, M.J.; Coarfa, C.; Matzuk, M.M.; Ikawa, M.; Garcia, T.X. The testis-specific serine proteases PRSS44, PRSS46, and PRSS54 are dispensable for male mouse fertility. Biol. Reprod. 2020, 102, 84–91. [Google Scholar] [CrossRef]
  148. Howe, K.L.; Bolt, B.J.; Shafie, M.; Kersey, P.; Berriman, M. WormBase Parasite—A comprehensive resource for helminth genomics. Mol. Biochem. Parasitol. 2017, 215, 2–10. [Google Scholar] [CrossRef] [PubMed]
  149. Johnson, M.; Zaretskaya, I.; Raytselis, Y.; Merezhuk, Y.; McGinnis, S.; Madden, T.L. NCBI BLAST: A better web interface. Nucleic Acids Res. 2008, 36, 5–9. [Google Scholar] [CrossRef]
  150. Jones, P.; Binns, D.; Chang, H.Y.; Fraser, M.; Li, W.; McAnulla, C.; McWilliam, H.; Maslen, J.; Mitchell, A.; Nuka, G.; et al. InterProScan 5: Genome-scale protein function classification. Bioinformatics 2014, 30, 1236–1240. [Google Scholar] [CrossRef] [Green Version]
  151. Mistry, J.; Chuguransky, S.; Williams, L.; Qureshi, M.; Salazar, G.A.; Sonnhammer, E.L.L.; Tosatto, S.C.E.; Paladin, L.; Raj, S.; Richardson, L.J.; et al. Pfam: The protein families database in 2021. Nucleic Acids Res. 2021, 49, 412–419. [Google Scholar] [CrossRef]
  152. Thomas, P.D.; Campbell, M.J.; Kejariwal, A.; Mi, H.; Karlak, B.; Daverman, R.; Diemer, K.; Muruganujan, A.; Narechania, A. PANTHER: A library of protein families and subfamilies indexed by function. Genome Res. 2003, 13, 2129–2141. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  153. Berman, H.M.; Westbrook, J.; Feng, Z.; Gilliland, G.; Bhat, T.N.; Weissig, H.; Shindyalov, I.N.; Bourne, P.E. The protein data bank. Nucleic Acids Res. 2000, 28, 235–242. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  154. Zhang, C.; Shine, M.; Pyle, A.M.; Zhang, Y. US-align: Universal structure alignments of proteins, nucleic acids, and macromolecular complexes. Nat. Methods 2022, 19, 1109–1115. [Google Scholar] [CrossRef] [PubMed]
  155. Gasteiger, E.; Gattiker, A.; Hoogland, C.; Ivanyi, I.; Appel, R.D.; Bairoch, A. ExPASy: The proteomics server for in-depth protein knowledge and analysis. Nucleic Acids Res. 2003, 31, 3784–3788. [Google Scholar] [CrossRef] [Green Version]
  156. Duvaud, S.; Gabella, C.; Lisacek, F.; Stockinger, H.; Ioannidis, V.; Durinx, C. Expasy, the swiss bioinformatics resource portal, as designed by its users. Nucleic Acids Res. 2021, 49, 216–227. [Google Scholar] [CrossRef]
  157. Cock, P.J.A.; Antao, T.; Chang, J.T.; Chapman, B.A.; Cox, C.J.; Dalke, A.; Friedberg, I.; Hamelryck, T.; Kauff, F.; Wilczynski, B.; et al. Biopython: Freely available python tools for computational molecular biology and bioinformatics. Bioinformatics 2009, 25, 1422–1423. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  158. Teufel, F.; Almagro Armenteros, J.J.; Johansen, A.R.; Gíslason, M.H.; Pihl, S.I.; Tsirigos, K.D.; Winther, O.; Brunak, S.; von Heijne, G.; Nielsen, H. SignalP 6.0 predicts all five types of signal peptides using protein language models. Nat. Biotechnol. 2022, 40, 1023–1025. [Google Scholar] [CrossRef]
  159. Chen, C.; Chen, H.; Zhang, Y.; Thomas, H.R.; Frank, M.H.; He, Y.; Xia, R. TBtools: An integrative toolkit developed for interactive analyses of big biological data. Mol. Plant 2020, 13, 1194–1202. [Google Scholar] [CrossRef]
  160. Emms, D.M.; Kelly, S. OrthoFinder: Phylogenetic orthology inference for comparative genomics. Genome Biol. 2019, 20, 238. [Google Scholar] [CrossRef] [Green Version]
  161. Jassal, B.; Matthews, L.; Viteri, G.; Gong, C.; Lorente, P.; Fabregat, A.; Sidiropoulos, K.; Cook, J.; Gillespie, M.; Haw, R.; et al. The reactome pathway knowledgebase. Nucleic Acids Res. 2020, 48, 498–503. [Google Scholar] [CrossRef]
  162. Mohandas, N.; Young, N.D.; Jabbar, A.; Korhonen, P.K.; Koehler, A.V.; Amani, P.; Hall, R.S.; Sternberg, P.W.; Jex, A.R.; Hofmann, A.; et al. The barber’s pole worm CAP protein superfamily—A basis for fundamental discovery and biotechnology advances. Biotechnol. Adv. 2015, 33, 1744–1754. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  163. Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R. The sequence alignment/map format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  164. Ewels, P.; Magnusson, M.; Lundin, S.; Käller, M. MultiQC: Summarize analysis results for multiple tools and samples in a single report. Bioinformatics 2016, 32, 3047–3048. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  165. Pertea, M.; Kim, D.; Pertea, G.M.; Leek, J.T.; Salzberg, S.L. Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat. Protoc. 2016, 11, 1650–1667. [Google Scholar] [CrossRef]
  166. Orsburn, B.C. Proteome discoverer—A community enhanced data processing suite for protein informatics. Proteomes 2021, 9, 15. [Google Scholar] [CrossRef]
Figure 2. Haemonchus contortus proteases and protease inhibitors, and their annotations. At the top, the numbers of proteins with or without signal peptides are given as well as total numbers (right). At the bottom, all proteins were assigned the GO annotations, according to biological process (BP), cellular component (CC) and/or molecular function (MF) (cf. Table S2).
Figure 2. Haemonchus contortus proteases and protease inhibitors, and their annotations. At the top, the numbers of proteins with or without signal peptides are given as well as total numbers (right). At the bottom, all proteins were assigned the GO annotations, according to biological process (BP), cellular component (CC) and/or molecular function (MF) (cf. Table S2).
Ijms 24 12320 g002
Figure 3. Transcription profiles of protease and protease inhibitor genes in different developmental stages of Haemonchus contortus. Free-living stages: egg; first-, second- and third-stage larvae (i.e., L1, L2 and L3; and free-living stages). Parasitic stages: female and male fourth-stage larvae (L4f and L4m); and female and male adults (Af and Am). Left: clustering of genes based on transcription profiles, with green indicating genes encoding excretory/secretory (ES) proteins and yellow denoting other (i.e., non-ES) proteins. Right: transcription profiles of individual genes. The blue to red scale indicates normalised fragments per kilobase per million (FPKM) values.
Figure 3. Transcription profiles of protease and protease inhibitor genes in different developmental stages of Haemonchus contortus. Free-living stages: egg; first-, second- and third-stage larvae (i.e., L1, L2 and L3; and free-living stages). Parasitic stages: female and male fourth-stage larvae (L4f and L4m); and female and male adults (Af and Am). Left: clustering of genes based on transcription profiles, with green indicating genes encoding excretory/secretory (ES) proteins and yellow denoting other (i.e., non-ES) proteins. Right: transcription profiles of individual genes. The blue to red scale indicates normalised fragments per kilobase per million (FPKM) values.
Ijms 24 12320 g003
Table 1. Distribution of protease and protease inhibitor genes, proteins (isoforms) and protein families in Haemonchus contortus.
Table 1. Distribution of protease and protease inhibitor genes, proteins (isoforms) and protein families in Haemonchus contortus.
Class/GroupNumber of GenesNumber of Proteins (Including Isoforms)Number of Protein Families
Aspartic proteases1151214
Cysteine proteases13314520
Metallo-proteases25928021
Serine proteases12714212
‘Mixed’ proteases728112
Protease inhibitors849114
Totals:79086083
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zheng, Y.; Young, N.D.; Song, J.; Gasser, R.B. Genome-Wide Analysis of Haemonchus contortus Proteases and Protease Inhibitors Using Advanced Informatics Provides Insights into Parasite Biology and Host–Parasite Interactions. Int. J. Mol. Sci. 2023, 24, 12320. https://0-doi-org.brum.beds.ac.uk/10.3390/ijms241512320

AMA Style

Zheng Y, Young ND, Song J, Gasser RB. Genome-Wide Analysis of Haemonchus contortus Proteases and Protease Inhibitors Using Advanced Informatics Provides Insights into Parasite Biology and Host–Parasite Interactions. International Journal of Molecular Sciences. 2023; 24(15):12320. https://0-doi-org.brum.beds.ac.uk/10.3390/ijms241512320

Chicago/Turabian Style

Zheng, Yuanting, Neil D. Young, Jiangning Song, and Robin B. Gasser. 2023. "Genome-Wide Analysis of Haemonchus contortus Proteases and Protease Inhibitors Using Advanced Informatics Provides Insights into Parasite Biology and Host–Parasite Interactions" International Journal of Molecular Sciences 24, no. 15: 12320. https://0-doi-org.brum.beds.ac.uk/10.3390/ijms241512320

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop