Next Article in Journal
Comparative Genomics Analysis of Lactobacillus mucosae from Different Niches
Next Article in Special Issue
Domesticated gag Gene of Drosophila LTR Retrotransposons Is Involved in Response to Oxidative Stress
Previous Article in Journal
Shedding Light on the Antimicrobial Peptide Arsenal of Terrestrial Isopods: Focus on Armadillidins, a New Crustacean AMP Family
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Complete Genome of an Endogenous Nimavirus (Nimav-1_LVa) From the Pacific Whiteleg Shrimp Penaeus (Litopenaeus) Vannamei

1
Genetic Information Research Institute, 20380 Town Center Lane, Suite 240, Cupertino, CA 95014, USA
2
Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, 106 Nanjing Road, Qingdao 266071, China
3
Fundación para la Conservation de la Biodiversidad Acuática y Terrestre (FUCOBI), Quito EC1701, Ecuador
4
Environmental Genomics Inc., ONE HEALTH Epigenomics Educational Initiative, P.O. Box 196, Southborough, MA 01772, USA
*
Authors to whom correspondence should be addressed.
Submission received: 17 December 2019 / Revised: 8 January 2020 / Accepted: 9 January 2020 / Published: 14 January 2020

Abstract

:
White spot syndrome virus (WSSV), the lone virus of the genus Whispovirus under the family Nimaviridae, is one of the most devastating viruses affecting the shrimp farming industry. Knowledge about this virus, in particular, its evolution history, has been limited, partly due to its large genome and the lack of other closely related free-living viruses for comparative studies. In this study, we reconstructed a full-length endogenous nimavirus consensus genome, Nimav-1_LVa (279,905 bp), in the genome sequence of Penaeus (Litopenaeus) vannamei breed Kehai No. 1 (ASM378908v1). This endogenous virus seemed to insert exclusively into the telomeric pentanucleotide microsatellite (TAACC/GGTTA)n. It encoded 117 putative genes, with some containing introns, such as g012 (inhibitor of apoptosis, IAP), g046 (crustacean hyperglycemic hormone, CHH), g155 (innexin), g158 (Bax inhibitor 1 like). More than a dozen Nimav-1_LVa genes are involved in the pathogen-host interactions. We hypothesized that g046, g155, g158, and g227 (semaphorin 1A like) were recruited host genes for their roles in immune regulation. Sequence analysis indicated that a total of 43 WSSV genes belonged to the ancestral/core nimavirus gene set, including four genes reported in this study: wsv112 (dUTPase), wsv206, wsv226, and wsv308 (nucleocapsid protein). The availability of the Nimav-1_LVa sequence would help understand the genetic diversity, epidemiology, evolution, and virulence of WSSV.

1. Introduction

The pacific whiteleg shrimp Penaeus (Litopenaeus) vannamei is one of the most important penaeid species in the aquaculture and fishing industry. The natural range of wild P. vannamei populations is the pacific coast of Latin America, from northern Peru to northern Mexico. However, P. vannamei has been introduced into most of the shrimp-producing countries around the world, partly due to the domestication and availability of specific pathogen-free (SPF) stocks [1,2,3]. The term SPF means “healthy”, i.e., conditionally free of a list of known shrimp pathogens of the office of international epizootics (OIE), but not necessarily resistant and/or tolerant to any of the pathogens [3]. The first SPF P. vannamei was produced in Hawaii by the breeding program of the United States Marine Shrimp Farming Program (USMSFP) consortium and was maintained at the Oceanic Institute in Hawaii, USA [1,2]. Recently, the shrimp genome from the Kona line of the USMSFP was partially sequenced for a total length of ~470 Mb [1], from which numerous transposable elements, integrated viruses, and simple sequence repeats (SSRs) have been categorized [4] and deposited in Repbase [5]. Kona line is also known as research line, high-growth line, and/or Taura Syndrome Virus (TSV)-susceptible line, and was distributed to private commercial breeding companies [1]. In parallel, the genome of a male P. vannamei farmed in China (breed Kehai No. 1) was completely sequenced and assembled to be 1.66 Gb in size [6]. Although the expected genome size of P. vannamei ranges from 2.45 to 2.89 Gb [1], this 1.66 Gb scaffold sequence, in which 25,596 protein-coding genes were identified, would allow researchers to (a) complete a continuous whole-genome assembly of this highly complex species that contains the highest percentage of SSRs than any other species sequenced so far [1,6], (b) perform more basic epidemiology and evolutionary biology research, and (c) develop treatments and diagnostics tools for diseases of bacterial [1,7] and viral origin [8,9,10].
White spot disease (WSD) is the most devastating infectious shrimp disease. Infected shrimps are characterized by white spots (calcified deposits) on the exoskeleton. The first reported appearances of WSD in penaeid shrimp occurred in China (Fujian) in 1992 [11] and spread globally [10,12,13,14,15] to Taiwan, Korea, and Japan (1993), South East Asian countries (1996), United States (Texas and SC in 1995), India (1998), Latin America (1999), Madagascar, Mozambique and Saudi Arabia (2010–2012), and Australia (2016). The cause of WSD is large, enveloped dsDNA virus called white spot syndrome virus (WSSV) [16,17,18] that infects over 90 arthropod species naturally or experimentally [17,19], such as crayfishes, lobsters, crabs, and others. So far, 14 complete WSSV genomes of different isolates have been stored in GenBank, ranging between 280 Kb and 309 Kb in size, and are predicted to have ~180 open reading frames (ORFs) of 50 amino acids or above [16,18]. Different WSSV genomes share >95.22% overall sequence identity and could cluster in three or more phylogenetic groups [20,21]. In the Genbank database, many shrimp expressed sequence tags (ESTs) have been found showing homology to WSSV, especially when ESTs are from the SPF P. vannamei of the USMSFP breeding program from Hawaii [1]. WSSV fragments have been reported endogenized or integrated into an SPF stock of giant tiger shrimp (Penaeus monodon) from Thailand [22], showing Mendelian inheritance [23]. A recent study in Kuruma shrimp, Penaeus (Marsupenaeus) japonicus, illustrated that the entry of WSSV into the host cell is via the endocytosis pathway, triggered by the interaction of virion and a transmembrane immunoglobulin receptor, designated as MjpIgR [24]. So far, progress has been made in developing WSSV-resistant P. vannamei lines [25,26], but a lot more work remains ahead to achieve the stabilization of the resistance.
WSSV has long been regarded as the lone virus (type species) of the genus Whispovirus, which is the only genus of the family Nimaviridae [18]. However, this notion is changing with the recent discovery of diverse endogenous WSSV-like nimaviruses [27,28,29,30]. In some crustacean genomes, such as P. monodon (Pm), even two different types of endogenous nimaviruses can be distinguished [28]. The genome scaffolds of these endogenous nimaviruses vary in length from ~190 Kb to ~230 Kb, but none is considered a complete virus genome [28]. According to the phylogeny reported by Kawato et al. [28], family Nimaviridae currently consists of seven major phylogenetic groups (or genus, if diversity qualified), and different groups share less than 60% DNA sequence identity to each other [28]. The representative viruses of the seven groups are WSSV, Chionoecetes opilio bacilliform virus (CoBV), and the five endogenous nimaviruses from Penaeus (Marsupenaeus) japonicus (Mj), Penaeus monodon, Hemigrapsus takanoi, Metapenaeus ensis, Sesarmops intermedium, respectively (Table 1). Comparative analysis showed that 39 WSSV genes could be termed as ancestral/core nimavirus genes since their orthologs were ubiquitously (core) or widely (ancestral) present in the seven Nimaviridae lineages, particularly in the Mj nimavirus, which belongs to the most distant group (Mj-group) to WSSV [28]. These 39 genes include envelope proteins, capsid proteins, DNA polymerase, protein kinase, and some other hypothetical or unknown proteins. In other words, these ancestral/core genes (families) are rarely lost in the course of evolution [31].
From the ~470 Mb genome of the first SPF P. vannamei [1], we previously reconstructed a 279,384 bp long consensus sequence, designated as DNAV-1_LVa, to represent the complete genome of a WSSV-like virus [29,30]. In Repbase [5], DNAV-1_LVa is stored as seven smaller segments (entries): DNAV-1a_LVa to DNAV-1g_LVa. We reported here an updated version of this WSSV-like nimavirus, reconstructed from the high-quality sequence data of P. vannamei Kehai No. 1 genome [6]. This new consensus was designated as Nimav-1_LVa (279,905 bp) to emphasize its upgraded quality over DNAV-1_LVa. With about 65–74% sequence identity to the Mj endogenous nimavirus, Nimav-1_LVa clearly belonged to the Mj-group. In Nimav-1_LVa, 117 protein-coding genes were predicted, including four genes newly demonstrated as nimavirus ancestral/core genes. In addition, four other Nimav-1_LVa genes might be captured host genes for their regulatory roles in the host-pathogen interactions and/or immune response. This complete genome of Nimav-1_LVa might provide a useful source to aid in our understanding of the evolution of virus family Nimaviridae.

2. Materials and Methods

2.1. Nimav-1_LVa Virus Consensus Reconstruction

The process of reconstructing the consensus of various repetitive families have been described elsewhere [5]. Briefly, RepeatModeler [32] tool was used to initially identify “pre-consensus” sequences in the genome. These “pre-consensus” sequences were used by BlastN to bait out top hit sequences in the genome, from which the consensus sequences were reconstructed again. To extend to the complete length of a given family, a stepwise extension in both directions was performed until the sign of termini appears. The consensus of Nimav-1_LVa is provided in Supplementary File S1.

2.2. Viral Gene Prediction and Visualization

Nimav-1_LVa genes or ORFs were predicted in three steps. First, ORFs with 70 codons or above were predicted. ORFs completely overlapped by other larger ORFs or that largely derived from simple sequences or tandem repeats were discarded. The tandem repeat region was predicted by Tandem Repeat Finder [33] (TRF, Version 4.09) with default parameters. Second, regions consisting of multiple adjacent short ORFs in the same direction were subjected to online FGENESH [34] prediction to check the possibility of exon-containing genes. We chose Apis dorsata (giant honey bee) as the species parameter for FGENESH since the predicted proteins proved more correct than using some other species. Lastly, to further reduce the error in gene prediction, the predicted proteins were subjected to comparative TblastN or BlastP analyses against either the Nimav-1_LVa or the other nimaviruses. By this approach, we corrected a few frameshifts caused by ambiguity in short tandem repeats. Some obvious duplicated partial gene fragments were also discarded. The 117 protein sequences of Nimav-1_LVa are provided in Supplementary File S1. Multiple sequence alignment (MSA) was performed by an online MAFFT server [35] and was visualized in Jalview [36].

2.3. Homology Searches

Protein homology searching (TblastN or BlastP) was performed locally with the Censor tool [37] implemented with Wu-blast (version 2.0) search engine. Protein database searching was conducted by BlastP or PSI-Blast (Position-Specific Iterated Blast) at NCBI (https://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE=Proteins). HMMER3 [38] software was used to detect more distant viral proteins. MSA alignment was constructed using online MAFFT [35], version 7.423, and HMM (hidden Markov models) profile generated were used in HMMSEARCH in the HMMER3 suite.

2.4. Dataset

Nimavirus genomes or assemblies used in this paper for comparative analysis included 3 WSSV genomes (AF332093.3, WSSV-CN; AF369029.2, WSSV-TH; and KT995472.1, WSSV-CN01), Metopaulias depressus (Md, KR820240 to KR820242), and the other 6 genomes listed in Table 1. Except for the 3 WSSV genomes, all other nimaviruses genomes were incomplete. The whole-genome sequences (WGS) of Penaeus monodon isolate Shenzhen (NIUS000000000) and Marsupenaeus (Penaeus) japonicus isolate Guangxi (NIUR010000000) were downloaded from GenBank.

3. Results

3.1. Building the Consensus of Nimav-1_LVa

Using the PacBio sequencing method, we previously conducted a small-scale genome sequencing project on the SPF P. vannamei Kona line of the USMSFP [1]. Around 470 Mb sequences were randomly obtained from the genome. From this data, a 279,384 bp long WSSV-like consensus sequence was reconstructed and was deposited in Repbase [5] under the name DNAV-1_LVa [29]. Due to the high error rate of PacBio sequencing, and the low genome coverage of the data, the sequence quality of DNAV-1_LVa proved prohibitive for a thorough analysis. In this study, we reconstructed this DNAV-1_LVa-like consensus using the high-quality genome sequences of P. vannamei breed Kehai No. 1 variety (GenBank assembly No. ASM378908v1) that were generated by both PacBio and Illumina platforms. We designated the new consensus with a different name: Nimav-1_LVa, to reflect its being a nimavirus and emphasize its superior sequence quality to the original DNAV-1_LVa. Nimav-1_LVa was 279,905 bp long, ~98% identical to DNAV-1_LVa sequence, and showed the same overall structure, but length variations were observed in some tandem repeat regions. The sequence of Nimav-1_LVa is provided in Supplementary File S1.
In the Nimav-1_LVa sequence, except for a ~1.8 Kb region (184,126 to 185,979 nt) and its immediate ~100 bp flanking sequences, the whole Nimav-1_LVa consensus was well-supported by at least three long genomic sequences from different loci (Figure 1A), all >98% identical to the consensus. In the current shrimp Kehai No. 1 genome assembly, this 1.8 Kb sequence occurred only in one contig NW_020871279.1. In another contig NW_020871249.1 from the same genomic locus, this 1.8 Kb region was substituted by a 413-bp unsequenced polyN tract (491,007–491,419 nt). Luckily, this 1.8 Kb region was located within the coding region of the gene g187 (Figure 1A), which encoded in its single, long ORF a 4332 AA protein (187p), showing 56% identity over the whole length to a wsv343-like protein BBD20111.1 (4287 AA) encoded in Mj nimavirus. Thus, this poorly-supported 1.8 Kb region would not seriously affect our subsequent analysis.
In the current 1.66 Gb genome assembly of the shrimp breed Kehai No. 1, a total of 3335 Kb sequences was found to be derived from Nimav-1_LVa: >95% identity to the consensus, and 80% of these sequences showed >98% identity to the consensus (Supplementary Table S1). These data indicated that at least 12 copies (3335/279 = 11.9) of Nimav-1_LVa were integrated into the shrimp genome during the relatively recent past. Among the available endogenous nimaviruses assemblies, M. japonicus (Mj) endogenous nimavirus (BFCD01000001 and AP010878) [28] was the closest relative to Nimav-1_LVa. They shared a 65–74% nucleic acid sequence identity to each other, and both featured low GC-content: 34.6% in Nimav-1_LVa and 32.9% in the Mj endogenous nimavirus. By contrast, all other nimaviruses genomes exhibited significantly high GC-content: 45% in the Pm endogenous nimavirus, 47% in the Ht endogenous nimavirus, 45.4% in the Me endogenous nimavirus, 44.2% in the Si endogenous nimavirus, 44.1% in the Md endogenous nimavirus, 41% in WSSV, and 40% in the Chionoecetes opilio bacilliform virus.

3.2. The Integration Site of Nimav-1_LVa

As shown in Figure 1A, the integration site on the circular virus genome was located between gene g002 and gene g276. Hereafter, the orientation of the linear Nimav-1_LVa was defined as in Figure 1A. In the assembly of shrimp breed Kehai No. 1, a total of 21 genomic loci were juxtaposed with the termini of Nimav-1_LVa: 10 loci at the 5′-end and 11 at the 3′-end. The number of these termini (21) accorded well with the number of the integrated Nimav-1_LVa copies (12), which was deduced from the total length of the viral sequences. Thus, this data implied that the site between g002 and g276 was the only possible recombination site on the virus genome. Moreover, we found all these Nimav-1_LVa copies were flanked by a long tract of (TAACC/GGTTA)n microsatellites (Figure 1B), which were reported as the telomeric sequence in P. vannamei [6,39]. Notably, the (TAACC/GGTTA)n microsatellite region was internally absent in the Nimav-1_LVa consensus, strongly indicating that the integration between Nimav-1_LVa and the host genome happens preferentially, if not exclusively, between one specific virus site and the telomeric microsatellite repeats. However, one caveat must be noted that the Nimav-1_LVa might also integrate into non-telomeric regions, but these viruses had been subsequently eliminated during evolution.
The precise boundary between integrated Nimav-1_LVa and shrimp genome is undetermined yet. The termini of this linear Nimav-1_LVa, 5′-CAG, and ACC-3′, as illustrated in Figure 1, were approximate and tentative. No obvious target site duplications (TSDs) were observed flanking Nimav-LVa. Little is known about the molecular mechanism underlying such integration because we cannot exclude the possibility that circular Nimav-1_LVa could harbor one short tract of variable length of (TAACC/GGTTA)n microsatellites somewhere between g002 and g276. If so, the integration of Nimav-1_LVa would be through the homology-based recombination, which is adopted in the telomere-specific integration of human herpesvirus HHV-6A, HHV-6B [40,41,42], and chicken lymphotropic alphaherpesvirus Marek’s disease virus (MDV) [43,44].

3.3. Nimav-1_LVa Sequences in Other Penaeid Shrimps

To test if Nimav-1_LVa is present in other shrimp species, we blasted the Nimav-1_LVa sequence against the two available whole-genome sequences (WGS) of P. monodon isolate Shenzhen (NIUS000000000, 1.4 Gb) and M. japonicus isolate Guangxi (NIUR010000000, 1.6 Gb). In addition, we performed two similar searches using the Mj-type and the Pm-type endogenous nimaviruses (Table 1). As a result, a substantial amount of homologous sequences, either identical (>99%) or highly homologous (>88%), was detected in the two genomes. The detected homologous viral sequences seemed to scatter throughout the whole virus genome; in some specific locations, even three different versions of viral sequences could be detected. The cumulative lengths of the homologous sequences in each search are listed in Table 2. The varying amounts of the integrated viral sequences might be accounted for by the different magnitudes of infection and different levels of host tolerances to the integration of different viruses. These data suggested that at least three types of nimavirus sequences were integrated into the two shrimp isolates from P. monodon and M. japonicus. The first virus type was obviously the Nimav-1_LVa type (>99% identity). The other two types, given the fairly high sequence identity (>88% or >91%) to the query sequences, could be called Pm-like and Mj-like (Table 2). Putting together, the identification of almost identical Nimav-1_LVa sequence in three species, P. monodon, M. japonicas, and P. vannamei (previous section), highly suggested that Nimav-1_LVa virus or its closest variant is or was a potentially transmissible virus in nature.

3.4. Genes Encoded in Nimav-1_LVa

In the Nimav-1_LVa sequence, a total of 117 protein-coding genes were predicted (Table 3 and Supplementary File S1 for the protein sequences), each with 70 codons or longer. Ninety-seven of the genes were supported by homologous proteins, mostly from other nimaviruses (Table 3). The remaining 20 genes were hypothetical, generally short, with the exception of only two genes (g153 and g234) coding for proteins over 400 residues.
Twenty-eight out of the 117 genes were found homologous to at least one other Nimav-1_LVa gene. Based on their mutual similarity, these genes were clustered into six “paralog families” (PF): PF1 (g002, g006, g008, g009, g010, g011, g141, g143, g146, g161), PF2 (g003, g012, g017, g030, g047, g049), PF3 (g050, g051, g052, g257), PF4 (g172, g173, g276), PF5 (g056, g269, g271), and PF6 (g034 and g139). Notably, it was possible that in some gene families, some shorter genes were just pseudogenes or gene fragments due to partial duplication or to the errors in gene prediction, such as the g002 gene in the PF1 family, the g030 in the PF2 family (Table 3). In the PF3 family, g052 was much longer than the rest of the members, and the homologous region was limited to the N-terminal half region of g052. Nevertheless, for the purposes of documentation, these genes are still enlisted in Table 3.
PF1 was the largest gene family with a total of 10 family members, reflecting its critical roles for the virus. However, the roles of PF1 families were largely unknown: no significant conserved domain was found. In the PF2 family, all six members contained one to three BIR domains (baculoviral inhibition of apoptosis protein repeat, cd00022) (Table 3). In addition, a carboxyl-terminal zinc-finger domain of the RING-HC (C3HC4-type) subclass was present in four PF2 members. The four zinc-finger domains belonged to two subtypes: RING-HC_BIRC2_3_7 (cd16713) in g012 and g017, and RING-HC_BIRC4_8 (cd16714) in g047 and g049 (Table 3). The BIR and RING domain arrangement is also found in a number of well-studied inhibitors of apoptosis (IAP) proteins [45]. As indicated by the acronym BIRC (baculoviral IAP repeat-containing protein) in the zinc-finger subtype name, the other IAP proteins include BIRC2 (also known as c-IAP1, cellular inhibitor of apoptosis protein 1), BIRC3 (c-IAP2), BIRC7 (Livin), BIRC4 (XIAP, X-linked inhibitor of apoptosis protein), and BIRC8 (ILP-2, IAP-like protein 2). It is known that these IAP proteins act as ubiquitin E3 ligases to mediate the ubiquitination of the substrates involved in apoptosis, nuclear factor-kappaB (NF-kappaB) signaling, and oncogenesis [46]. BIRC3 influences ubiquitin-dependent pathways that modulate innate immune signaling by activation of NF-kappaB, and BIRC4, 7, 8 are all implicated in the effect of anti-apoptosis [45,46,47].
One striking feature of Nimav-1_LVa was that exon-intron structures are found in nine genes, including five PF2 family genes (g003, g012, g017, g047, and g049), g022, g046 (CHH), g155 (innexin), and g158 (BAX inhibitor 1-like) (Table 3). While the exons in g022 have yet to be confirmed by other independent resources, the existence of exons seemed to be positively confirmed for the other eight genes by their homologs from GenBank. Notably, no WSSV gene is found to be spliced so far [18].
It has been known that 39 WSSV genes and their homologs are commonly present in nimaviruses, in particular, Mj-type nimavirus and WSSV [28], and are so-called nimavirus ancestral/core genes. However, because of the incompleteness of the current scaffold of the Mj-type nimavirus genome (~220 Kb, Table 1), this ancestral/core gene set could be incomplete. Given the close relationship between Nimav-1_LVa and M. japonicus (Mj) nimavirus, both under the Mj-group [28], we examined the possible homologous genes between Nimav-1_LVa and WSSV, aiming at additional Nimav-1_LVa genes that could be included into the ancestral/core gene set.
As a result, 44 Nimav-1_LVa genes were found homologous to 43 WSSV genes. These paired homologous genes are indicated with “wsvNNN-like” in the “Comment” column in Table 3. The WSSV genes here referred to those annotated for the genome of the WSSV CN strain (AF332093.3). Of the 44 Nimav-1_LVa genes, 39 genes proved to be the orthologs of the known 39 ancestral/core genes [28], the other five newly-included genes were g140 (wsv112-like), g217 (wsv308-like), g225 (wsv226-like), g034 (wsv206-like), and g139 (wsv206-like). The last two genes were two paralogs belonging to the PF6 gene family. These five newly identified proteins showed marginal similarity (<30% amino acids identity), or no detectable similarity, to their WSSV counterparts by BlastP; however, their orthology was well-supported in the multiple sequences alignment (MSA) (Figure 2 and Supplementary Figures S1–S3). For example, although the g217-encoded protein (217p) showed no detectable similarity with the wsv308 protein, also called VP51, a nucleocapsid protein [48], it did show trace similarity (<18% identity) with another S. intermedium (Si) nimavirus protein GBG35584.1, which was annotated as a wsv308-like protein [28]. When 217p, GBG35584.1, wsv308, and some other wsv308-like proteins were included in the multiple sequence alignment, the orthology was clearly revealed by the many highly-conserved residues/blocks throughout the whole length (Figure 2). Similarly, we concluded that g140 was a wsv112-like dUTPase enzyme (Supplementary Figure S1); g225 was wsv226-like (Supplementary Figure S2); and the two PF6 members, g034 and g139, as well as their homologs in Mj nimavirus (GBG35398.1 and GBG35402.1), were indeed homologs of wsv206 (Supplementary Figure S3). Admittedly, Kawato et al. did acknowledge that GBG35398.1 and GBG35402.1 were likely homologs of wsv206, but this uncertainty was unsolved in the paper [28]. Notably, the wsv206-like protein GBG35398.1 contains a macro domain (cl00019, E-Value = 3.00076 × 10−5), which is a high-affinity ADP-ribose binding module.
Besides the 44 ancestral/core genes, eight Nimav-1_LVa genes were found with equivalents in the non-WSSV and non-Mj-group nimaviruses. The absence of WSSV homologs for these genes could be explained by the gene loss in WSSV. The eight genes included g115, g206, and the six inhibitors of apoptosis from the PF2 family. The counterparts of g115 (SCV_095, GAV93215.1) and g206 (SCV_028, GAV93152.1) were encoded in CoBV. The BIR domain in the PF2 family members was absent in WSSV proteins, but it was encoded in one Md nimavirus protein (AKS10635.1), one CoBV protein (GAV93213.1), and one Ht nimavirus protein (GBG35369.1).
The remaining 45 homolog-supported genes could only find their homologs from the Mj-type nimavirus or the non-redundant (nr) protein database of NCBI. These 45 genes and the 20 hypothetical genes were tentatively called “Mj-group-specific” genes (indicated in bold font, Table 3). Theoretically, these “Mj-group-specific” genes comprised three sections: (1) genes that were acquired in the common ancestor of the Mj-group after its split from other nimaviruses, (2) genes whose orthologs have been lost in the evolution of other nimaviruses, (3) genes underwent faster evolutionary rate, thus making it difficult to detect their homologs in other virus groups. Unless more nimavirus genomes are completely assembled, a lot of uncertainty remains in this area.

3.5. Nimav-1_LVa Genes Involved in Host-Pathogen Interaction

Although the molecular functions of a lot of Nimav-1_LVa proteins were unknown, a large number of genes/families seemed connected to roles in host-pathogen interaction and innate immune response. These genes/families included: (1) g103 (heat shock protein, Hsp70), (2) g118 (DnaJ, also called Hsp40), (3) g132 (ubiquitin), (4) the 6 IAPs of the PF2 family, (5) g171 (wsv267-like anti-apoptotic protein), (6) g046 (CHH), (7) g155 (innexin), (8) g158 (BAX inhibitor 1 like), and (9) g227 (semaphorin 1A like).
In the cases of the first four genes/families: g103 (heat shock protein, Hsp70), g118 (Hsp40), g132 (ubiquitin), and the six inhibitors of apoptosis of the PF2 gene family, their involvements in the host-pathogen interaction were well acknowledged. It is well known that apoptosis is a key immune process in the shrimp response to the WSSV invasion [49]. Various heat shock proteins and ubiquitin are also well documented for their functions in host-virus interaction. For example, extracellular Hsp70s have been demonstrated with a number of cytoprotective and immunomodulatory functions, such as stimulators of innate immune responses in the human system [50]. A heat shock protein 70 (Hsc70) was found to inhibit apoptosis induced by WSSV infection in hemocyte shrimp cells [51]. In shrimp P. vannamei, the expression of the Hsp70 gene was also reported altered after the WSSV infection [52,53], and intramuscularly injection of Hsp70 protein could significantly reduce mortality after WSSV infection [54]. As for the Hsp40 gene, its responses to viral infection have been reported in halibut Paralichthys olivaceus [55]. In another study using the WSSV challenged tiger shrimp P. monodon, ubiquitin gene was down-regulated during the first 12 hours, but reversed in the following period [56]. Lastly, a study in red swamp crayfish, Procambarus clarkii, listed DnaJ (Hsp40), ubiquitin, and innexin (detailed below) proteins for their possible anti-WSSV roles [57].
The g171 gene is the ortholog of WSSV wsv267. The wsv267 protein, also known as anti-apoptotic protein 4 (APP4) [18], has been shown capable of inhibiting apoptosis by binding with the p20 domain of P. monodon caspase (PmCasp) protein, which can induce apoptosis [58]. There are four anti-apoptotic WSSV proteins identified (APP1 to APP4) [18], but only APP4 (wsv267) protein could find its homolog in Nimav-1_LVa (171p).
In the cases of the last four genes, g046 (CHH), g155 (innexin), g158 (BAX inhibitor 1 like), g227 (semaphorin 1A like), their roles in virus infection was not obvious. The Nimav-1_LVa g046 gene encodes a 123 AA protein (ROT61446), which is 59% homologous to the crustacean hyperglycemic hormone (CHH) like protein encoded by gene KJ660843 [59]. Both proteins are encoded in three coding-exons and are co-classified in the CHH group named as type-Ib [6]. Notably, there are around 21 type-Ib CHH genes in the P. vannamei genome [6], and 13 of them seem to be accounted for by this viral g046. In addition to the manifold functions in blood glucose regulation, control of the molt cycle, osmoregulation, etc. [60,61], CHH peptides can increase the survival rate of bacteria-infected shrimp [62] and might be involved in hemocyte intracellular signaling pathways to regulate exocytosis and immune response [63].
Gene g155 encodes a membrane protein innexin (pfam00876), which is functionally analogous to the vertebrate connexin in the cell gap junction [64,65]. There are 21 innexin genes in P. vannamei [6], and some of them are due to the multiplication of the viral genome. Innexin is involved in immune response and cell apoptosis [65,66], probably by regulating the closure of the gap channel to reduce the neighboring cellular apoptosis [67,68]. In a study in red swamp crayfish, the innexin gene has been listed as a candidate anti-WSSV gene [57]. Notably, the g155 gene contains four exons, and its homolog is found in Mj nimavirus (BFCD01000001.1). Interestingly, innexin-like genes were also reported in a number of parasitoid viruses from the Ichnovirus genus in the Polydnaviridae family, such as Campoletis sonorensis ichnovirus (CsIV) and Hyposoter didymator ichnovirus (HdIV) and Hyposoter fugitivus ichnovirus (HfIV) [69,70], where innexins are termed vinnexins but are viewed as orthologs of host innexins acquired by the viruses since they show strong sequence similarity to insect innexins [69,70]. However, unlike the Nimav-1_LVa encoded g155 (innexin), these vinnexin genes lack introns [71].
Gene g158 encodes a BAX inhibitor (BI)-1-like protein (cd10430), which is located primarily in the membranes of the endoplasmic reticulum (ER) and suppresses ER stress-induced apoptosis [72,73]. BI-1 is a conserved suppressor of programmed cell death in animals and plants [74]. Gene g158 also contains exons, but its homolog is not found in the Mj nimavirus genome, probably due to the incompleteness of the current Mj nimavirus assembly. Interestingly, the genomic loci of g158 are next to that of g155 (Table 3).
Gene g227 encodes a trans-membrane semaphorin 1A (Sema1A)-like protein (Class 1 semaphorins). While semaphorins generally act as signaling ligands that regulate the shape and motility of cells, their roles in immunity have been noticed [75,76]. Membrane-associated semaphorins play a role in regulating immune homeostasis in mouse models [77], according to which CD72 (Cluster of Differentiation 72) and TIM-2 (T cell immunoglobulin and mucin domain protein 2) ligands functionally interact with semaphorin Sema4D and Sema4A, respectively [78]. Although direct evidence supporting the involvement of Sema1A in immune regulation still lack in invertebrate system, the finding of Sema1A-like protein encoded in a virus-like Nimav-1_LVa is probably not a simple coincidence, especially considering that the other three cellular-like genes, g046 (CHH), g155 (innexin), g158 (BAX inhibitor 1 like), are all present in Nimav-1_LVa, likely involved in pathogen-host interactions. Therefore, we hypothesized that g227 (semaphorin 1A like) could also have a potential role in immune regulation.

4. Discussion

4.1. Nimav-1_LVa Consensus Sequence

We reported reconstructing a 279 Kb long, high-quality consensus sequence from the genome of P. vannamei breed Kehai No. 1 variety farmed in China [6], to represent the complete genome of an endogenous nimavirus, Nimav-1_LVa. This consensus sequence showed a ~98% sequence identity to our previous DNAV-1_LVa consensus reconstructed from the first SPF P. vannamei domesticated in the US [1,30,39]. It was reported that Kehai No. 1 was derived from Hawaii, USA, as well [79]. However, it remains to be determined if the original Kehai No. 1 stocks were purchased from a private American shrimp breeding company (High Health Aquaculture, HHA) based in Kona, Hawaii, or from the original SPF Kona line of the breeding program of the USMSFP Consortium, which was funded by the USDA-CSREES and maintained at The Oceanic Institute in Honolulu, Oahu, Hawaii until 2009 [1]. This 279 Kb of Nimav-1_LVa is very close to the known genome size range of WSSV viruses (280–309 Kb) but much larger than the current scaffold assemblies of all other endogenous nimaviruses (Table 1). This is probably because only those contigs bearing homology to WSSV sequences are considered [28]. The successful reconstruction of Nimav-1_LVa is largely attributed to two factors that a large quantity of Nimav-1_LVa remnant is present in the shrimp genome and that the integration of Nimav-1_LVa is a relatively recent event. Given the high sequence similarity among the large Nimav-1_LVa fragments in the genome, the question arises of if this Nimav-1_LVa, coupled with the highly abundant (>23.93%) SSRs [6], could cause any assembling problem, and to what extent. With hindsight, the current 1.6 Gb Kehai No. 1 assembly is quite apart from the expected 2.45 to 2.89 Gb of P. vannamei [1]. Considering the complexity of the shrimp genome, it would be good to have another genome assembly from a different P. vannamei stock available in the future.
Compared to the sequence from individual loci, consensus sequence possesses the merit of restoring the viral sequence to its early state when the integration first happened, thus minimizing the adverse effects caused by numerous sequence mutations. Gene prediction made on the consensus would be more accurate. For instance, in the Mj nimavirus sequence (BFCD01000001.1), the corresponding coding region of Nimav-1_LVa gene g130 (1145 AA) is interrupted by a frameshift mutation.
Sequence analysis in the shrimp genome indicates Nimav-1_LVa viruses integrate exclusively into telomeric microsatellite (TAACC/GGTTA)n [6,39]. The telomere-specific integration pattern could be partly explained by negative selection on those integrations in the non-dormant regions because, as demonstrated in human herpesvirus 6A and 6B (HHV-6A and HHV-6B) [41], insertion into telomere could help viruses to maintain a state of latency, although reversible. However, the molecular mechanism underlying such a site-specific integration cannot be excluded and is worthwhile for future investigations. In the scope of DNA virus, it is known that HHV-6A and HHV-6B and the chicken lymphotropic alphaherpesvirus Marek’s disease virus (MDV) can insert specifically into telomere site via the homology-dependent recombination, where the linear double-stranded DNA viruses have variable length of telomere-like repeat regions at either genome end [40,42,43,44]. As shown in Figure 1B, it remains to be determined if the circular Nimav-1_LVa genome does harbor one or two tracts of telomeric pentanucleotide (TAACC/GGTTA)n.

4.2. Endogenized or Free-Living Virus

Although a number of endogenous nimaviruses have been revealed in the genomes of various crustacean species [27,28,80], one compelling question remains, that is whether these endogenous virus sequences are passive relics of some old nimaviruses (“fossilized”), or recent inhabitants in these eukaryotic genomes, from some unidentified free-living viruses, and still possibly possess the capability to proliferate and transmit to different genomes/species under certain circumstances. Currently, at least two cases of endogenous nimaviruses suggest the latter scenario. The first one is the detection of almost identical Nimav-1_LVa sequences in the genomes of three shrimp species: P. vannamei Kehai No. 1, P. monodon isolate Shenzhen (NIUS000000000), and M. japonicus isolate Guangxi (NIUR010000000), in spite of the fact that much less Nimav-1_LVa is in the latter two shrimp genomes. The second line of evidence is the identification of almost identical (99%) Mj-type nimavirus sequence in M. japonicus and M. latisulcatus [28]. In light of the unexpected large diversity of virome observed in a single species of marine invertebrate (P. monodon) from different geographic locations [81], these data suggest the two nimaviruses or their closest relatives may exist as free-living viruses in nature, except that they may be not so virulent as WSSV (more discussed below).
It is worth noting that endogenous and free-living states are two equally essential stages/phases in the life cycle of some parasitoid viruses [82], such as the polydnavirus Campoletis sonorensis ichnovirus (CsIV) [69,70]. The genomes of these viruses, comprising multiple endogenous DNA segments, are endogenously integrated into the genome of the parasitoid wasp (Campoletis sonorensis) [69,82], which is parasitic on a host (usually lepidopteran) larva. The virus particles are only replicated (produced) in specific cell types in the female wasp’s reproductive organs and are injected, together with one or more eggs, into the lepidopteran host. In such a system, viral genes are essentially inhibitors of the wasp’s host’s immune system, preventing it from killing the wasp’s injected egg and the immature wasp, until the ultimate death of the parasitized host. This mutualism association or coevolution of virus and parasitoid insect was dated over at least 64 million years [83].

4.3. Nimav-1_LVa Encoded Proteins

A total of 117 protein genes, including 97 homology-based and 20 hypothetical genes, have been predicted in the Nimav-1_LVa genome if the criterion is set to 70 amino acids long. This number of genes is presumably very close to the actual gene number in Nimav-1_LVa because only 16% of the virus genome is intergenic region when long microsatellite regions are excluded. These 117 genes can be generally divided into three sections according to their evolutionary status: (1) 44 nimavirus ancestral/core genes, which are shared in Nimav-1_LVa and WSSV, (2) eight genes whose homologs are found in non-WSSV and non-Mj-group nimaviruses, (3) 65 genes whose homologs seemingly only exist in the Mj-group nimaviruses or in the eukaryotic host genome. This division is just for the purpose of expedience because some genuine homologs are inevitably overlooked due to the vast sequence divergence, especially for those smaller genes. Notably, it is possible that in the intergenic region, still exist some smaller protein genes or viral miRNA genes [84,85].
Compared with WSSV, one prominent feature of Nimav-1_LVa is that it encodes more than a dozen genes involved in the critical processes in pathogen-host interactions, such as immune responses and/or apoptosis inhibition [86]. These genes/families include g103 (Hsp70), g118 (DnaJ, also called Hsp40), g132 (ubiquitin), g046 (CHH), g155 (innexin), g158 (BAX inhibitor 1 like), g227 (semaphorin 1A like), g171 (anti-apoptotic protein), and the six IAPs from the PF2 gene family. We hypothesized that four genes, g046 (CHH), g155 (innexin), g158 (BAX inhibitor 1 like), and g227 (semaphorin 1A like), were likely derived from cellular genes, but had been harnessed by Nimav-1_LVa for its own advantage. This notion was based on the following observations. First, intronic genes are normally very rare in viruses, and all WSSV genes are non-splicing; however, the exon-intron structure is found in g046 (CHH), g155 (innexin), and g158 (BAX inhibitor 1 like) in Nimav-1_LVa. Second, to our knowledge, CHH (g046) gene has never been reported in a virus genome before. Despite being reported in a few parasitoid viruses, innexin/vinnexin (g155) genes are still considered acquired host genes [69,70]. The occurrence of innexin/vinnexin in both Nimav-1_LVa and polydnavirus Campoletis sonorensis ichnovirus (CsIV) is likely the result of convergent evolution, suggesting Nimav-1_LVa virus, to some extent, may not be a virulent virus. Third, all the genes had been reported, or suggested, being involved in immune regulation after virus infection. Lastly, g155 (innexin), g158 (BAX inhibitor 1 like), and g227 (semaphorin 1A like) are all membrane protein genes. In summary, to get a comprehensive perspective on the evolution in Nimaviridae, our preliminary results highlight the need for completed assemblies in more endogenous nimaviruses.

5. Conclusions

A ~279 Kb contiguous consensus sequence, designated as Nimav-1_LVa, was successfully reconstructed from the genome sequence of the whiteleg shrimp Penaeus vannamei breed Kehai No. 1. The consensus putatively represented the complete genome of a nimavirus that had been endogenized in the shrimp genome. Out of 117 protein genes, Nimav-1_LVa encoded a dozen of genes involved in the host-pathogen interactions, albeit some were acquired host genes. The data suggested Nimav-1_LVa virus might take a different strategy than WSSV, aiming at a long-term or benign relationship with the host. The genome of Nimav-1_LVa could facilitate a better understanding of evolution in virus family Nimaviridae and could also be applicable in the shrimp breeding, traceability of farmed shrimp, WSSV diagnosis, and treatment of WSD [26,87].

Supplementary Materials

The following are available online at https://0-www-mdpi-com.brum.beds.ac.uk/2073-4425/11/1/94/s1, Figure S1: Conserved blocks in the alignment of 140p, wsv112 (AAL33116.1), and other homologs, Figure S2: Conserved blocks in the alignment of 225p, wsv226 (AIX03672.1), and other homologs, Figure S3: Conserved blocks in the alignment of 034p, 139p, wsv206 (AAL33210.1), and other homologs, File S1: Nimav-1_LVa consensus sequence and 117 encoded protein sequences, Table S1: Shrimp (Kehai No. 1) genomic fragments derived from Nimav-1_LVa.

Author Contributions

Conceptualization, W.B.; methodology, W.B.; formal analysis, W.B.; resources, A.A.-W.; data curation, W.B.; writing—original draft preparation, W.B.; writing—review and editing, W.B., A.A.-W., and K.F.J.T.; project administration, A.A.-W.; funding acquisition, A.A.-W. All authors have read and agreed to the published version of the manuscript.

Funding

This research is part of The Shrimp Epigenome (ShrimpENCODE) Project initially funded by the Foundation for Conservation of Biodiversity (FUCOBI) of Quito, Ecuador, and Environmental Genomics Inc. of Southborough, MA USA. Funding for this research was provided by the U.S. Marine Shrimp Farming Consortium, Cooperative State Research, Education, and Extension Service (CSREES), USDA, under Grant No. 2002-38808-01345. (A.A.-W. was a Technical Committee member 1992-2005, Tufts University). Partial funding for sequencing with PacBio technology was provided by the USDA NRSP-8 Aquaculture group (to A.A.-W., Environmental Genomics Inc.). The APC was funded by Environmental Genomics, Inc.

Acknowledgments

We thank Shaun Moss and the staff at The Oceanic Institute in Honolulu, HI for supplying the broodstock used in this study; Dawn Meehan at Tufts University Cummings School of Veterinary Medicine, Grafton, MA for assistance with collection of hemolymph and other tissues used for gDNA extraction; Robert Bogden, Quanzhou Tao, Suresh Iyer, Galina Mikhaylenko, Jon Wittendorp, Amy Mraz, and Evan Hart from Amplicon Express for their efforts to prepare HMW gDNA for BAC library construction that was ideal for making 20Kb PacBio SMRTCell libraries and running on the Pacific Bioscience RSII platform with P5 chemistry; and Emily Hatas, Steven Kujawa, Joan Wilson, and Karl Voss from Pacific Biosciences for their time assisting with the pilot genome sequencing of the first SPF P. vannamei domesticated in the United States by the USMSFP consortium.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Alcivar-Warren, A. The Shrimp Genome and Epigenome: A Review of Genome Sizes, Transposable Elements, Simple Sequence Repeats, Integrated Viruses, and Epigenetic Components of Penaeids. J. Shellfish Res. 2020, in press. [Google Scholar]
  2. Lightner, D.V. Biosecurity in shrimp farming: Pathogen exclusion through use of SPF stock and routine surveillance. J. World Aquac. Soc. 2005, 36, 229–248. [Google Scholar] [CrossRef]
  3. Alday-Sanz, V.; Brock, J.; Flegel, T.W.; McIntosh, R.; Bondad-Reantaso, M.G.; Salazar, M.; Subasinghe, R. Facts, truths and myths about SPF shrimp in Aquaculture. Rev. Aquac. 2018. [Google Scholar] [CrossRef]
  4. Bao, W.; Bogden, R.; Tao, Q.; Iyer, S.; Mikhaylenko, G.; Wittendorp, J.; Mraz, A.; Hart, E.; Hatas, E.; Kujawa, S.; et al. Transposable Elements, Simple Sequence Repeats, and Integrated Viruses in Specific Pathogen-Free (SPF) Shrimp, Penaeus (Litopenaeus) Vannamei, Domesticated by the Breeding Program of the US Marine Shrimp Farming Program (USMSFP). Genes 2020, in press. [Google Scholar]
  5. Bao, W.; Kojima, K.K.; Kohany, O. Repbase Update, a database of repetitive elements in eukaryotic genomes. Mob. DNA 2015, 6, 11. [Google Scholar] [CrossRef] [Green Version]
  6. Zhang, X.; Yuan, J.; Sun, Y.; Li, S.; Gao, Y.; Yu, Y.; Liu, C.; Wang, Q.; Lv, X.; Zhang, X.; et al. Penaeid shrimp genome provides insights into benthic adaptation and frequent molting. Nat. Commun. 2019, 10, 356. [Google Scholar] [CrossRef] [Green Version]
  7. Yang, Q.; Dong, X.; Xie, G.; Fu, S.; Zou, P.; Sun, J.; Wang, Y.; Huang, J. Comparative genomic analysis unravels the transmission pattern and intra-species divergence of acute hepatopancreatic necrosis disease (AHPND)-causing Vibrio parahaemolyticus strains. Mol. Genet. Genom. 2019, 294, 1007–1022. [Google Scholar] [CrossRef]
  8. Feng, S.Y.; Liang, G.F.; Xu, Z.S.; Li, A.F.; Du, J.X.; Song, G.N.; Ren, S.Y.; Yang, Y.L.; Jiang, G. Meta-analysis of antiviral protection of white spot syndrome virus vaccine to the shrimp. Fish Shellfish Immunol. 2018, 81, 260–265. [Google Scholar] [CrossRef] [Green Version]
  9. Oakey, J.; Smith, C.; Underwood, D.; Afsharnasab, M.; Alday-Sanz, V.; Dhar, A.; Sivakumar, S.; Sahul Hameed, A.S.; Beattie, K.; Crook, A. Global distribution of white spot syndrome virus genotypes determined using a novel genotyping assay. Arch. Virol. 2019, 164, 2061–2082. [Google Scholar] [CrossRef] [Green Version]
  10. Stentiford, G.D.; Lightner, D.V. Cases of white spot disease (WSD) in European shrimp farms. Aquaculture 2011, 319, 302–306. [Google Scholar] [CrossRef]
  11. Zhan, W.; Wang, Y.; Fryer, J.L.; Yu, K.; Fukuda, H.; Meng, Q. White spot syndrome virus infection of cultured shrimp in China. J. Aquat. Anim. Health 1998, 10, 405–410. [Google Scholar] [CrossRef]
  12. Knibb, W.; Le, C.; Katouli, M.; Bar, I.; Lloyd, C. Assessment of the origin of white spot syndrome virus DNA sequences in farmed Penaeus monodon in Australia. Aquaculture 2018, 494, 26–29. [Google Scholar] [CrossRef]
  13. Mohan, C.V.; Shankar, K.M.; Kulkarni, S.; Sudha, P.M. Histopathology of cultured shrimp showing gross signs of yellow head syndrome and white spot syndrome during 1994 Indian epizootics. Dis. Aquat. Organ. 1998, 34, 9–12. [Google Scholar] [CrossRef] [PubMed]
  14. Walker, P.J.; Mohan, C.V. Viral disease emergence in shrimp aquaculture: Origins, impact and the effectiveness of health management strategies. Rev. Aquac. 2009, 1, 125–154. [Google Scholar] [CrossRef]
  15. Tang, K.F.J.; Le Groumellec, M.; Lightner, D.V. Novel, closely related, white spot syndrome virus (WSSV) genotypes from Madagascar, Mozambique and the Kingdom of Saudi Arabia. Dis. Aquat. Org. 2013, 106, 1–6. [Google Scholar] [CrossRef] [PubMed]
  16. Van Hulten, M.C.; Witteveldt, J.; Peters, S.; Kloosterboer, N.; Tarchini, R.; Fiers, M.; Sandbrink, H.; Lankhorst, R.K.; Vlak, J.M. The white spot syndrome virus DNA genome sequence. Virology 2001, 286, 7–22. [Google Scholar] [CrossRef] [Green Version]
  17. Sánchez-Paz, A. White spot syndrome virus: An overview on an emergent concern. Vet. Res. 2010, 41, 43. [Google Scholar] [CrossRef] [Green Version]
  18. Wang, H.C.; Hirono, I.; Maningas, M.B.B.; Somboonwiwat, K.; Stentiford, G.; Ictv, R.C. ICTV Virus Taxonomy Profile: Nimaviridae. J. Gen. Virol. 2019, 100, 1053–1054. [Google Scholar] [CrossRef]
  19. Stentiford, G.D.; Bonami, J.R.; Alday-Sanz, V. A critical review of susceptibility of crustaceans to Taura syndrome, Yellowhead disease and White Spot Disease and implications of inclusion of these diseases in European legislation. Aquaculture 2009, 291, 1–17. [Google Scholar] [CrossRef]
  20. Jiang, L.; Xiao, J.; Liu, L.; Pan, Y.; Yan, S.; Wang, Y. Characterization and prevalence of a novel white spot syndrome viral genotype in naturally infected wild crayfish, Procambarus clarkii, in Shanghai, China. Virusdisease 2017, 28, 250–261. [Google Scholar] [CrossRef]
  21. Parrilla-Taylor, D.P.; Vibanco-Pérez, N.; Durán-Avelar, M.J.; Gomez-Gil, B.; Llera-Herrera, R.; Vázquez-Juárez, R. Molecular variability and genetic structure of white spot syndrome virus strains from northwest Mexico based on the analysis of genomes. FEMS Microbiol. Lett. 2018, 365. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Utari, H.B.; Soowannayan, C.; Flegel, T.W.; Whityachumnarnkul, B.; Kruatrachue, M. Variable RNA expression from recently acquired, endogenous viral elements (EVE) of white spot syndrome virus (WSSV) in shrimp. Dev. Comp. Immunol. 2017, 76, 370–379. [Google Scholar] [CrossRef] [PubMed]
  23. Taengchaiyaphum, S.; Srisala, J.; Bunphimpapha, P.; Supungul, P.; Tassanakajon, A.; Chaiyapechara, S.; Bowornpinyo, S.; Sritunyalucksana, K.; Flegel, T.W. Mendelian inheritance of endogenous viral elements (EVE) of white spot syndrome virus (WSSV) in shrimp. Dev. Comp. Immunol. 2019, 96, 144–149. [Google Scholar] [CrossRef] [PubMed]
  24. Niu, G.J.; Wang, S.; Xu, J.D.; Yang, M.C.; Sun, J.J.; He, Z.H.; Zhao, X.F.; Wang, J.X. The polymeric immunoglobulin receptor-like protein from Marsupenaeus japonicus is a receptor for white spot syndrome virus infection. PLoS Pathog. 2019, 15, e1007558. [Google Scholar] [CrossRef] [Green Version]
  25. Cuéllar-Anjel, J.; White-Noble, B.; Schofield, P.; Chamorro, R.; Lightner, D.V. Report of significant WSSV-resistance in the Pacific white shrimp, Litopenaeus vannamei, from a Panamanian breeding program. Aquaculture 2012, 368, 36–39. [Google Scholar] [CrossRef]
  26. Trang, T.T.; Hung, N.H.; Ninh, N.H.; Knibb, W.; Nguyen, N.H. Genetic Variation in Disease Resistance Against White Spot Syndrome Virus (WSSV) in Liptopenaeus vannamei. Front. Genet. 2019, 10, 264. [Google Scholar] [CrossRef]
  27. Rozenberg, A.; Brand, P.; Rivera, N.; Leese, F.; Schubart, C.D. Characterization of fossilized relatives of the White Spot Syndrome Virus in genomes of decapod crustaceans. BMC Evol. Biol. 2015, 15, 142. [Google Scholar] [CrossRef] [Green Version]
  28. Kawato, S.; Shitara, A.; Wang, Y.; Nozaki, R.; Kondo, H.; Hirono, I. Crustacean Genome Exploration Reveals the Evolutionary Origin of White Spot Syndrome Virus. J. Virol. 2019, 93, e01144-18. [Google Scholar] [CrossRef] [Green Version]
  29. Bao, W. DNA viruses from the shrimp genome. Repbase Rep. 2018, 18, 1352. [Google Scholar]
  30. Bao, W.; Alcivar-Warren, A.; Bogden, R.; Tao, Q.; Iyer, S.; Mikhaylenko, G.; Wittendorp, J.; Mraz, A.; Hart, E.; Hatas, E.; et al. A fossilized white spot syndrome virus-like element (DNAV-1_LVa) in the genome of the original specific pathogen-free (SPF) shrimp Penaeus (Litopenaeus) vannamei domesticated by the breeding program of the US Marine Shrimp Farming Program (USMSFP) from Hawaii, USA. In Proceedings of the Aquaculture 2019, New Orleans, LA, USA, 7–11 March 2019; p. 80. [Google Scholar]
  31. Iranzo, J.; Krupovic, M.; Koonin, E.V. The Double-Stranded DNA Virosphere as a Modular Hierarchical Network of Gene Sharing. mBio 2016, 7, e00978-16. [Google Scholar] [CrossRef] [Green Version]
  32. Smit, A.F.A.; Hubley, R. RepeatModeler Open-1.0. 2008–2015. Available online: http://www.repeatmasker.org (accessed on 2 November 2019).
  33. Benson, G. Tandem repeats finder: A program to analyze DNA sequences. Nucleic Acids Res. 1999, 27, 573–580. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Solovyev, V.; Kosarev, P.; Seledsov, I.; Vorobyev, D. Automatic annotation of eukaryotic genes, pseudogenes and promoters. Genome Biol. 2006, 7, S10.1–S10.12. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Katoh, K.; Rozewicki, J.; Yamada, K.D. MAFFT online service: Multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform 2019, 20, 1160–1166. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Waterhouse, A.M.; Procter, J.B.; Martin, D.M.; Clamp, M.; Barton, G.J. Jalview Version 2—a multiple sequence alignment editor and analysis workbench. Bioinformatics 2009, 25, 1189–1191. [Google Scholar] [CrossRef] [Green Version]
  37. Kohany, O.; Gentles, A.J.; Hankus, L.; Jurka, J. Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor. BMC Bioinform. 2006, 7, 474. [Google Scholar] [CrossRef] [Green Version]
  38. HMMER: Biosequence Analysis Using Profile Hidden Markov Models. Available online: http://hmmer.org/ (accessed on 2 November 2019).
  39. Alcivar-Warren, A.; Meehan-Meola, D.; Wang, Y.; Guo, X.; Zhou, L.; Xiang, J.; Moss, S.; Arce, S.; Warren, W.; Xu, Z.; et al. Isolation and mapping of telomeric pentanucleotide (TAACC)n repeats of the Pacific whiteleg shrimp, Penaeus vannamei, using fluorescence in situ hybridization. Mar. Biotechnol. (NY) 2006, 8, 467–480. [Google Scholar] [CrossRef]
  40. Arbuckle, J.H.; Medveczky, M.M.; Luka, J.; Hadley, S.H.; Luegmayr, A.; Ablashi, D.; Lund, T.C.; Tolar, J.; De Meirleir, K.; Montoya, J.G.; et al. The latent human herpesvirus-6A genome specifically integrates in telomeres of human chromosomes in vivo and in vitro. Proc. Natl. Acad. Sci. USA 2010, 107, 5563–5568. [Google Scholar] [CrossRef] [Green Version]
  41. Pantry, S.N.; Medveczky, P.G. Latency, Integration, and Reactivation of Human Herpesvirus-6. Viruses 2017, 9, 194. [Google Scholar] [CrossRef] [Green Version]
  42. Wood, M.L.; Royle, N.J. Chromosomally Integrated Human Herpesvirus 6: Models of Viral Genome Release from the Telomere and Impacts on Human Health. Viruses 2017, 9, 184. [Google Scholar] [CrossRef] [Green Version]
  43. Osterrieder, N.; Wallaschek, N.; Kaufer, B.B. Herpesvirus Genome Integration into Telomeric Repeats of Host Cell Chromosomes. Annu. Rev. Virol. 2014, 1, 215–235. [Google Scholar] [CrossRef]
  44. Kheimar, A.; Previdelli, R.L.; Wight, D.J.; Kaufer, B.B. Telomeres and Telomerase: Role in Marek’s Disease Virus Pathogenesis, Integration and Tumorigenesis. Viruses 2017, 9, 173. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  45. Silke, J.; Vucic, D. IAP family of cell death and signaling regulators. In Methods in enzymology; Ashkenazi, A., Wells, J.A., Yuan, J., Eds.; Elsevier: Amsterdam, The Netherlands, 2014; Volume 545, pp. 35–65. [Google Scholar]
  46. Jin, H.S.; Lee, D.H.; Kim, D.H.; Chung, J.H.; Lee, S.J.; Lee, T.H. cIAP1, cIAP2, and XIAP act cooperatively via nonredundant pathways to regulate genotoxic stress-induced nuclear factor-kappaB activation. Cancer Res. 2009, 69, 1782–1791. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  47. Galbán, S.; Duckett, C.S. XIAP as a ubiquitin ligase in cellular signaling. Cell Death Differ. 2010, 17, 54–60. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  48. Wu, C.; Yang, F. Localization studies of two white spot syndrome virus structural proteins VP51 and VP76. Virol. J. 2006, 3, 76. [Google Scholar] [CrossRef] [Green Version]
  49. Wang, P.H.; Huang, T.; Zhang, X.; He, J.G. Antiviral defense in shrimp: From innate immunity to viral infection. Antivir. Res. 2014, 108, 129–141. [Google Scholar] [CrossRef]
  50. Radons, J. The human HSP70 family of chaperones: Where do we stand? Cell Stress Chaperones 2016, 21, 379–404. [Google Scholar] [CrossRef] [Green Version]
  51. Yan, F.; Xia, D.; Hu, J.; Yuan, H.; Zou, T.; Zhou, Q.; Liang, L.; Qi, Y.; Xu, H. Heat shock cognate protein 70 gene is required for prevention of apoptosis induced by WSSV infection. Arch. Virol. 2010, 155, 1077–1083. [Google Scholar] [CrossRef]
  52. Tassanakajon, A.; Somboonwiwat, K.; Supungul, P.; Tang, S. Discovery of immune molecules and their crucial functions in shrimp immunity. Fish Shellfish Immunol. 2013, 34, 954–967. [Google Scholar] [CrossRef]
  53. Valentim-Neto, P.A.; Moser, J.R.; Fraga, A.P.; Marques, M.R. Hsp70 expression in shrimp Litopenaeus vannamei in response to IHHNV and WSSV infection. Virusdisease 2014, 25, 437–440. [Google Scholar] [CrossRef] [Green Version]
  54. Janewanthanakul, S.; Supungul, P.; Tang, S.; Tassanakajon, A. Heat shock protein 70 from Litopenaeus vannamei (LvHSP70) is involved in the innate immune response against white spot syndrome virus (WSSV) infection. Dev. Comp. Immunol. 2020, 102, 103476. [Google Scholar] [CrossRef]
  55. Dong, C.W.; Zhang, Y.B.; Zhang, Q.Y.; Gui, J.F. Differential expression of three Paralichthys olivaceus Hsp40 genes in responses to virus infection and heat shock. Fish Shellfish Immunol. 2006, 21, 146–158. [Google Scholar] [CrossRef] [PubMed]
  56. Vidya, R.; Gireesh-Babu, P.; Pani Prasad, K. White spot syndrome virus Manipulates Ubiquitin Gene Expression in Penaeus monodon. Indian J. Virol. 2013, 24, 82–84. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  57. Yi, S.; Li, Y.; Shi, L.; Zhang, L. Novel Insights into Antiviral Gene Regulation of Red Swamp Crayfish, Procambarus clarkii, Infected with White Spot Syndrome Virus. Genes 2017, 8, 320. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  58. Lertwimol, T.; Sangsuriya, P.; Phiwsaiya, K.; Senapin, S.; Phongdara, A.; Boonchird, C.; Flegel, T.W. Two new anti-apoptotic proteins of white spot syndrome virus that bind to an effector caspase (PmCasp) of the giant tiger shrimp Penaeus (Penaeus) monodon. Fish Shellfish Immunol. 2014, 38, 1–6. [Google Scholar] [CrossRef]
  59. Ventura-López, C.; Gómez-Anduro, G.; Arcos, F.G.; Llera-Herrera, R.; Racotta, I.S.; Ibarra, A.M. A novel CHH gene from the Pacific white shrimp Litopenaeus vannamei was characterized and found highly expressed in gut and less in eyestalk and other extra-eyestalk tissues. Gene 2016, 582, 148–160. [Google Scholar] [CrossRef]
  60. Ohira, T. Crustacean Hyperglycemic Hormone. In Handbook of Hormones; Takei, Y., Ando, H., Tsutsui, K., Eds.; Elsevier: Amsterdam, The Netherlands, 2016. [Google Scholar]
  61. Zuo, H.; Yuan, J.; Niu, S.; Yang, L.; Weng, S.; He, J.; Xu, X. A molting-inhibiting hormone-like protein from Pacific white shrimp Litopenaeus vannamei is involved in immune responses. Fish Shellfish Immunol. 2018, 72, 544–551. [Google Scholar] [CrossRef]
  62. Wanlem, S.; Supamattaya, K.; Tantikitti, C.; Prasertsan, P.; Graidist, P. Expression and applications of recombinant crustacean hyperglycemic hormone from eyestalks of white shrimp (Litopenaeus vannamei) against bacterial infection. Fish Shellfish Immunol. 2011, 30, 877–885. [Google Scholar] [CrossRef]
  63. Xu, L.; Pan, L.; Zhang, X.; Wei, C. Crustacean hyperglycemic hormone (CHH) affects hemocyte intracellular signaling pathways to regulate exocytosis and immune response in white shrimp Litopenaeus vannamei. Peptides 2019, 116, 30–41. [Google Scholar] [CrossRef]
  64. Phelan, P.; Stebbings, L.A.; Baines, R.A.; Bacon, J.P.; Davies, J.A.; Ford, C. Drosophila Shaking-B protein forms gap junctions in paired Xenopus oocytes. Nature 1998, 391, 181–184. [Google Scholar] [CrossRef]
  65. Güiza, J.; Barría, I.; Sáez, J.C.; Vega, J.L. Innexins: Expression, Regulation, and Functions. Front. Physiol. 2018, 9, 1414. [Google Scholar] [CrossRef]
  66. Wang, S.P.; Chen, F.Y.; Dong, L.X.; Zhang, Y.Q.; Chen, H.Y.; Qiao, K.; Wang, K.J. A novel innexin2 forming membrane hemichannel exhibits immune responses and cell apoptosis in Scylla paramamosain. Fish Shellfish Immunol. 2015, 47, 485–499. [Google Scholar] [CrossRef] [PubMed]
  67. Liu, T.; Li, M.; Zhang, Y.; Pang, Z.; Xiao, W.; Yang, Y.; Luo, K. A role for Innexin2 and Innexin3 proteins from Spodoptera litura in apoptosis. PLoS ONE 2013, 8, e70456. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  68. Chen, Y.-B.; Xiao, W.; Li, M.; Zhang, Y.; Yang, Y.; Hu, J.-S.; Luo, K.-J. N-terminally elongated SpliInx2 and SpliInx3 reduce baculovirus-triggered apoptosis via hemichannel closure. Arch. Insect Biochem. Physiol. 2016, 92, 24–37. [Google Scholar] [CrossRef] [PubMed]
  69. Turnbull, M.; Webb, B. Perspectives on polydnavirus origins and evolution. Adv. Virus Res. 2002, 58, 203–254. [Google Scholar] [PubMed]
  70. Tanaka, K.; Lapointe, R.; Barney, W.E.; Makkay, A.M.; Stoltz, D.; Cusson, M.; Webb, B.A. Shared and species-specific features among ichnovirus genomes. Virology 2007, 363, 26–35. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  71. Dupuy, C.; Huguet, E.; Drezen, J.M. Unfolding the evolutionary story of polydnaviruses. Virus Res. 2006, 117, 81–89. [Google Scholar] [CrossRef] [PubMed]
  72. Chae, H.J.; Kim, H.R.; Xu, C.; Bailly-Maitre, B.; Krajewska, M.; Krajewski, S.; Banares, S.; Cui, J.; Digicaylioglu, M.; Ke, N.; et al. BI-1 regulates an apoptosis pathway linked to endoplasmic reticulum stress. Mol. Cell 2004, 15, 355–366. [Google Scholar] [CrossRef]
  73. Bultynck, G.; Kiviluoto, S.; Henke, N.; Ivanova, H.; Schneider, L.; Rybalchenko, V.; Luyten, T.; Nuyts, K.; De Borggraeve, W.; Bezprozvanny, I.; et al. The C Terminus of Bax Inhibitor-1 Forms a Ca2+-permeable Channel Pore. J. Biol. Chem. 2012, 287, 2544–2557. [Google Scholar] [CrossRef] [Green Version]
  74. Hückelhoven, R. BAX Inhibitor-1, an ancient cell death suppressor in animals and plants with prokaryotic relatives. Apoptosis 2004, 9, 299–307. [Google Scholar] [CrossRef]
  75. Roney, K.; Holl, E.; Ting, J. Immune plexins and semaphorins: Old proteins, new immune functions. Protein Cell 2013, 4, 17–26. [Google Scholar] [CrossRef] [Green Version]
  76. Nishide, M.; Kumanogoh, A. The role of semaphorins in immune responses and autoimmune rheumatic diseases. Nat. Rev. Rheumatol. 2018, 14, 19–31. [Google Scholar] [CrossRef] [PubMed]
  77. Nakagawa, Y.; Takamatsu, H.; Okuno, T.; Kang, S.; Nojima, S.; Kimura, T.; Kataoka, T.R.; Ikawa, M.; Toyofuku, T.; Katayama, I.; et al. Identification of semaphorin 4B as a negative regulator of basophil-mediated immune responses. J. Immunol. 2011, 186, 2881–2888. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  78. Takamatsu, H.; Kumanogoh, A. Diverse roles for semaphorin-plexin signaling in the immune system. Trends Immunol. 2012, 33, 127–135. [Google Scholar] [CrossRef] [PubMed]
  79. Zhou, L.; Gui, J. Applications of genetic breeding biotechnologies in Chinese aquaculture. In Aquaculture in China: Success Stories and Modern Trends; Gui, J.-F., Tang, Q., Li, Z., Liu, J., De Silva, S.S., Eds.; Wiley Online Library: Hoboken, NJ, USA, 2018. [Google Scholar]
  80. Thézé, J.; Leclercq, S.; Moumen, B.; Cordaux, R.; Gilbert, C. Remarkable Diversity of Endogenous Viruses in a Crustacean Genome. Genome Biol. Evol. 2014, 6, 2129–2140. [Google Scholar] [CrossRef] [Green Version]
  81. Orosco, F.L.; Lluisma, A.O. Variation in virome diversity in wild populations of Penaeus monodon (Fabricius 1798) with emphasis on pathogenic viruses. Virusdisease 2017, 28, 262–271. [Google Scholar] [CrossRef]
  82. Drezen, J.M.; Provost, B.; Espagne, E.; Cattolico, L.; Dupuy, C.; Poirié, M.; Periquet, G.; Huguet, E. Polydnavirus genome: Integrated vs. free virus. J. Insect Physiol. 2003, 49, 407–417. [Google Scholar] [CrossRef]
  83. Whitfield, J.B.; Asgari, S. Virus or not? Phylogenetics of polydnaviruses and their wasp carriers. J. Insect Physiol. 2003, 49, 397–405. [Google Scholar] [CrossRef]
  84. He, Y.; Yang, K.; Zhang, X. Viral microRNAs targeting virus genes promote virus infection in shrimp in vivo. J. Virol. 2014, 88, 1104–1112. [Google Scholar] [CrossRef] [Green Version]
  85. Wang, P.H.; He, J.G. Nucleic Acid Sensing in Invertebrate Antiviral Immunity. Int. Rev. Cell Mol. Biol. 2019, 345, 287–360. [Google Scholar]
  86. Peruzza, L.; Shekhar, M.S.; Kumar, K.V.; Swathi, A.; Karthic, K.; Hauton, C.; Vijayan, K.K. Temporal changes in transcriptome profile provide insights of White Spot Syndrome Virus infection in Litopenaeus vannamei. Sci. Rep. 2019, 9, 13509. [Google Scholar] [CrossRef]
  87. Verbruggen, B.; Bickley, L.; van Aerle, R.; Bateman, K.; Stentiford, G.; Santos, E.; Tyler, C. Molecular Mechanisms of White Spot Syndrome Virus Infection and Perspectives on Treatments. Viruses 2016, 8, 23. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. (A) Schematic representation of Nimav-1_LVa endogenous nimavirus and the encoded genes. Uppermost part indicates the location of genes and their transcription orientation (in pink or light blue). Gene names in black are 20 hypothetical genes, gene names in blue (n = 3) indicate no viral homologs are found, genes in green (n = 94) indicate viral homologs are found. The brown boxes indicate the locations of long tracts of simple tandem repeats. Solid lines below the Nimav-1_LVa bar represent some larger Nimav-1_LVa segments present in the Kehai No. 1 assembly. The accession number and location of the segments can be found in Supplementary Table S1; (B) Sequence alignment of the terminal regions of the linear Nimav-1_LVa and the flanking sequences. The green-shaded regions belong to Nimav-1_LVa. The telomeric (TAACC/GGTTA)n microsatellite regions are shaded in grey.
Figure 1. (A) Schematic representation of Nimav-1_LVa endogenous nimavirus and the encoded genes. Uppermost part indicates the location of genes and their transcription orientation (in pink or light blue). Gene names in black are 20 hypothetical genes, gene names in blue (n = 3) indicate no viral homologs are found, genes in green (n = 94) indicate viral homologs are found. The brown boxes indicate the locations of long tracts of simple tandem repeats. Solid lines below the Nimav-1_LVa bar represent some larger Nimav-1_LVa segments present in the Kehai No. 1 assembly. The accession number and location of the segments can be found in Supplementary Table S1; (B) Sequence alignment of the terminal regions of the linear Nimav-1_LVa and the flanking sequences. The green-shaded regions belong to Nimav-1_LVa. The telomeric (TAACC/GGTTA)n microsatellite regions are shaded in grey.
Genes 11 00094 g001
Figure 2. Conserved blocks in the alignment of 217p, wsv308 (AAL33310.1), and other homologs. 217p is the protein encoded by gene g217 in Nimav-1_LVa, 217p_Mj denote the homolog of 217p encoded in Mj nimavairus. The other proteins and the hosting genome are AAL33310.1 (WSSV), GBG35584.1 (Si nimavirus), GAV93231.1 (Chionoecetes opilio bacilliform virus), GBG35376.1 (Ht nimavirus), and GBG35522.1 (Pm nimavirus). Numbers after the slash indicate the length of that protein. Numbers at either side of the blocks indicate the locations of the preceding/following amino acids in each protein.
Figure 2. Conserved blocks in the alignment of 217p, wsv308 (AAL33310.1), and other homologs. 217p is the protein encoded by gene g217 in Nimav-1_LVa, 217p_Mj denote the homolog of 217p encoded in Mj nimavairus. The other proteins and the hosting genome are AAL33310.1 (WSSV), GBG35584.1 (Si nimavirus), GAV93231.1 (Chionoecetes opilio bacilliform virus), GBG35376.1 (Ht nimavirus), and GBG35522.1 (Pm nimavirus). Numbers after the slash indicate the length of that protein. Numbers at either side of the blocks indicate the locations of the preceding/following amino acids in each protein.
Genes 11 00094 g002
Table 1. Seven representative nimaviruses of the seven major phylogenetic groups.
Table 1. Seven representative nimaviruses of the seven major phylogenetic groups.
NimavirusesGenBank Accessions Size (Kb) 1
White spot syndrome virus (WSSV)AF332093.3305.119
C. opilio bacilliform virus (CoBV)BDLS01000001 and BDLS01000002237.1
M. japonicus endogenous nimavirus (Mj)BFCD01000001 and AP010878~220
P. monodon endogenous nimavirus (Pm)BFCF01000001 to BFCF01000003191.8
H. takanoi endogenous nimavirus (Ht)BFCC01000001 to BFCC01000006218.1
M. ensis endogenous nimavirus (Me)BFCE01000001 to BFCE01000010232.4
S. intermedium endogenous nimavirus (Si)BFCG01000001 to BFCG01000014189
1 Except for the complete genomes of various WSSV strains, the genomes of the other nimaviruses are all incomplete so far. According to Kawato et al. [28], the M. japonicus endogenous nimavirus regions in the bacterial artificial chromosome (BAC) clone sequences (AP010878 and BFCD01000001) are added to be only ~220 Kb, excluding the terminal non-viral regions.
Table 2. Three types of nimavirus sequences were detected in two shrimp species.
Table 2. Three types of nimavirus sequences were detected in two shrimp species.
Nimavirus TypeLength (Identity 1)
P. monodonM. japonicus
Nimav-1_LVa>141 Kb (>99%)>33 Kb (>99%)
Mj-like>200 Kb (>91%)>49 Kb (>88%)
Pm-like>226 Kb (>88%)>199 Kb (>88%)
1 The identity in the parenthesis indicates the minimum sequence identity to the known nimavirus for most majority of the homologous sequences detected in each search.
Table 3. A total of 117 protein-coding genes predicted in Nimav-1_LVa endogenous nimavirus.
Table 3. A total of 117 protein-coding genes predicted in Nimav-1_LVa endogenous nimavirus.
Genes 1CDS startCDS endDirectionProtein (AA)Viral Homolog 2Comment 3
g00213881990d201g009PF1
g00370028099r217AKS10635.1PF2, 4 exons, 2 BIR domains (cd00022)
g00487929187d132BFCD01000001.1 (98,829–99,209)
g00610,03711,149d371g008PF1
g00811,21912,316d366g006PF1
g00912,52713,123r199g002PF1
g01013,46714,258r264g161PF1
g01114,74915,966d406g006PF1
g01216,31919,114r725AKS10635.1PF2, 4 exons, 3 BIR domains (cd00022), 1 RING-HC_BIRC2_3_7 (cd16713)
g01719,25621,711r710AKS10635.1PF2, 4 exons, 3 BIR domains (cd00022), 1 RING-HC_BIRC2_3_7 (cd16713)
g02124,39324,710r106
g02227,84931,305d782AP010878.1 (14,919–16,765)5 exons
g02631,55633,238r561AP010878.1 (4458-6026)
g02733,24333,857r205AP010878.1 (3787–4395)
g03034,48835,048r187AKS10635.1PF2, 1 BIR domain (cd00022)
g0313,547837,592d705GBG35399.1wsv220-like, capsid protein
g03337,83838,707d290
g03438,72939,364r212GBG35402.1wsv206-like, PF6, containing macro domain (cd02749), a high-affinity ADP-ribose binding module, as shown in GBG35398.
g03639,65645,952d2099BFCD01000001.1 (62,195–69,574)
g03846,08748,771d895BFCD01000001.1 (57,879–62,171)
g04048,89949,639r247
g04249,55753,573r1339GBG35397.1wsv026-like
g04554,48557,460d1138GBG35396.1wsv115-like, envelope protein
g04657,83358,502r123 3 exons, CHH-like, containing crust_neurohorm domain (pfam01147)
g04758,78260,584d448AKS10635.1PF2, 4 exons, 2 BIR domains (cd00022), 1 RING-HC_BIRC4_8 (cd16714)
g04960,85162,533d412AKS10635.1PF2, 5 exons, 2 BIR domains (cd00022), 1 RING-HC_BIRC4_8 (cd16714)
g05062,86264,124d421g051PF3
g05164,43565,805d457g050PF3
g05266,04269,944d1301BFCD01000001.1 (167,202–171,290)PF3
g05670,36771,194d276g269PF5
g05875,54576,801r419BFCD01000001.1 (79,831–81,120)
g06076,97277,379r136BFCD01000001.1 (79,176–79,601)
g06177,38278,608r409GBG35401.1wsv415-like, capsid protein
g06278,77979,048d90BFCD01000001.1 (77,744–77,460)
g06379,40582,122r906GBG35400.1wsv216-like, envelope protein
g06582,20482,896r231
g06683,30183,948d216
g06884,16284,830d223
g07186,83188,126r432GBG35404.1wsv161-like
g07288,34491,454r1037GBG35405.1wsv011-like, envelope protein
g07791,61294,842r1077GBG35406.1wsv313-like
g08195,57796,821d415GBG35407.1wsv282-like
g08397,302115,505d6068GBG35408.1wsv360-like, capsid protein
g098115,730119,659d1310GBG35428.1wsv037-like, capsid protein
g103119,938122,058r707GBG35427.1molecular chaperone DnaK (HSP70) protein domain (COG0443)
g106122,293123,042d250BFCD01000001.1 (191,561–192,277)
g107123,054123,677d208GBG35426.1wsv021-like, envelope protein
g108123,758127,078d1107GBG35425.1wsv139-like
g110127,163128,206d348GBG35424.1wsv137-like
g112128,419131,400d994GBG35423.1wsv192-like
g115131,798134,191d798GBG35356.1SCV_095-like, ATP-dependent DNA ligase I (dnl1) domain (TIGR00574) and Poly (ADP-ribose) polymerase and DNA-Ligase Zn-finger (pfam00645)
g118134,634135,728d365 DnaJ/Hsp40 protein, containing DnaJ-class molecular chaperone with C-terminal Zn finger domain (COG0484)
g123136,705137,067d121GBG35422.1wsv136-like
g125137,840138,184r115
g126139,067139,678d204BFCD01000001.1 (181,829–182,488)
g130140,199143,633r1145GBG35421.1wsv271-like, capsid protein
g131143,809145,152d448GBG35420.1wsv131-like
g132145,203145,433r77BFCD01000001.1 (176,011–176,340)ubiquitin-like (Ubl) domain (cd01803) found in ubiquitin
g133145,548146,687r380GBG35419.1wsv325-like, envelope protein
g134146,697147,203d169BFCD01000001.1 (172,895–173,647)
g135147,318148,061d248GBG35417.1wsv133-like
g136147,643148,566d308GBG35418.1wsv134-like
g137148,715148,927r71
g139149,321149,815d165GBG35402.1wsv206-like, PF6, containing macro domain (cd02749), a high-affinity ADP-ribose binding module, as shown in GBG35398.
g140150,049151,434d462BFCC01000003.1 (801–2426)wsv112-like, dUTPase, containing deoxyuridine 5’-triphosphate nucleotidohydrolase (dut) domain (TIGR00576)
g141151,587152,777d397g143PF1
g143152,974154,158d395g141PF1
g146154,360155,547d396g006PF1
g149156,541156,915d125
g150157,204158,148r315
g152158,385158,825d147
g153159,042160,274d411
g154160,465161,412d316
g155161,589163,290d428BFCD01000001.1 (51,019–52,483)4 exons, innexin domain (pfam00876)
g158163,455165,615r274 5 exons, Bax inhibitor (BI)-1 domain (cd10430).
g161165,152165,772d207g010PF1
g162166,492166,767d92
g163167,017167,787d257
g166168,473172,924d1484BBD20107.1wsv209-like, envelope protein
g170172,897173,529d211AP010878.1 (53,502–54,038)
g171173,590174,540d317BBD20108.1wsv267-like, anti-apoptotic protein
g172174,735175,523d263AP010878.1 (55,291–56,157)PF4
g173175,556176,485d310AP010878.1 (55,291–56,157)PF4
g175176,584177,510r309
g176177,844178,152d103BBD20109.1wsv293a-like, envelope protein
g177178,301183,058d1586GBG35554.1wsv289-like, capsid protein
g187183,225196,220r4332BBD20111.1wsv343-like
g206196,741197,286d182BFCG01000002.1 (22,541–22,065)SCV_028-like
g208197,905199,944r680BBD20112.1wsv327-like, envelope protein
g211200,119202,590d824BBD20113.1wsv332-like
g213203,182204,525d448BBD20114.1wsv306-like, tegument protein
g217204,533206,176d548AP010878.1 (81,486–83,258)wsv308-like, capsid protein
g220207,060207,515d152AP010878.1 (84,507–85,430)
g222207,948210,626d893BBD20115.1wsv285-like
g223210,916212,928d671AP010878.1 (89,194–91,152)
g225218,360220,426d689GBG35515.1wsv226-like
g227225,839227,851d671AP010878.1 (111,284–114,223)semaphorin 1A (Sema_1A) domain (cd11237)
g228228,056229,738d561AP010878.1 (109,235–110,863)
g231230,064232,973d970GBG35403.1wsv035-like, envelope protein
g234233,341234,681r447 In GenBank, part of 234p is computationally predicted as high mobility group protein DSP1-like (XP_027238145.1), 184 AA.
g236234,925237,936d1004BFCD01000001.1 (86,470–89,643)
g240241,130241,501d124
g241241,716242,537d274
g242242,655244,988r778GBG35414.1wsv303-like
g246245,095249,063r1323GBG35413.1wsv433-like
g251249,059249,457r133GBG35412.1wsv432-like
g252249,459251,285d609GBG35411.1wsv427-like
g253251,758254,091d778BFCD01000001.1 (134,943–137,015)
g254254,266255,603d446GBG35410.1wsv423-like, Protein kinase 1
g255255,856257,829d658GBG35409.1wsv440-like
g257258,137259,543r469g050PF3
g259259,757267,121r2455GBG35416.1wsv514-like, DNA polymerase
g262267,305272,629d1775GBG35415.1wsv447-like
g268272,942273,223d94
g269273,601274,653d351g271PF5
g271275,034277,334d767g269PF5
g276278,291279,160d290AP010878.1 (55,291–56,157)PF4
1 65 Mj-group-specific genes are indicated by bold font. 2 Viral homologs in this table refer to those present in Nimav-1_LVa, WSSV, Chionoecetes opilio bacilliform virus, and other endogenous nimaviruses (see Table 1 and methods section). Only the top homologous proteins or coding sequences are listed in this table. The parenthesized coordinates after the accession numbers indicate the homologous coding regions detected by TblastN. 3 Exon numbers here refer only to the coding exon. WSSV gene nomenclature indicated with “wsvNNN” is taken from the annotation in AF332093.3 (WSSV-CN strain). PF: paralog families; BIR: Baculovirus Inhibitor of apoptosis protein Repeat; RING-HC: Really Interesting New Gene finger domain of the C3HC4 type; BIRC: baculoviral inhibitor of apoptosis protein repeat containing protein.

Share and Cite

MDPI and ACS Style

Bao, W.; Tang, K.F.J.; Alcivar-Warren, A. The Complete Genome of an Endogenous Nimavirus (Nimav-1_LVa) From the Pacific Whiteleg Shrimp Penaeus (Litopenaeus) Vannamei. Genes 2020, 11, 94. https://0-doi-org.brum.beds.ac.uk/10.3390/genes11010094

AMA Style

Bao W, Tang KFJ, Alcivar-Warren A. The Complete Genome of an Endogenous Nimavirus (Nimav-1_LVa) From the Pacific Whiteleg Shrimp Penaeus (Litopenaeus) Vannamei. Genes. 2020; 11(1):94. https://0-doi-org.brum.beds.ac.uk/10.3390/genes11010094

Chicago/Turabian Style

Bao, Weidong, Kathy F. J. Tang, and Acacia Alcivar-Warren. 2020. "The Complete Genome of an Endogenous Nimavirus (Nimav-1_LVa) From the Pacific Whiteleg Shrimp Penaeus (Litopenaeus) Vannamei" Genes 11, no. 1: 94. https://0-doi-org.brum.beds.ac.uk/10.3390/genes11010094

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop