High Mutation Frequency and Significant Population Differentiation in Papaya Ringspot Virus-W Isolates

Khanal, Vivek; Ali, Akhtar

doi:10.3390/pathogens10101278

Open AccessArticle

High Mutation Frequency and Significant Population Differentiation in Papaya Ringspot Virus-W Isolates

by

Vivek Khanal

and

Akhtar Ali

^*

Department of Biological Science, The University of Tulsa, Tulsa, OK 74104, USA

^*

Author to whom correspondence should be addressed.

Pathogens 2021, 10(10), 1278; https://0-doi-org.brum.beds.ac.uk/10.3390/pathogens10101278

Submission received: 1 September 2021 / Revised: 29 September 2021 / Accepted: 30 September 2021 / Published: 4 October 2021

(This article belongs to the Special Issue Evolution of Plant Viruses)

Download

Browse Figures

Versions Notes

Abstract

:

A total of 101 papaya ringspot virus-W (PRSV-W) isolates were collected from five different cucurbit hosts in six counties of Oklahoma during the 2016–2018 growing seasons. The coat protein (CP) coding region of these isolates was amplified by reverse transcription-polymerase chain reaction, and 370 clones (3–5 clones/isolate) were sequenced. Phylogenetic analysis revealed three phylogroups while host, location, and collection time of isolates had minimal impact on grouping pattern. When CP gene sequences of these isolates were compared with sequences of published PRSV isolates (both P and W strains), they clustered into four phylogroups based on geographical location. Oklahoman PRSV-W isolates formed one of the four distinct major phylogroups. The permutation-based tests, including Ks, Ks *, Z *, Snn, and neutrality tests, indicated significant genetic differentiation and polymorphisms among PRSV-W populations in Oklahoma. The selection analysis confirmed that the CP gene is undergoing purifying selection. The mutation frequencies among all PRSV-W isolates were within the range of 1 × 10⁻³. The substitution mutations in 370 clones of PRSV-W isolates showed a high proportion of transition mutations, which gave rise to higher GC content. The N-terminal region of the CP gene mostly contained the variable sites with numerous mutational hotspots, while the core region was highly conserved.

Keywords:

selection pressure; mutation; coat protein; polymorphism; substitution

1. Introduction

Papaya ringspot virus (PRSV), a single-stranded, positive-sense RNA virus, belongs to the family Potyviridae and genus Potyvirus. The virus is primarily grouped into two serologically indistinguishable strains: papaya-infecting type (PRSV-P) and cucurbit-infecting type (PRSV-W) [1]. The host range of PRSV-W is limited to Chenopodiaceae and Cucurbitaceae, while PRSV-P can infect plants in the papaya family (Caricaceae) as well [1,2,3]. The site-directed mutagenesis study in recombinant viruses showed that lysine amino acid (aa) at position 27 in NIa-Pro determines the host specificity in PRSV-P, as a single aa change at that position from lysine to aspartic acid can change the host range of PRSV-P to non-papaya-infecting [4].

PRSV particles are non-enveloped, flexuous, and filamentous rods about 680–900 nm in length and 12–15 nm in diameter. They are transmitted by several aphid species in a non-persistent, non-circulative manner [5,6]. The virus can be mechanically inoculated, while no seed transmission has been reported. Similar to other potyviruses, PRSV has a linear single-stranded and positive-sense RNA (+ssRNA) genome that is approximately 10.3 kb. The PRSV genome comprises a 5ʹ untranslated region, a single open reading frame (ORF) that codes for a major polyprotein, and a 3’ untranslated region. The polyprotein is proteolytically processed by virus-encoded proteases into 10 mature proteins: P1 (63k), helper component protease (Hc-Pro, 52k), P3 (46k), 6K1 (6k), cylindrical inclusion (CI, 72k), 6K2 (6k), nuclear inclusion protein a-virus protein genome linked (NIa-Vpg, 27k), nuclear inclusion protein a-protease (NIa-Pro 21k), nuclear inclusion protein b (NIb, 59k), and coat protein (CP, 36K) [7]. A small protein called pretty interesting potyvirus ORF (PIPO) is synthesized by an additional ORF in the P3 region [8].

The CP gene of potyviruses, including PRSV, is located at the 3’ terminal end of the viral genome and encapsidates the RNA genome of the virus. The CP of potyviruses is a multifunctional protein and has a significant role in the viral life cycle. For instance, it is involved in aphid transmission in association with HC-Pro [9], cell-to-cell and systemic movement [10], virus assembly [7], and host adaptation [11]. Classification of potyviruses based on the CP gene was quite common until the mid-2000s as it is considered the most conserved protein among potyviruses. CP is the only structural protein in potyviruses, and its multiple subunits form a protective coat for the RNA genome. The cleavage motif for CP and Nib region in PRSV-W is VFHQ/S [12].

At least 94 viruses from 17 different families, including Potyviridae, have been reported to infect cucurbit crops [13], while 15 of those belong to the genus Potyvirus [6] PRSV-W is one of the major viruses infecting cucurbits that cause substantial yield losses of cucurbits worldwide [13,14,15,16,17,18,19,20]. PRSV-W induces a wide range of symptoms in different cucurbit crops, which includes mosaics, mottling, stunting, vein clearing, shoestrings on leaves, ringspots, and streaks on fruit stems and petioles, thereby reducing both quality and quantity of fruit production [13].

The highly error-prone, RNA-dependent RNA Polymerase (RdRp) of RNA viruses, including PRSV-W, lack proofreading ability, enabling them to generate a large pool of genetically distinct sequences (often referred to as a ‘mutant swarm’) in a short generation time compatible with the concept of viral quasispecies [21]. These attributes contribute to high levels of genetic diversity, the ability to adapt to changing environments, including new hosts, and to evade host resistance [22]. Numerous molecular studies have been conducted in recent decades on ecology, etiology, pathogenesis, molecular biology, diversity, evolution, and control strategies of PRSV-P, but very few about PRSV-W. Thus, further investigation of the evolution of the viral population is important to deduce reliable diagnostic tools and effective management strategies for combating PRSV-W.

Previously, an evolutionary study was conducted on 64 PRSV-W isolates from watermelon in Oklahoma during a single growing season with a limited area of sampling [12]. In this study, we performed a comprehensive evaluation of population differentiation and genetic diversity among >100 PRSV-W isolates from six counties of Oklahoma (Figure S1) during three growing seasons. We hypothesized that since the virus sequences were sampled within a short span of time, there would be strong purifying selection, and the population would consist of a mutant cloud or mutant swarm. Our other hypothesis was that the phylogenetic grouping of the isolates would be influenced by geographical location, hosts, and year of collection. We also evaluated mutation frequency and its pattern within individual clones of PRSV-W isolates collected from different regions of Oklahoma.

2. Results

2.1. PRSV-W Isolates and Confirmation by RT-PCR

A total of 101 PRSV-W isolates were collected from six counties (Blaine, Caddo, Cimarron, McCurtain, Muskogee, and Tulsa) of Oklahoma from five different hosts (cantaloupe, cucumber, pumpkin, squash, and watermelon) during the 2016–2018 growing seasons. The complete CP gene of these isolates was amplified by reverse transcription-polymerase chain reaction (RT-PCR), and the expected size DNA bands were gel-purified. PRSV-W positive samples were also checked by RT-PCR for mixed infection with specific primers of watermelon mosaic virus (WMV) and zucchini yellow mosaic virus (ZYMV) [13]. Fifty-five of these 101 PRSV-W had mixed infection with WMV, ZYMV, or both.

2.2. CP Gene Sequence Analysis

A total of 370 recombinant clones (3–5 clones from each isolate) were sequenced from 101 PRSV-W isolates in this study. The complete CP gene sequence of all PRSV-W clones was 864bp, which translated into 287 aa residues except one clone, which had 24 nucleotides (nt) deletion at the N-terminal region of the CP gene. The nt identity among consensus CP sequences of 101 PRSV-W isolates from this study ranged from 96 to 100%, and aa identity was 98–100%. The 165 Oklahoman isolates (101 from this study and 64 isolates from the previous study [12]) had 86–98% nt identity and 87–98% aa identity with other isolates from around the globe (Table 1).

2.3. Phylogenetic Relationship among PRSV-W Populations

The maximum likelihood (ML) phylogenetic tree constructed from the CP gene with 101 PRSV-W isolates in this study showed three distinct phylogroups circulating in cucurbit fields of Oklahoma (Figure 1). Phylogroup 1 was further divided into three subgroups. Subgroup 1a included 28 isolates that belong to four counties (Caddo, McCurtain, Muskogee, and Blaine). Subgroup 1b had 20 isolates from three counties (Muskogee, McCurtain, and Cimarron). Subgroup 1c had three isolates from Blaine County. Phylogroup 2 consisted of two subgroups; 2a consisted of 9 isolates collected from Tulsa County, and 2b of 15 isolates from three counties (Blaine, Cimarron, and Tulsa). Phylogroup 3 consisted of two subgroups; 3a consisted of 11 isolates from Blaine, and 3b contained 10 isolates from two counties (Blaine and McCurtain). The ML phylogenetic tree made with sequences from this study and 64 sequences from the previous study [12] also showed a similar pattern (Supplementary Figure S2). Although that study included PRSV-W isolates from two counties (Atoka and Jefferson) not included in this study, they also clustered together with isolates from other counties.

Selected PRSV-W isolates: 33 out of 101 from this study and 73 retrieved from GenBank were used to deduce the ML phylogenetic tree with the CP gene of WMV as an outgroup (Figure 2). The selection of sequences from Oklahoma for global phylogenetic analysis was based on their phylogenetic position within Oklahoma isolates. Four distinct clusters of PRSV-W isolates were observed, loosely based on geographical location. All Asian isolates were grouped into Cluster 1 (C1), while Cluster 2 (C2) contained both North and South American isolates, except isolates from Oklahoma. Cluster 3 (C3) included all the isolates from Oceania, while Cluster 4 (C4) contained all the isolates from Oklahoma. The two isolates from Europe clustered together with American (French isolate) and Oceania isolates (Polish isolate). In addition, another phylogenetic tree was constructed using 148 PRSV isolates that contained both PRSV strains (P and W), including 42 isolates of PRSV-P retrieved from GenBank (Supplementary Figure S3). The Oklahoma isolates are again grouped in a separate cluster (C4). The clustering pattern was similar to PRSV-W, with Asian isolates in C1, other American isolates in C2, and Oceanian isolates in C3.

2.4. Genetic Variation among PRSV-W Populations

The overall genetic diversity (d) in the CP gene of PRSV-W isolates from Oklahoma was 0.020; the average number of nt differences (K) was 16.09, and nt diversity (π) was 0.019 (Table 2). PRSV-W isolates from Caddo County were the least diverse (d = 0.001, K = 2.07, π = 0.002), while isolates from Blaine County were the most diverse (d = 0.023, K = 18.02, π = 0.022). The estimates of evolutionary distances in the CP gene between PRSV-W isolates from different counties of Oklahoma showed that populations from Caddo County and Muskogee County were least distant (d = 0.003), and populations between McCurtain County and Tulsa County were most distant (d = 0.036) (Supplementary Table S3). The estimates of evolutionary divergence in the CP gene among PRSV-W isolates within different hosts were fairly consistent with genetic distance and nucleotide diversity in the range of 0.15 < d < 0.24 and 0.015 < π < 0.02, respectively. Similarly, the divergence in the CP gene between PRSV-W populations from different hosts was also close to the mean overall distance (Supplementary Table S3). The genetic distances within the CP gene of PRSV-W populations collected in 2016, 2017, and 2018 were 0.012, 0.013, and 0.024, respectively. Similarly, the nt diversities in the CP gene among PRSV-W in 2016, 2017, and 2018 were 0.013, 0.014, and 0.015, respectively. The CP gene of PRSV-W populations collected between 2016 and 2017 had a genetic distance of 0.014; between 2016 and 2018 was 0.022, while between 2017 and 2018 was 0.021. The mean genetic distances (and nt diversity) within PRSV-W populations in phylogroups 1, 2, and 3 were 0.003 (π = 0.04), 0.015 (π = 0.015), and 0.019 (π = 0.016), respectively (Table 2). The divergence between phylogroup 1 and 2 was 0.027, phylogroup 1 and 3 was 0.024, and between phylogroup 2 and 3 was 0.036 (Supplementary Table S3). The haplotype diversity (Hd) for each of the aforementioned groups was high, with an overall Hd of 0.98 (Table 2).

For further analysis, we used 33 selected CP gene sequences from this study, 73 PRSV-W isolates, and 42 PRSV-P isolates from GenBank to calculate genetic diversity among PRSV global isolates (Table 3). The overall mean genetic diversity in the CP gene of all selected PRSV-W isolates (n = 106) was 0.071, and all PRSV isolates (n = 148) was 0.085 (Table 3). The overall average number of nt differences (K) between PRSV-W isolates was 51.71, and nt diversity (π) was 0.07 (Table 3). The average number of nt differences and nt diversity was higher (K = 61.89, π = 0.08) in the overall PRSV group (n = 148). Between different phylogroups of PRSV, Asian isolates were most diverse (d = 0.11, K = 74.95, π = 0.09) and Oceania isolates were least diverse (d = 0.02, K = 18.27, π = 0.021). The haplotype diversity for each of these groups was >0.95, with an overall Hd of 0.99 (Table 3 and Supplementary Table S4).

2.5. Population Differentiation within PRSV-W Populations

The total genetic variability in a population compared to the total population and gene flow estimated from F statistics (Fst) showed that there is infrequent gene flow of PRSV-W between different counties (Fst >0.33) except between Blaine and McCurtain counties (Fst=0.24) and Caddo and Muskogee (Fst =0.14), where gene flow was frequent (Table 4). Similarly, the Nm value for all pairs of counties was <1 except CD and MK (Nm = 1.60). The Fst values for all pairs of PRSV-W hosts were <0.33, with only three pairs of hosts (cucumber and pumpkin, cucumber and watermelon, and pumpkin and squash) having an Nm value <1. The three phylogroups of PRSV-W within this study had infrequent gene flow and distinct population differentiation. However, the population of PRSV-W within the three years of the collection had frequent gene flow (Fst < 0.33, Nm > 1) but were genetically distinct based on other statistical tests. The P values for all permutation-based tests Ks, Ks*, Z*, and Snn were <0.01 for almost all the population groups showing significant genetic differentiation (Supplementary Table S5). The values of all three neutrality tests (Fu and Li’s D and F and Tajima’s D) for all population groups were negative and occasionally significant (Supplementary Table S6). The negative values on neutrality tests indicate polymorphisms within PRSV-W populations.

Population differentiation analyses were performed for different PRSV populations based on their phylogenetic groups and geographical locations. The Fst value for PRSV-W populations between Asia and other parts was >0.33, indicating an absence of gene flow between these groups (Table 5). The significant Fst value was also supported by low Nm values. All other PRSV-W groups had an Fst value <0.33. However, when PRSV-P isolates were added to the study, the Fst value for all groups dropped below 0.33. All the permutation-based tests confirmed that there was distinct genetic differentiation between these groups (Supplementary Table S7). All the phylogroups from various parts of the world had non-significant negative values on three neutrality tests (Supplementary Table S8).

2.6. Mutation Frequency

The overall mutation frequency (f) among 370 sequences obtained from 101 PRSV-W isolates was 1.18 × 10⁻³. The mutation frequency within a single infection of PRSV-W isolates was 1.22 × 10^−3, and that of PRSV-W mixed with one or more viruses was 1.15 × 10⁻³. The average mutation frequencies for each county, host, and collection year in a single and mixed population were determined and compared (Table 6, Table 7 and Table 8). The average mutation frequency in different counties ranged from 0.79 × 10⁻³ (McCurtain County) to 1.37 × 10⁻³ (Tulsa County). The mutation frequencies of the virus in different counties were generally higher in single infections than in mixed infections, except in Blaine County (Table 6). Similarly, mutation frequencies in different hosts ranged from 0.98 × 10⁻³ (squash) to 1.5 × 10⁻³ (cucumber). The mutation frequencies in single and mixed infections were fairly close to the overall mutation frequency within the same host (Table 7). The average mutation frequencies of PRSV-W populations in 2016 was 1.23 × 10⁻³, which increased slightly in 2017 (1.3 × 10⁻³) and was down to 1.07 × 10⁻³ in 2018. As in the case of different hosts, mutation frequencies in single and mixed infections were similar to overall mutation frequency in all three years. The average mutation frequency (1.60 × 10⁻³) in the C-terminal region of the CP gene was highest among the three regions of the gene, followed by the N-terminal (1.10 × 10⁻³) and core region (1.01 × 10⁻³).

2.7. Selection Pressure Analysis

In almost all aforementioned populations, the number of non-synonymous mutations exceeded synonymous mutations. However, dN/dS of value did not equal or exceed 1 in any of the populations, indicating negative (purifying) selection. The selection analysis using four independent tests (FUBAR, FEL, MEME, and SLAC) available in the Datamonkey server showed a number of codons undergoing negative (purifying) selection (Table 9). While FEL, MEME, and SLAC did not show evidence of positive selection within the CP gene of Oklahoman isolates, FUBAR analysis showed two codons at positions 76 and 172 undergoing positive selection. All four tests showed the presence of a few positively selected codons in PRSV global populations. At least two tests showed evidence of positive selection in codon positions 14, 16, 43, 48, 82, 90, and 256.

2.8. Mutational Pattern

There were 376 substitution mutations among 370 clones of 101 PRSV-W isolates (Figure 3). Transition (293, 77.93%) were three-fold higher than transversion (82, 22.07%). A substitution from Adenine (A) to Guanine (G) was most common (110, 29.26%), followed by its reverse substitution G–A (80, 21.28%). Another transition substitution, Thymine (T) to Cytosine (C) (60, 15.96%) and its reverse substitution C–T (39 (10.37%), were also frequent. All transversion had a frequency <8%, with substitution from A to T being most of the lot (27, 7.18%), while substitution from C to G was null. There was one clone of PRSV-W from Caddo County collected in 2016 (CD-2) with a deletion of 24 nt from positions 57–80. Among those 376 substitutions, 218 were non-synonymous and had 77 combinations of aa substitutions. Twenty-six of those aa substitutions were only observed once, and twenty-four were only observed two times. The most common aa substitution among the CP gene of the PRSV-W population involved Lysine to Arginine, with a frequency of 14 out of 218 non-synonymous substitutions (6.34%) (Table 10). Similarly, a change of aa from Asparagine to Aspartic acid was observed 12 times. The aa changes from Arginine to Lysine (9 times), Alanine to Valine, Glutamic acid to Glycine, and Leucine to Proline (8 times) were also frequent. There were also two mutations leading to the stop codon. The top 5 aa involved in non-synonymous substitutions were Arginine, Lysine, Asparagine, Aspartic acid, and Glycine (Figure 4). There were no mutations observed involving tryptophan. A total of 231 sites (nt positions) had at least one substitution. The number of non-synonymous sites was 145, and that of synonymous sites was 86. Substitution on 12 of these sites occurred >3 times, 20 sites occurred 3 times, 53 sites occurred 2 times, and 146 sites occurred only once. nt positions 44 and 495 of the CP genome had the highest number of nt changes (9), followed by positions 480 and 849, with 6 nt changes (Figure 5a–c). Among these 4 nt changes, the only substitution on position 44 (aa position 15) was non-silent. The other frequent non-synonymous sites were aa positions 266 (5 changes), 61, and 278 (4 changes).

The entire CP gene was subdivided into three regions: N-terminal, core, and C- terminal. The N-terminal region included 1–195 nt (1–65aa), the core region included 196–654 nt (66–218aa) and the C-terminal included 655–864 nt (219–287aa). Out of 287 aa in the CP gene, 229 aa sites were conserved among all PRSV-W isolates obtained in this study from Oklahoma. The core region was highly conserved as nearly 78% of the sites were completely conserved (without any mutations), in addition to 8.5% of the sites with silent mutations (Table 11). Conversely, the N region had only 62.6% of the sites, which were conserved with a higher number of mutational hotspots in the region. The mean genetic diversity was also highest (0.019) in the N-region. Most of the frequent non-silent mutations were observed in either the N-terminal or C-terminal of the CP gene. The most conserved site within CP was at the core region, with 29 consecutively conserved aa sequences from positions 121 to 149 among the consensus sequences from this study.

3. Discussion

This study confirms our first hypothesis that there is a strong purifying selection within the PRSV-W populations. The selection is likely acting on removing deleterious mutations caused by error-prone replication. The high mutation frequency within all PRSV-W populations from Oklahoma confirms that the population exists as mutant clouds compatible with the quasispecies concept. The PRSV-W populations within this study were collected in a relatively quick time frame, which makes high mutant clouds the likely outcome [23]. Additionally, these mutation frequencies were remarkably consistent in all the populations (based on geography, hosts, collection years, and single/mixed infections) within the range of 10⁻³. Mutations are the major driving forces for monopartite RNA virus variation in addition to recombination. The error rate of RNA replication ranges from 10⁻³–10⁻⁵ base per copying cycle giving rise to high diversity within populations due to large mutant clouds. The high mutation rates reflect an evolutionary strategy as these mutant clouds usually work in favor of viruses for adaptation during environmental stress [24,25]. The majority of the mutation events in this study were substitutions, and rarely any insertion/deletions (indels) were found. The indel mutations are usually rare but are lethal in most cases [26]. Due to the abundance of deleterious mutations, the level of dN is generally higher in closely related sequences compared to distantly related sequences [27], which explains slightly higher purifying selection in Oklahoma isolates compared to global isolates.

The replication rates of most RNA viruses are swift, so they are able to reach exceptionally large population sizes within a brief period of time [26]. However, this large population size is not, in fact, an effective population size, as a substantial part of this population consists of mutants that will not pass to the next generation [25]. The genetic bottleneck reduces the population size below a threshold level to facilitate the transmission of fittest variants, thereby limiting the size of the effective population [28]. The genetic bottleneck usually is the product of the biology of the vector and its feeding habit and can also occur at different moments of the viral life cycle, such as virus movement between plant cells during systemic infection and horizontal transmission [29,30,31,32]. In addition, the purifying selection helps viruses maintain genetic stability by eliminating less-fit mutants with deleterious effects [33]. The genetic variations due to mutation are also structured by gene flow [34]. The gene flow among different hosts, geographical regions, and different parts of the same plant helps in shaping the global genetic diversity [35]. The low level of long-distance movement or gene flow might be the reason behind the non-uniform and variable viral populations in this study. However, this low level of gene flow was enough to accommodate variants from different phylogroups occurring in the same geographical area.

Utmost care was taken to reduce the mutations during RT-PCR steps by employing a number of strategies: high-fidelity reverse transcriptase and Taq polymerase were used, the number of PCR cycles was limited to 25, and any mutations found in only one direction of Sanger sequencing were not considered. Despite these precautions, there is a chance that some of these mutations might be due to experimental error. Similar to the study conducted by Simmons et al. [36], we calculated the highest possible number of erroneous mutations due to RT-PCR. The total number of possible mutations due to RT was ∼9 (2.9 × 10⁻⁵ mutations × 864 sites × 370 clones), and PCR was ∼7 (2.28 × 10⁻⁵ mutations × 864 sites × 370 clones), which adds to a total of ∼16 mutations. Even if we deduct these (possible) artifact mutations (16) from the total number of mutations (376) observed in the study, the mutation frequency remains in a similar range. However, the actual erroneous mutations due to experimental error might be significantly less than these calculated values due to the aforementioned experimental considerations.

The PRSV-W isolates collected from the same county, host, and growing season were grouped in different phylogroups, while isolates collected from different counties were grouped in the same phylogroup. This rejects our second hypothesis that these factors play a role in phylogenetic clustering. For instance, PRSV-W isolates collected from Blaine County in a single growing season (2017) from the same host (pumpkin) grouped in three different phylogroups (Figure 1). Similarly, PRSV-W isolates from Blaine, Cimarron, and McCurtain counties collected in 2018 fell in two different phylogroups. This diversity is also well supported by the higher within-group mean evolutionary distance of PRSV-W isolates from these counties compared to others (Table 2). Some of the isolates from geographically distant locations (Muskogee and Caddo counties) even had identical sequences. In addition, isolates from two far corners of Oklahoma with more than 900 kilometers of distance were grouped together closely in the same phylogroup (phylogroup 1), irrespective of their collection year and host. The close evolutionary distance between isolates from various locations might have caused the close evolutionary relationship. The lack of geographical connectivity in phylogeny among these isolates can be attributed to different possibilities. First, all these isolates from Oklahoma might have been derived from the same most recent common ancestors (MRCA). Second, the virus or virus harboring aphids likely travels with harvested plants and fruits to various parts of the state, thereby facilitating spread in new locations. In both cases, the virus population can use the wild host as their reservoir during times other than the growing seasons of their primary hosts [13]. In addition, none of the phylogroups within Oklahoma had distinct fixed mutations in terms of nt and aa, indicating the recent common ancestry of all these populations. The anomaly was the isolates from Tulsa, which had three distinct aa changes compared to other populations at positions 44 (Alanine–Threonine), 76 (Valine–Isoleucine), and 120 (Serine–Asparagine).

Similarly, other parameters considered in the study, viz. host and collection years, also did not have a significant effect on phylogeny. For instance, the genetic differentiation between different hosts and their diversity was not significant in all populations of PRSV, and none of the frequent mutations were observed in specific hosts. The PRSV-W virus isolates collected at different points of time (2008–2018) from the same location did not cluster according to the collection years. However, if the isolates collected in the same year fell in the same phylogroup, they tended to group together, indicating a loose association between collection time and their evolutionary fate. This is further bolstered by the fact that mutation frequencies of virus isolates collected from different hosts and in different collection years remained highly similar (Table 8 and Table 9).

The clustering pattern of global isolates showed distinct geographical clustering, with Asian, American, and Oceania isolates falling in different phylogroups. The two European isolates from France and Poland were anomalous as they were grouped with two distinct groups. More isolates from Europe are needed to evaluate if all of them cluster together with either American and Oceanian isolates or form separate clusters among themselves. The grouping pattern in the phylogeny of PRSV-W in this study is similar to recent studies [37,38], which showed that isolates from different parts of the world grouped together with one phylogroup and a few Asian isolates in different phylogroup. The PRSV isolates from other parts of the US were close to Mexican and other American isolates, as observed previously [39,40]. The distinct geographical clustering of the PRSV population based on continents shows PRSV-W populations do not have recent travel history across the continents (North and South America are referred to here as ‘Americas’). While the aforementioned (refer to the previous paragraph) factors explain geographical connectivity among virus populations in nearby locations, longer distance movement from these modes of transmission is unlikely. Similar to the clustering pattern in Oklahoma, the global isolates did not have distinct phylogenetic differentiation among hosts and collection years.

More than 99% of the mutations observed were substitution mutations. These substitution mutations were biased towards transitions with a high proportion (>75%) of purine to purine or pyrimidine to pyrimidine nt change (Figure 4). The transition mutation biases are common in viral systems and were noted in several previous studies [41,42,43,44]. All the combinations of nt substitutions involving G and C had the mutations favoring change to these nt, thereby favoring gain of net GC content. This net gain of GC content was also observed in a study conducted by Nigam et al. [44]. In addition, the resulting aa changes from these GC-rich mutations mostly involved aa Arginine, Lysine, Asparagine, Aspartic acid, Glycine, and Alanine with either loss or gain of the net charge. Interestingly, all these aa are also disorder-promoting [45]. Conversely, the order-promoting aa, such as Tryptophan and Cysteine, were rarely involved in substitution mutations.

Mutation frequency within the N-terminal region of CP was highest, followed by the C-terminal region and core region. Although mutations were frequent in core regions, they were disproportionately silent and included less positively selected sites in comparison to N- and C-terminal regions. This further consolidates the evidence of a highly conserved core region. All but seven isolates from McCurtain County, which were collected in 2018, had a DAG motif at amino acid positions from 7 to 9 in this study. These seven isolates have Threonine instead of Alanine. The Alanine to Threonine mutation was also observed in three other global isolates of PRSV; one from the USA and two from Taiwan. In addition, few isolates had NAG, DSG, and DSA instead of DAG motif. The DAG motif, highly conserved among Potyvirus CP genes, has a significant role in virus transmission by aphids [46] and is exposed on the viral surface [47]. However, the mutation in this region has been reported in many studies with efficient aphid transmission [48,49,50,51,52,53]. The other two motifs, PTK and KITC, present in another Potyvirus gene, Hc-Pro, and their interaction with the CP gene also have a vital role in viral transmission by aphids [54,55], and these motifs could facilitate aphid transmission in the absence of the DAG motif [49]. A number of conserved motifs described in Potyvirus CP previously were present in PRSV-W populations in this study with minor or no mutations. In addition, there were more highly conserved motifs in the core region of CP, which are specific to PRSV (Table 12). These conserved motifs were present in PRSV populations regardless of the biotype and might carry some evolutionary role. Further study is desired to decode the evolutionary messages conveyed by these motifs. The presence/absence of these motifs nevertheless could be useful in diagnostic tools such as primer design.

Recombination analysis of all 101 PRSV-W isolates sequenced in this study, as well as all 165 Oklahoman PRSV-W isolates (101 isolates from this study and 64 isolates from the previous study), was conducted using RDP software. Only two recombination events were detected by two algorithms in 101 PRSV-W isolates and were not significant (data not shown). These results indicate that recombination events are not frequent in the CP-gene-coding region.

To our knowledge, this is the first study on mutational analysis within the quasispecies population of PRSV-W isolates. The present study provides a broad analysis of a wide range of PRSV-W populations isolated from diverse geographical locations of the state, host, and collection time based on the various aspects of evolution such as mutation, genetic differentiation, and phylogeny. The insights provided by this study will enhance existing knowledge of PRSV-W evolution and epidemiology and will be helpful in developing viable management strategies. Specifically, the high diversity of PRSV populations in different geographical locations and the possibility of multiple viral introductions in the same geographical location demand careful consideration towards accommodating different genetic aspects of the virus in multiple locations while developing sustainable control strategies.

4. Materials and Methods

4.1. Sample Collection and Detection of PRSV-W

Surveys were conducted during the three growing seasons of 2016–2018 in multiple counties of Oklahoma from different cucurbit hosts, and dot immunobinding assay (DIBA) was performed against 10 different viruses [13]. Confirmation of PRSV-W was completed by RT-PCR using the primers specific for the CP gene of PRSV-W (PRSVCPF; 5ʹCTGATGATTATCAACTTGTT3ʹ, PRSVCPR; 5ʹTAAGGTGAAACAGGGTGGAG3ʹ) as described previously with minor modifications [13]. Total RNA was extracted by the Tri-reagent (Molecular Research Centre Inc, USA) method using 100 mg of infected plant tissue [17,63,64]. PRSV-W positive isolates were also tested with WMV- and ZYMV-specific primers as described previously [13]. High-fidelity DNA polymerase enzyme (Pfu, Stratagene) was used along with Taq polymerase, and the number of PCR cycles was limited to 25 to reduce potential mutations generated during RT-PCR.

4.2. Cloning and Sequencing

The purified PCR products from each isolate were ligated in the pGEM-T Easy Vector (Promega Corp, Madison, WI, USA). The ligated products were transformed into Escherichia coli DH5α competent cells (New England Biolabs, Ipswich, MA, USA) and were subjected to blue–white screening using Luria-Bertani agar (LBA), carbenicillin, isopropyl-thiogalactopyranoside (IPTG), and X-gal. Three to five clones from each isolate were used for Sanger sequencing in both directions using applied biosystems 3130 at the Department of Biological Science, the University of Tulsa, Oklahoma [65]. The details of virus isolates, their host, geographical location, and collection year are depicted in Supplementary Table S1.

4.3. Sequence Analysis

The purified PCR products were sequenced in both forward and reverse directions using Sanger sequencing method. Sequences were retrieved, analyzed in Finch TV, and compared with GenBank sequences using the basic local alignment search tool (BLAST). Sequence alignment was completed for the clones using the Clustal X program and MEGAlign™ incorporated within the DNASTAR suite of programs (Madison, WI, USA). For CP sequences, consensus nt sequences for each isolate were obtained using the Editseq™. The consensus sequences of each virus isolate were deposited in the NCBI database, and their GenBank accession numbers are listed in Supplementary Table S1.

4.4. Phylogenetic Analysis

Consensus sequences of each isolate for CP gene of PRSV-W were used for phylogenetic analysis. The MEGA7 [66] software was used to construct four sets of ML phylogenetic trees: first, comparing PRSV-W isolates within this study (n = 101); second, comparing PRSV-W isolates from Oklahoma (101 isolates from this study and 64 selected isolates from GenBank); third, PRSV-W isolates from around the world (33 selected isolates from this study and 73 isolates from GenBank); and fourth, comparing all PRSV isolates irrespective of W or P strains (33 selected isolates from this study and 115 isolates from GenBank). The CP gene sequences of PRSV-W and PRSV-P retrieved from GenBank for phylogenetic analysis are listed in Supplementary Table S2. The best fit model selection test was completed in MEGA7, and the model with the lowest BIC value was selected for each phylogenetic tree reconstruction. For evaluation of statistical confidence in tree nodes, 1000 bootstrapping was completed. Each of these trees was visualized in Figtree version 1.4.3.

4.5. Genetic Diversity and Population Genetics

4.5.1. Genetic Diversity

Genetic diversity of all CP sequences of PRSV-W isolates was determined based on host, geographical location, year of collection, and their respective phylogroups within Oklahoma using the Kimura 2 parameter in MEGA7. For each of the aforementioned categories, the number of haplotypes, haplotype diversity, number of segregating sites, average number of nt differences, and average nt diversity was calculated using DNASP6. These analyses were also completed separately for PRSV-W isolates and PRSV isolates (including both strains) from around the world.

4.5.2. Gene Flow and Population Differentiation

The extents of genetic flow and differentiation were estimated using the fixation index (Fst) and the number of migrants successfully incoming per generation (Nm) values. The Fst value ranges from 0, indicating no genetic differentiation, to 1, indicating clear differentiation. The absolute Fst value of >0.33 indicates infrequent gene flow between the populations. Nm < 1 indicates reduced gene flow and increased genetic drift, which results in local population differentiation [67].

Similarly, the population differentiation was analyzed using permutation-based statistical tests Ks, Ks *, Z *, and Snn [67]. These tests are considered the most powerful statistical tests for analyzing sequence-based genetic differentiation among the highly mutating population in small sample sizes [68]. The Ks * was calculated as the average number of differences between sequences regardless of geographical origin. Under the null hypothesis, Kst * is expected to be near zero, meaning there is no genetic differentiation. The Z * statistic is a logarithmic variant of the rank statistic (Z), and smaller values indicate less genetic differentiation, and higher values with significant P values indicate higher genetic differentiation. The nearest neighbor statistic (Snn) measures the frequency of nearest neighbor sequences in the same locality. The value of Snn ranges from ½ in the case of panmixia to 1 when populations are distinctly differentiated.

4.5.3. Neutrality Tests

The values of segregating sites (S), the average number of nt differences (K), and a total number of mutations were used for testing the neutrality hypothesis using Tajima’s, Fu, and Li’s DandF among different population groups. Tajima’s D test is based on the differences between the two estimators, Tajima’s estimator (based on K) and Watterson’s estimator (based on S) [68]. The positive value on Tajima’s D statistic means an abundance of polymorphic alleles, while negative values indicate the presence of rare alleles. Fu and Li’s D test is based on the difference between singleton mutation sites and total mutation sites, and Fu and Li’s F test is based on a difference in singleton mutation sites and K [69]. The negative values for these tests mean a low-frequency polymorphism [70].

4.5.4. Selection Analysis

Hyphy and Datamonkey packages were used for the isolate-selection analysis of different PRSV and PRSV-W populations. The dN/dS value was determined by the HyPhy packages included in MEGA7. The dN/dS value <1 shows evidence of the purifying selection, dN/dS=1 indicates neutral selection, and dN/dS >1 indicates positive selection. Similarly, FUBAR, FEL, MEME, and SLAC programs incorporated in Datamonkey (https://www.datamonkey.org, accessed on 5 March 2021 were used to deduce the numbers of positively and negatively selected codons.

4.5.5. Mutation Frequency and Pattern within the CP Gene

The total mutation count in the CP gene was completed for each isolate. The consensus nt sequence was inferred from 3–5 clones/isolate, and all mutations (substitution or indels) observed in those clones were counted. The total number of mutations within the different populations was determined, and mutation frequency was calculated using the following formula:

Mutation frequency = total number of mutations in n isolates with m clones/total number of nt in n × m

Whenever multiple nt mutations occurred consecutively, each individually mutated base was counted [43,71]. Any mutations that were only observed in either the forward or reverse direction were not considered as true mutations to reduce the possibility of artifact mutations. Mutation frequencies were determined for each isolate, location (county of collection), host, year of collection, and type of infection (single or mixed). For each of these categories, synonymous and non-synonymous mutations were also determined.

Supplementary Materials

The following are available online at https://0-www-mdpi-com.brum.beds.ac.uk/article/10.3390/pathogens10101278/s1, Figure S1: Geographical locations of different counties of Oklahoma; Figure S2: Maximum likelihood (ML) phylogenetic tree constructed in MEGA 7 using general time reversible (GTR) model for coat protein gene of 165 PRSV-W isolates from Oklahoma (101 from this study and 64 from GenBank; Figure S3: Maximum likelihood (ML) phylogenetic tree constructed in MEGA 7 using general time reversible (GTR) model for coat protein gene of 148 PRSV isolates (33 from this study and 115 from GenBank). Table S1: List of papaya ringspot virus-W isolates collected and sequenced the coat protein gene in this study; Table S2: Nucleotide sequences of the coat protein gene of papaya ringspot virus isolates (W and P) retrieved from GenBank. Table S3: Estimates of evolutionary divergence among the coat protein gene sequences of papaya ringspot virus-W isolates collected from different counties of Oklahoma, hosts, collection years and phylogroups; Table S4: Estimates of evolutionary divergence among the coat protein gene sequences of papaya ringspot virus isolates (both W and P strains) different phylogroups; Table S5: Genetic differentiation estimates in the coat protein gene sequences of papaya ringspot virus-W isolates from different counties, hosts, phylogroups and collection years; Table S6: Neutrality tests of coat protein gene sequences among the PRSV-W isolates from different counties, hosts, phylogroups and collection years; Table S7: Gene flow and genetic differentiation estimates based on the coat protein gene sequences of papaya ringspot virus –W isolates between different phylogroups from around the world; Table S8: Neutrality tests of coat protein gene sequences of papaya ringspot virus isolates among different population groups.

Author Contributions

V.K. and A.A. collected samples. V.K. performed the laboratory experiments, and wrote the original draft. A.A. conceived the idea, acquired the funding, supervised the project, and reviewed and edited the draft. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the US Department of Agriculture’s (USDA) Agricultural Marketing Service through grant 15SCBGPOK0019. The contents are solely the responsibility of the authors and do not necessarily represent the official views of the USDA.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All 101 coat protein gene sequences of PRSV-W isolates presented in this study (Supplementary Table S1) were submitted to NCBI database. The accession numbers from (MZ099456-MZ099556) can be found online https://0-www-ncbi-nlm-nih-gov.brum.beds.ac.uk/genbank/ (accessed on 1 September 2021).

Acknowledgments

The authors also acknowledge extra funding support from the Office of Research and Sponsored Programs, The University of Tulsa, Oklahoma.

Conflicts of Interest

No conflict of interest.

References

Tripathi, S.; Suzuki, J.Y.; Ferreira, S.A.; Gonsalves, D. Papaya ringspot virus-P: Characteristics, pathogenicity, sequence variability and control. Mol. Plant Pathol. 2008, 9, 269–280. [Google Scholar] [CrossRef]
Gonsalves, D.; Tripathi, S.; Carr, J.B.; Suzuki, J.Y. Papaya ringspot virus. Plant Health Instr. 2010, 10, 1094. [Google Scholar] [CrossRef]
Akhter, M.S.; Basavaraj, Y.B.; Akanda, A.M.; Mandal, B.; Jain, R.K. Genetic Diversity Based on Coat Protein of Papaya ringspot virus (Pathotype P) Isolates from Bangladesh. Indian J. Virol. 2013, 24, 70–73. [Google Scholar] [CrossRef] [Green Version]
Chen, K.-C.; Chiang, C.-H.; Raja, J.A.J.; Liu, F.-L.; Tai, C.-H.; Yeh, S.-D. A Single Amino Acid of NIaPro of Papaya ringspot virus Determines Host Specificity for Infection of Papaya. Mol. Plant-Microbe Interact. 2008, 21, 1046–1057. [Google Scholar] [CrossRef] [Green Version]
Wylie, S.J.; Adams, M.; Chalam, C.; Kreuze, J.; Lopez-Moya, J.J.; Ohshima, K.; Praveen, S.; Rabenstein, F.; Stenger, D.; Wang, A.; et al. ICTV Virus Taxonomy Profile: Potyviridae. J. Gen. Virol. 2017, 98, 352–354. [Google Scholar] [CrossRef] [PubMed]
Ali, A. Epidemiology and evolution of poytviruses infecting cucurbits. In Applied Plant Virology, 1st ed.; Awasthi, L.P., Ed.; Academic Press: Cambridge, MA, USA, 2020; pp. 405–417. [Google Scholar]
Revers, F.; García, J.A. Molecular Biology of Potyviruses. Adv. Virus Res. 2015, 92, 101–199. [Google Scholar] [CrossRef]
Chung, B.Y.-W.; Miller, W.A.; Atkins, J.; Firth, A.E. An overlapping essential gene in the Potyviridae. Proc. Natl. Acad. Sci. USA 2008, 105, 5897–5902. [Google Scholar] [CrossRef] [Green Version]
Brault, V.; Uzest, M.; Monsion, B.; Jacquot, E.; Blanc, S. Aphids as transport devices for plant viruses. Comptes Rendus Biol. 2010, 333, 524–538. [Google Scholar] [CrossRef] [PubMed]
Andersen, K.; Johansen, I. A Single Conserved Amino Acid in the Coat Protein Gene of Pea Seed-Borne Mosaic Potyvirus Modulates the Ability of the Virus to Move Systemically in Chenopodium quinoa. Virology 1998, 241, 304–311. [Google Scholar] [CrossRef] [Green Version]
Carbonell, A.; Maliogka, V.; Pérez, J.; Salvador, B.; León, D.S.; Garcia, J.A.; Simón-Mateo, C. Diverse Amino Acid Changes at Specific Positions in the N-Terminal Region of the Coat Protein Allow Plum pox virus to Adapt to New Hosts. Mol. Plant-Microbe Interact. 2013, 26, 1211–1224. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Abdalla, O.A.; Ali, A. Genetic diversity in the 3′-terminal region of papaya ringspot virus (PRSV-W) isolates from watermelon in Oklahoma. Arch. Virol. 2011, 157, 405–412. [Google Scholar] [CrossRef]
Khanal, V.; Wells, H.; Ali, A. High Prevalence of Three Potyviruses Infecting Cucurbits in Oklahoma and Phylogenetic Analysis of Cucurbit Aphid-Borne Yellows Virus Isolated from Pumpkins. Pathogens 2021, 10, 53. [Google Scholar] [CrossRef]
Yuki, V.A.; Rezende, J.A.M.; Kitajima, E.W.; Barroso, P.A.V.; Kuniyuki, H.; Groppo, G.A.; Pavan, M.A. Occurrence, distribution, and relative incidence of five viruses infecting cucurbits in the state of Sao Paulo, Brazil. Plant Dis. 2000, 84, 516–520. [Google Scholar] [CrossRef]
Walters, S.A.; Kindhart, J.D.; Hobbs, H.A.; Eastburn, D.M. Viruses associated with cucurbit production in southern Illinois. HortScience 2003, 38, 65–66. [Google Scholar] [CrossRef]
Papayiannis, L.C.; Ioannou, N.; Boubourakas, I.N.; Dovas, C.I.; Katis, N.I.; Falk, B.W. Incidence of Viruses Infecting Cucurbits in Cyprus. J. Phytopathol. 2005, 153, 530–535. [Google Scholar] [CrossRef]
Ali, A.; Mohammad, O.; Khattab, A. Distribution of Viruses Infecting Cucurbit Crops and Isolation of Potential New Virus-Like Sequences from Weeds in Oklahoma. Plant Dis. 2012, 96, 243–248. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Juarez, M.; Legua, P.; Mengual, C.; Kassem, M.; Sempere, R.; Gómez, P.; Truniger, V.; Aranda, M. Relative incidence, spatial distribution and genetic diversity of cucurbit viruses in eastern Spain. Ann. Appl. Biol. 2013, 162, 362–370. [Google Scholar] [CrossRef]
Herrera-Vásquez, J.A.; Córdoba-Sellés, M.C.; Cebrián, M.C.; Font-San-Ambrosio, M.I.; Alfaro-Fernández, A.; Jordá, C. Viruses of cucurbits in Panama. J. Plant Pathol. 2013, 95, 435–440. [Google Scholar]
Nagendran, K.; Mohankumar, S.; Aravintharaj, R.; Balaji, C.; Manoranjitham, S.; Singh, A.; Rai, A.; Singh, B.; Karthikeyan, G. The occurrence and distribution of major viruses infecting cucurbits in Tamil Nadu state, India. Crop. Prot. 2017, 99, 10–16. [Google Scholar] [CrossRef]
Drake, J.W.; Holland, J.J. Mutation rates among RNA viruses. Proc. Natl. Acad. Sci. USA 1999, 96, 13910–13913. [Google Scholar] [CrossRef] [Green Version]
Holmes, E.C. The Evolution and Emergence of RNA Viruses; Oxford University Press: New York, NY, USA, 2009. [Google Scholar]
Duchêne, S.; Holmes, E.; Ho, S.Y.W. Analyses of evolutionary dynamics in viruses are hindered by a time-dependent bias in rate estimates. Proc. R. Soc. B Boil. Sci. 2014, 281, 20140732. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Domingo, E.; Holland, J.J. RNA Virus Mutations and Fitness for Survival. Ann. Rev. Microbiol. 1997, 51, 151–178. [Google Scholar] [CrossRef] [PubMed]
García-Arenal, F.; Fraile, A.; Malpica, J.M. Variability and genetic structure of plant virus populations. Ann. Rev. Phytopathol. 2001, 39, 157–186. [Google Scholar] [CrossRef] [PubMed]
Elena, S.F.; Agudelo-Romero, P.; Carrasco, P.; Codoner, F.M.; Martin, S.; Torres-Barceló, C.; Sanjuán, R. Experimental evolution of plant RNA viruses. Heredity 2008, 100, 478–483. [Google Scholar] [CrossRef] [Green Version]
Hughes, A.L.; Hughes, M.A.K. More effective purifying selection on RNA viruses than in DNA viruses. Gene 2007, 404, 117–125. [Google Scholar] [CrossRef] [Green Version]
Ali, A.; Roossinck, M.J. Genetic bottlenecks. In Plant Virus Evolution; Springer: Berlin/Heidelberg, Germany, 2008; pp. 123–131. [Google Scholar]
French, R.; Stenger, D.C. Evolution of Wheat streak mosaic virus: Dynamics of population growth within plants may explain limited variation. Ann. Rev. Phytopathol. 2003, 41, 199–214. [Google Scholar] [CrossRef]
Ali, A.; Li, H.-Y.; Schneider, W.L.; Sherman, D.J.; Gray, S.; Smith, D.; Roossinck, M.J. Analysis of Genetic Bottlenecks during Horizontal Transmission of Cucumber Mosaic Virus. J. Virol. 2006, 80, 8345–8350. [Google Scholar] [CrossRef] [Green Version]
Ali, A.; Roossinck, M.J. Genetic bottlenecks during systemic movement of Cucumber mosaic virus vary in different host plants. Virology 2010, 404, 279–283. [Google Scholar] [CrossRef] [Green Version]
Kaye, A.C.; Moyer, J.W.; Parks, E.J.; Carbone, I.; Cubeta, M.A. Population Genetic Analysis of Tomato spotted wilt virus on Peanut in North Carolina and Virginia. Phytopathology 2011, 101, 147–153. [Google Scholar] [CrossRef] [Green Version]
Rubio, L.; Galipienso, L.; Ferriol, I. Detection of Plant Viruses and Disease Management: Relevance of Genetic Diversity and Evolution. Front. Plant Sci. 2020, 11, 1092. [Google Scholar] [CrossRef] [PubMed]
Roossinck, M.J. Plant RNA virus evolution. Curr. Opin. Microbiol. 2003, 6, 406–409. [Google Scholar] [CrossRef]
Moya, A.; Holmes, E.; González-Candelas, F. The population genetics and evolutionary epidemiology of RNA viruses. Nat. Rev. Genet. 2004, 2, 279–288. [Google Scholar] [CrossRef]
Simmons, H.E.; Holmes, E.C.; Stephenson, A.G. Rapid turnover of intra-host genetic diversity in Zucchini yellow mosaic virus. Virus Res. 2011, 155, 389–396. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Desbiez, C.; Wipf-Scheibel, C.; Millot, P.; Berthier, K.; Girardot, G.; Gognalons, P.; Hirsch, J.; Moury, B.; Nozeran, K.; Piry, S.; et al. Distribution and evolution of the major viruses infecting cucurbitaceous and solanaceous crops in the French Mediterranean area. Virus Res. 2020, 286, 198042. [Google Scholar] [CrossRef]
Maina, S.; Coutts, B.A.; Edwards, O.R.; De Almeida, L.; Ximenes, A.; Jones, R. Papaya ringspot virus Populations from East Timorese and Northern Australian Cucurbit Crops: Biological and Molecular Properties, and Absence of Genetic Connectivity. Plant Dis. 2017, 101, 985–993. [Google Scholar] [CrossRef] [Green Version]
Noa-Carrazana, J.C.; González-De-León, D.; Silva-Rosales, L. Molecular characterization of a severe isolate of papaya ringspot virus in Mexico and its relationship with other isolates. Virus Genes 2006, 35, 109–117. [Google Scholar] [CrossRef]
Silva-Rosales, L.; Becerra-Leor, N.; Ruiz-Castro, S.; Téliz-Ortiz, D.; Noa-Carrazana, J.C. Coat protein sequence comparisons of three Mexican isolates of papaya ringspot virus with other geographical isolates reveal a close relationship to American and Australian isolates. Arch. Virol. 2000, 145, 835–843. [Google Scholar] [CrossRef]
Mansky, L.M.; Temin, H.M. Lower in vivo mutation rate of human immunodeficiency virus type 1 than that predicted from the fidelity of purified reverse transcriptase. J. Virol. 1995, 69, 5087–5094. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Schneider, W.L.; Roossinck, M.J. Evolutionarily related Sindbis-like plant viruses maintain different levels of population diversity in a common host. J. Virol. 2000, 74, 3130–3134. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Schneider, W.L.; Roossinck, M.J. Genetic Diversity in RNA Virus Quasispecies Is Controlled by Host-Virus Interactions. J. Virol. 2001, 75, 6566–6571. [Google Scholar] [CrossRef] [Green Version]
Nigam, D.; LaTourrette, K.; de Souza, P.F.N.; Garcia-Ruiz, H. Genome-Wide Variation in Potyviruses. Front. Plant Sci. 2019, 10, 1439. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Campen, A.; Williams, R.; Brown, C.; Meng, J.; Uversky, V.; Dunker, A. TOP-IDP-Scale: A New Amino Acid Scale Measuring Propensity for Intrinsic Disorder. Protein Pept. Lett. 2008, 15, 956–963. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Atreya, P.L.; Lopez-Moya, J.J.; Chu, M.; Atreya, C.D.; Pirone, T.P. Mutational analysis of the coat protein N-terminal amino acids involved in potyvirus transmission by aphids. J. Gen. Virol. 1995, 76, 265–270. [Google Scholar] [CrossRef] [PubMed]
Ng, J.C.K.; Perry, K.L. Transmission of plant viruses by aphid vectors. Mol. Plant Pathol. 2004, 5, 505–511. [Google Scholar] [CrossRef]
Johansen, I.E.; Keller, K.E.; Dougherty, W.G.; Hampton, R.O. Biological and molecular properties of a pathotype P-1 and a pathotype P-4 isolate of pea seed-borne mosaic virus. J. Gen. Virol. 1996, 77, 1329–1333. [Google Scholar] [CrossRef]
Flasinski, S.; Cassidy, B.G. Potyvirus aphid transmission requires helper component and homologous coat protein for maximal efficiency. Arch. Virol. 1998, 143, 2159–2172. [Google Scholar] [CrossRef]
Lopez-Moya, J.J.; Wang, R.Y.; Pirone, T.P. Context of the coat protein DAG motif affects potyvirus transmissibility by aphids. J. Gen. Virol. 1999, 80, 3281–3288. [Google Scholar] [CrossRef] [PubMed]
Moradi, Z.; Mehrvar, M.; Nazifi, E.; Zakiaghl, M. Iranian johnsongrass mosaic virus: The complete genome sequence, molecular and biological characterization, and comparison of coat protein gene sequences. Virus Genes 2016, 53, 77–88. [Google Scholar] [CrossRef]
Wijayasekara, D.; Ali, A. Complete Genome Characterization and Coat Protein Genealogy of Isolates of Maize dwarf mosaic virus from Johnsongrass and Maize in Oklahoma and Missouri. Plant Dis. 2020, 104, 1214–1223. [Google Scholar] [CrossRef]
Moradi, Z.; Mehrvar, M. Molecular characterization of two highly divergent Iranian johnsongrass mosaic virus isolates from Zea mays. VirusDisease 2021, 32, 155–160. [Google Scholar] [CrossRef]
Huet, H.; Gal-On, A.; Meir, E.; Lecoq, H.; Raccah, B. Mutations in the helper component protease gene of zucchini yellow mosaic virus affect its ability to mediate aphid transmissibility. J. Gen. Virol. 1994, 75, 1407–1414. [Google Scholar] [CrossRef]
Blanc, S.; Dolja, V.V.; Garcia-Lampasona, S.; Baker, J.; Llave, C.; Ammar, E.D.; Pirone, T.P. Mutations in the potyvirus helper component protein: Effects on interactions with virions and aphid stylets. J. Gen. Virol. 1998, 79, 3119–3122. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Worrall, E.A.; Hayward, A.C.; Fletcher, S.J.; Mitter, N. Molecular characterization and analysis of conserved potyviral motifs in bean common mosaic virus (BCMV) for RNAi-mediated protection. Arch. Virol. 2018, 164, 181–194. [Google Scholar] [CrossRef]
Zheng, L.; Wayper, P.J.; Gibbs, A.J.; Fourment, M.; Rodoni, B.C.; Gibbs, M.J. Accumulating Variation at Conserved Sites in Potyvirus Genomes Is Driven by Species Discovery and Affects Degenerate Primer Design. PLoS ONE 2008, 3, e1586. [Google Scholar] [CrossRef] [PubMed]
Puli’Uvea, C.; Khan, S.; Chang, W.-L.; Valmonte, G.; Pearson, M.N.; Higgins, C.M. First complete genome sequence of vanilla mosaic strain of Dasheen mosaic virus isolated from the Cook Islands. Arch. Virol. 2016, 162, 591–595. [Google Scholar] [CrossRef] [PubMed]
Moura, M.F.; Marubayashi, J.M.; Mituti, T.; Gioria, R.; Kobori, R.F.; Pavan, M.A.; Krause-Sakate, R. Comparative analysis of coding region for the coat protein of PepYMV and PVY isolates collected in sweetpepper. Summa Phytopathol. 2012, 38, 93–96. [Google Scholar] [CrossRef] [Green Version]
Maciel, S.C.; Da Silva, R.F.; Reis, M.S.; Jadão, A.S.; Rosa, D.D.; Giampan, J.S.; Kitajima, E.W.; Rezende, J.A.M.; Camargo, L.E. Characterization of a new potyvirus causing mosaic and flower variegation in Catharanthus roseus in Brazil. Sci. Agric. 2011, 68, 687–690. [Google Scholar] [CrossRef]
Li, Y.; Jia, A.; Qiao, Y.; Xiang, J.; Zhang, Y.; Wang, W. Virome analysis of lily plants reveals a new potyvirus. Arch. Virol. 2017, 163, 1079–1082. [Google Scholar] [CrossRef] [PubMed]
Perotto, M.C.; Pozzi, E.A.; Celli, M.G.; Luciani, C.E.; Mitidieri, M.S.; Conci, V.C. Identification and characterization of a new potyvirus infecting cucurbits. Arch. Virol. 2017, 163, 719–724. [Google Scholar] [CrossRef]
Khanal, V.; Ali, A. First Complete Genome Sequence of Cucurbit Aphid-Borne Yellows Virus from Pumpkin in the United States. Microbiol. Resour. Announc. 2019, 8, e01448-18. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Khanal, V.; Ali, A. Complete genome sequence of a zucchini yellow mosaic virus isolated from pumpkin in Oklahoma. Microbiol. Resour. Announc. 2019, 8, e01583-18. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Khanal, V.; Ali, A. First Report of Cucurbit aphid-borne yellows virus Infecting Cucurbita pepo in Oklahoma. Plant Dis. 2018, 102, 1046. [Google Scholar] [CrossRef]
Kumar, S.; Stecher, G.; Tamura, K. MEGA7: Molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 2016, 33, 1870–1874. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Hudson, R.R.; Boos, D.D.; Kaplan, N.L. A statistical test for detecting geographic subdivision. Mol. Biol. Evol. 1992, 9, 138–151. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Tajima, F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 1989, 123, 585–595. [Google Scholar] [CrossRef] [PubMed]
Fu, Y.X.; Li, W.-H. Statistical tests of neutrality of mutations. Genetics 1993, 133, 693–709. [Google Scholar] [CrossRef]
Wei, T.-Y.; Yang, J.-G.; Liao, F.-L.; Gao, F.-L.; Lu, L.-M.; Zhang, X.-T.; Li, F.; Wu, Z.-J.; Lin, Q.-Y.; Xie, L.-H.; et al. Genetic diversity and population structure of rice stripe virus in China. J. Gen. Virol. 2009, 90, 1025–1034. [Google Scholar] [CrossRef] [PubMed]
Ali, A.; Roossinck, M.J. Analysis of quasispecies variation in single and mixed viral infection. Virus Evol. 2017, 3, vex037. [Google Scholar] [CrossRef] [Green Version]

Figure 1. Maximum likelihood (ML) phylogenetic tree constructed in MEGA 7 using general time-reversible (GTR) model for coat protein gene of 101 PRSV-W isolates in this study. Name of the isolates is shown at the tip of the tree. The bootstrap values >50 are shown at the respective nodes. The phylogenetic grouping is shown at right side of the phylogenetic tree. The PRSV-P isolate from Cuba was used as an outgroup.

Figure 2. Maximum likelihood (ML) phylogenetic tree constructed in MEGA 7 using general time-reversible (GTR) model for coat protein gene of 106 PRSV-W isolates (33 from this study and 73 from GenBank). The isolates from Oklahoma are shown in red color. GenBank accession number, isolate name, and country of origin are shown on each node (GenBank accession number is not shown for isolates from this study). The bootstrap values >50 are shown at the respective nodes. The phylogenetic grouping is shown on right side of the phylogenetic tree. The complete coat protein sequence of WMV isolated from Blaine County was used as an outgroup.

Figure 3. Nature of substitution mutations observed in 370 recombinant clones from 101 PRSV-W isolates. The Y-axis shows the frequency of nucleotide substitutions observed within the CP gene of PRSV-W isolates, and X-axis shows the type of nucleotide substitution.

Figure 4. Bar chart showing frequency of amino acids involved in substitution mutations within PRSV-W coat protein gene. The total number of amino acids and stop codons involved was 436, as there were 218 non-synonymous mutations.

Figure 5. Frequency of substitution mutations at different nucleotide positions within 370 recombinant clones from 101 PRSV-W isolates. Total of 864 sites was present, corresponding to each nucleotide in the coat protein region. (a) Frequency of total substitution mutations. (b) Frequency of silent mutations. (c) Frequency of non-silent mutations.

Table 1. Nucleotide and amino acid identities among PRSV-W isolates obtained in this study and their comparison to PRSV isolates from other countries.

Geographical Location of Isolates	Number of Isolates Used for Analysis	Nt Identity (%)	Amino Acid Identity (%)
This study	101	96–100	98–100
Oklahoma	165 ^a	96–100	98–100
Other parts of the USA	4	94–97	96–98
Australia	5	95–98	96–98
Bangladesh	2	88–90	87–93
Brazil	6	93–95	96–98
China	12	89–92	92–97
Columbia	2	93–95	95–96
Cuba	3	93–97	94–97
East Timor	3	91–92	95–96
France	1	95–97	97–98
India	16	86–95	89–97
Japan	5	89–91	91–94
Mexico	2	95–98	94–97
Myanmar	4	88–91	89–96
PNG	6	96–98	97–98
Poland	1	96–97	97–98
Taiwan	14	89–92	94–95
Thailand	2	90–91	92–93
South Korea	2	91–92	95–97
Venezuela	4	95–97	96–98

^a Represents isolates from this study and 64 isolates from the previous study [12].

Table 2. Population genetics analysis based on the coat protein gene sequences of PRSV-W isolates collected in different counties of Oklahoma: hosts, years of collection, and phylogroups.

County	No. of Sequences (n)	Mean Genetic Distance (d)	Haplotype Diversity (Hd)	Average # of Nt Difference (K)	Nt Diversity (π)	dN/dS
Blaine	99	0.023 ± 0.003	0.96	18.02	0.022	0.30
Caddo	49	0.001 ± 0.000	0.85	2.07	0.002	0.65
McCurtain	41	0.009 ± 0.002	0.87	10.97	0.013	0.28
Muskogee	112	0.016 ± 0.003	0.97	3.79	0.005	0.25
Cimarron	30	0.004 ± 0.001	0.95	7.14	0.009	0.95
Tulsa	39	0.006 ± 0.001	0.91	4.68	0.006	0.26
Host
Cantaloupe	13	0.023 ± 0.003	0.96	16.59	0.020	0.17
Cucumber	10	0.021 ± 0.004	0.93	12.82	0.015	0.15
Pumpkin	217	0.016 ± 0.002	0.97	13.69	0.016	0.41
Squash	60	0.022 ± 0.003	0.96	16.83	0.020	0.25
Watermelon	72	0.020 ± 0.003	0.97	14.95	0.018	0.33
Year of collection
2016	108	0.012 ± 0.002	0.93	10.56	0.013	0.47
2017	86	0.013 ± 0.002	0.96	11.69	0.014	0.37
2018	176	0.024 ± 0.003	0.98	18.21	0.022	0.36
Phylogroups
Phylogroup 1	205	0.003 ± 0.001	0.96	3.47	0.004	0.72
Phylogroup 2	87	0.015 ± 0.003	0.97	12.90	0.015	0.44
Phylogroup 3	78	0.019 ± 0.004	0.92	13.04	0.016	0.35
Overall	370	0.020 ± 0.003	0.98	16.09	0.019	0.36

Table 3. Population genetic analysis based on the coat protein gene sequences of PRSV-W and PRSV populations in different phylogroups around the world.

Group	No. of Sequences (n)	Mean Genetic Distance (d)	Haplotype Diversity (Hd)	Average # of Nt Difference (K)	Nt Diversity (π)
PRSV-W
This study	101	0.020 ± 0.003	0.95	16.12	0.02
Oklahoma ^a	165	0.029 ± 0.004	0.95	16.57	0.02
Oceania	11	0.021 ± 0.003	0.96	17.91	0.02
Americas ^b	11	0.051 ± 0.007	0.95	45.51	0.05
Asia	29	0.112 ± 0.014	0.99	79.86	0.09
PRSV-W and P
Americas ^b	22	0.060 ± 0.005	0.99	46.94	0.06
Asia	60	0.106 ± 0.008	0.99	74.95	0.09
PRSV-W ^c	106	0.060 ± 0.005	0.99	51.71	0.07
PRSV-P	42	0.011 ± 0.008	0.99	75.47	0.09
Overall ^d	148	0.085 ± 0.006	0.99	61.90	0.08

^a The sequences include 101PRSV-W isolates from this study and 64 PRSV-W isolates from GenBank. ^b The sequences exclude PRSV-W sequences from Oklahoma and include other isolates from North and South America. ^c Includes selected 33 PRSV-W consensus sequences from this study (out of 101) and 73 PRSV-W sequences retrieved from GenBank. ^d Includes selected 33 PRSV-W consensus sequences from this study (out of 101), 73 PRSV-W, and 42 PRSV-P sequences retrieved from GenBank.

Table 4. Gene flow and genetic differentiation estimates based on the coat protein gene sequences of PRSV-W isolates from different counties, hosts, phylogroups, and collection years.

Counties	Fst	Nm
BL vs. CD	0.41	0.36
BL vs. CM	0.39	0.39
BL vs. MC	0.24	0.78
BL vs. MK	0.37	0.42
BL vs. TL	0.56	0.20
CD vs. CM	0.76	0.08
CD vs. MC	0.50	0.25
CD vs. MK	0.14	1.60
CD vs. TL	0.87	0.04
CM vs. MC	0.61	0.16
CM vs. MK	0.72	0.10
CM vs. TL	0.73	0.09
MC vs. MK	0.46	0.29
MC vs. TL	0.71	0.10
MK vs. TL	0.83	0.05
Hosts
CT vs. CU	0.15	1.42
CT vs. PM	0.17	1.24
CT vs. SQ	0.02	14.31
CT vs. WM	0.12	1.76
CU vs. PM	0.23	0.86
CU vs. SQ	0.11	2.02
CU vs. WM	0.25	0.74
PM vs. SQ	0.22	0.89
PM vs. WM	0.06	4.05
SQ vs. WM	0.18	1.14
Phylogroups
PG1 vs. PG2	0.64	0.14
PG1 vs. PG3	0.61	0.16
PG2 vs. PG3	0.54	0.21
Collection years
16 vs. 17	0.13	1.73
16 vs. 18	0.18	1.10
17 vs. 18	0.13	1.61

Table 5. Gene flow and genetic differentiation estimates based on the coat protein gene sequences of PRSV populations between different phylogroups from around the world.

Region/Phylogroups	Fst	Nm
PRSV-W populations
Oklahoma vs. Americas	0.25	0.74
Oklahoma vs. Oceania	0.23	0.84
Oklahoma vs. Asia	0.43	0.34
Americas vs. Oceania	0.25	0.74
Americas vs. Asia	0.34	0.50
Oceania vs. Asia	0.44	0.32
PRSV-W and P populations
Americas vs. Oceania	0.20	1.00
Americas vs. Asia	0.29	0.62
PRSV-P vs. PRSV-W	0.12	1.89

Table 6. Mutational statistics in the coat protein gene sequences of PRSV-W isolates from different counties of Oklahoma in single and mixed infections.

	Counties
	Blaine			Caddo			Cimarron			McCurtain			Muskogee			Tulsa
	S ^a	M ^b	T ^c	S	M	T	S	M	T	S	M	T	S	M	T	S	M	T
No. of isolates	10	16	26	1	12	13	1	8	9	7	5	12	26	5	31	1	9	10
No. of clones	39	60	99	4	45	49	4	26	30	23	18	41	94	18	112	5	34	39
Mutation frequency (10⁻³)	1.25	1.29	1.27	0.58	1.23	1.18	1.49	1.07	1.12	1.0	0.5	0.79	1.26	0.84	1.99	2.16	1.33	1.37
dN/dS	0.21	0.25	0.30	0.00	0.65	0.65	0.32	0.92	0.28	0.19	0.14	0.25	0.81	0.47	0.65	0.09	0.29	0.26

^a S, sequences originated from plants with single infection (PRSV-W only); ^b M, sequences originating from plants with mixed infection (PRSV-W together with WMV, ZYMV, or both); ^c T, all sequences, regardless of single or mixed infection.

Table 7. Mutational statistics in the coat protein gene sequences of PRSV-W isolates from different hosts in single and mixed infections.

	Host
	Cantaloupe			Cucumber			Pumpkin			Squash			Watermelon
	S	M	T	S	M	T	S	M	T	S	M	T	S	M	T
No. of samples	2	2	4	1	2	3	30	29	59	-	16	16	13	6	19
No. of clones	6	7	13	4	6	10	109	106	215	-	60	60	51	21	72
Mutation frequency (10⁻³)	1.16	1.16	1.16	1.45	1.54	1.5	1.25	1.24	1.24	-	0.98	0.98	1.13	1.05	1.11
dN/dS	0.45	0.16	0.17	0.32	0.95	0.11	0.33	0.35	0.41	-	0.25	0.25	0.34	0.17	0.33

S, sequences originated from plants with single infection (PRSV-W only); M, sequences originating from plants with mixed infection (PRSV-W together with WMV, ZYMV, or both); T, all sequences regardless of single or mixed infection.

Table 8. Mutational statistics in the coat protein gene sequences of PRSV-W clones from isolates collected in different years in single and mixed infections.

	Year
	2016			2017			2018			Total
	S ^a	M ^b	T ^c	S	M	T	S	M	T	S	M	T
No. of isolates	14	15	29	16	9	25	16	31	47	46	55	101
No. of clones	52	56	108	56	30	86	61	115	176	169	201	370
Mutation frequency (10⁻³)	1.09	1.36	1.23	1.34	1.35	1.35	1.21	0.99	1.07	1.22	1.15	1.18
dN/dS	0.48	0.35	0.47	0.39	0.21	0.37	0.27	0.31	0.36	0.41	0.38	0.36

^a S, sequences originated from plants with single infection (PRSV-W only); ^b M, sequences originating from plants with mixed infection (PRSV-W together with WMV, ZYMV, or both); ^c T, all sequences regardless of single or mixed infection.

Table 9. Selection pressure analysis in the coat protein gene sequences among different PRSV populations.

PRSV Populations	Number of Sequences Used	Number of Negatively Selected Codons				Number of Positively Selected Codons
PRSV Populations	Number of Sequences Used	FUBAR	FEL	MEME	SLAC	FUBAR	FEL	MEME	SLAC
PRSV-W this study	101	12	27	-	4	2	0	0	0
PRSV-W Oklahoma	165 ^a	28	50	-	13	2	0	0	0
PRSV-W global sequences	106 ^b	214	214	-	197	3	3	11	3
PRSV global sequences	148 ^c	229	250	-	205	3	3	13	5

^a Includes 101 PRSV-W sequences from this study and 64 PRSV-W sequences from GenBank. ^b Includes 33 PRSV-W sequences from this study, 73 PRSV-W sequences from GenBank. ^c Includes 33 PRSV-W sequences from this study, 73 PRSV-W, and 42 PRSV-P sequences from GenBank.

Table 10. List of the most frequent amino acid changes among 218 non-silent mutations observed in the coat protein gene sequences of PRSV-W isolates in this study.

Rank	Amino Acid Change	Frequency (%)
1	Lysine–Arginine	14 (6.34)
2	Asparagine–Aspartic acid	12 (5.43)
3	Arginine–Lysine	9 (4.07)
4	Alanine–Valine	8 (3.62)
4	Glutamic acid–Glycine	8 (3.62)
4	Leucine–Proline	8 (3.62)
7	Aspartic acid–Asparagine	7 (3.16)
7	Asparagine–Serine	7 (3.16)

Table 11. Mutational statistics of different regions of coat protein gene sequences of PRSV-W isolates.

	N-Terminal	Core	C-Terminal
Total nt sites	195	459	210
Mutation frequency	1.60 × 10⁻³	1.01 × 10⁻³	1.10 × 10⁻³
dN/dS	0.36	0.23	0.35
Conserved sites (%)	122 (62.6)	357 (77.8)	154 (13.3)
Silent sites (%)	26 (13.3)	39 (8.5)	21 (10)
Non-silent sites (%)	47 (24.1)	63 (13.7)	35 (16.7)
Mean genetic diversity (d)	0.019 ± 0.005	0.006 ± 0.001	0.009 ± 0.003

Table 12. Conserved amino acid motifs observed in PRSV population and their respective amino acid sequences with their corresponding positions within coat protein gene.

CP Region	Conserved Motif	Oklahoma	PRSV-W	PRSV	References
N-ter	DAG	7-D[A/T]G-9	7-D[A/T/S]G-9	7-[D/N] [A/T/S]G-9	[46]
N-ter	DVN[A/V]GT	61-DVN[A/V]GT-66	61-DVN[A/V]GT-66	61-DVN[A/V]GT-66	This study
Core	DISNTRAT	105-QIDISNTRATQSQFEKWYEGV-125	107-DISNTRAT-114	107-DISNTRAT-114	This study
Core	MVWCI[E/D]NGTSP	129-DYGLNDNEMQVMLNGLMVWCIENGTSPDI-156	139-MLNGLM[V/G]WCIENGTSPD-155	139-MLNGLM[V/G]WCIENGTSPD-155	[56]
Core	W[V/T]MMDG[D/E/N]	157-SGVWVMMDGE-166	157-SGVWVMM[D/G/E][G/E]E-166	157-SGVWVMM[D/G/E][G/E]E-166	[57]
Core	FRQIMAHFSNAAEA	181-ATPSFRQIMAHFSNAAEA-198	185-FRQIMAHFSNAAEA-198	185-FRQIMAHFSNAAEA-198	This study
Core	[P/R/A]YMPRYG	208-[R/G]YMPRYGIKR-217	208-[R/K/G]YMPRYG[I/L]KR-217	208-[R/K/G]YMPRYG[I/L]KR-217	[58]
Core	YAFDFYE	219-LTDISLAR[Y/H]AFDFYEVNSKTP-239	221-DISLAR[Y/H]AFDFYE[V/I]NSKTP-239	221-D[I/T]SLAB[V/I]NSKTP-239	[59]
C-ter	QMKAAAL	246-HMQ[M/V]KAAALR-255	248-Q[M/V]KAAALR-255	248-Q[M/V]KAAALR-255	[60,61,62]
C-ter	E[N/D]TERH	269-SNKEE[N/D/S]TERHTVEDVNR-280	272-EE[N/D/S]TERHTV-280	272-EE[N/D/S]TERHTV-280	[56,59]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Khanal, V.; Ali, A. High Mutation Frequency and Significant Population Differentiation in Papaya Ringspot Virus-W Isolates. Pathogens 2021, 10, 1278. https://0-doi-org.brum.beds.ac.uk/10.3390/pathogens10101278

AMA Style

Khanal V, Ali A. High Mutation Frequency and Significant Population Differentiation in Papaya Ringspot Virus-W Isolates. Pathogens. 2021; 10(10):1278. https://0-doi-org.brum.beds.ac.uk/10.3390/pathogens10101278

Chicago/Turabian Style

Khanal, Vivek, and Akhtar Ali. 2021. "High Mutation Frequency and Significant Population Differentiation in Papaya Ringspot Virus-W Isolates" Pathogens 10, no. 10: 1278. https://0-doi-org.brum.beds.ac.uk/10.3390/pathogens10101278

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

High Mutation Frequency and Significant Population Differentiation in Papaya Ringspot Virus-W Isolates

Abstract

1. Introduction

2. Results

2.1. PRSV-W Isolates and Confirmation by RT-PCR

2.2. CP Gene Sequence Analysis

2.3. Phylogenetic Relationship among PRSV-W Populations

2.4. Genetic Variation among PRSV-W Populations

2.5. Population Differentiation within PRSV-W Populations

2.6. Mutation Frequency

2.7. Selection Pressure Analysis

2.8. Mutational Pattern

3. Discussion

4. Materials and Methods

4.1. Sample Collection and Detection of PRSV-W

4.2. Cloning and Sequencing

4.3. Sequence Analysis

4.4. Phylogenetic Analysis

4.5. Genetic Diversity and Population Genetics

4.5.1. Genetic Diversity

4.5.2. Gene Flow and Population Differentiation

4.5.3. Neutrality Tests

4.5.4. Selection Analysis

4.5.5. Mutation Frequency and Pattern within the CP Gene

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI