Next Article in Journal
Molecular Mechanism of Evolution and Human Infection with SARS-CoV-2
Previous Article in Journal
Systems Biology Analysis of the Antagonizing Effects of HIV-1 Tat Expression in the Brain over Transcriptional Changes Caused by Methamphetamine Sensitization
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

In Silico Analysis of Genetic Diversity of Human Hepatitis B Virus in Southeast Asia, Australia and New Zealand

1
School of Biomedical Sciences, Faculty of Health, Queensland University of Technology, Kelvin Grove, Queensland 4059, Australia
2
Research and Development Australian Red Cross Lifeblood, Kelvin Grove, Queensland 4059, Australia
3
School of Health and Sport Sciences, University of Sunshine Coast, Petrie, Queensland 4502, Australia
4
Institute for Health and Biomedical Innovation, Queensland University of Technology, Herston, Queensland 4006, Australia
*
Author to whom correspondence should be addressed.
Submission received: 6 March 2020 / Revised: 6 April 2020 / Accepted: 8 April 2020 / Published: 9 April 2020
(This article belongs to the Section Animal Viruses)

Abstract

:
The extent of whole genome diversity amongst hepatitis B virus (HBV) genotypes is not well described. This study aimed to update the current distribution of HBV types and to investigate mutation rates and nucleotide diversity between genotypes in Southeast Asia, Australia and New Zealand. We retrieved 930 human HBV complete genomes from these regions from the NCBI nucleotide database for genotyping, detection of potential recombination, serotype prediction, mutation identification and comparative genome analyses. Overall, HBV genotypes B (44.1%) and C (46.2%) together with predicted serotypes adr (36%), adw2 (29%) and ayw1 (19.9%) were the most commonly circulating HBV types in the studied region. The three HBV variants identified most frequently were p.V5L, c.1896G>A and double mutation c.1762A>T/c.1764G>A, while genotypes B and C had the widest range of mutation types. The study also highlighted the distinct nucleotide diversity of HBV genotypes for whole genome and along the genome length. Therefore, this study provided a robust update to HBV currently circulating in Southeast Asia, Australia and New Zealand as well as an insight into the association of HBV genetic hypervariability and prevalence of well reported mutations.

1. Introduction

Hepatitis B virus (HBV) infection is a global health burden. 257 million people had chronic hepatitis B (CHB) infection worldwide in 2015 [1]. CHB can develop into liver cirrhosis, and hepatocellular carcinoma (HCC), resulting in 720,000, and 470,000, deaths worldwide in 2015, respectively [1].
HBV is a small, enveloped DNA virus of the Hepadnaviridae family. Its genome (3.2 kbp) is circular and partially double-stranded with four overlapping open reading frames (ORFs) encoding for reverse transcriptase (RT), pre-surface/surface (pre-S/S) proteins, precore/core (PC/C) proteins and transcriptional transactivator X protein [2]. The overlapping structure of pre-S/S and RT ORFs is essential for HBV surface antigen (HBsAg) antigenicity and RT functionality [3]. The PC/C region is responsible for formation of two polypeptides: (1) HBV core antigen (HBcAg)—a structural protein of the nucleocapsid translated from pregenomic RNA and (2) HBV e antigen (HBeAg)—a viral secreted protein translated from PC RNA [4,5]. The core promoter is located immediately upstream of the PC/C region and consists of the basal core promoter (BCP) and the upper regulatory region (URR). URR is composed of the negative regulatory element (NRE) and the core upstream regulatory sequence (CURS), with a possible role in regulating core promoter activity [6]. The X ORF encodes for X protein, which is associated with viral regulation, cell growth and carcinogenesis by disrupting several cellular signaling pathways [7,8,9].
Several factors contribute to HBV genetic diversity. HBV possesses a unique replication cycle via an RNA intermediate, using an error-prone RT enzyme [2,10,11]. This results in a nucleotide substitution rate of >10−5 substitution/site/year, which is higher than most DNA viruses and similar to RNA viruses with low mutation rate [2,10,11]. Moreover, it has a rapid replication rate with over 1011 virus particles released in circulation per day [12].
Due to its high sequence heterogeneity, HBV has 10 recognised genotypes (A to I, and a provisional genotype J) with at least 8% nucleotide divergence, and over 35 subgenotypes with at least 4–8% nucleotide divergence over the whole genome [13,14]. Viral strains typically exhibit geographical and ethnic distributions [13,15]. Generally, genotypes A and D are highly prevalent in Europe, North Africa, North America and Asia, whilst in East and Southeast Asia, Australia and New Zealand genotypes B and C are more commonly found [15]. Genotypes E and A predominate in Sub-Saharan Africa, whilst genotypes F, G and H are recorded in Latin America [15]. To date, only a single instance of genotype J was identified in Japan [16].
Recombination is common in HBV, classified into 44 different patterns where at least two different HBV genotypes or subgenotypes exchange genetic information [17,18]. For instance, subgenotypes B2, B3 and B4 are recombinants of genotypes B and C while fragments of genotypes A, C and G are observed in genotype I [2,19]. There is also evidence of recombinants C/D and D/E [20].
HBV can be categorised into different serotypes based on serological reactivity of HBsAg including ayw1, ayw2, ayw3, ayw4, ayr, adw2, adw4, and adr. The presence of subdeterminants, d/y and r/w, depends on variation at amino acid positions 122, 160, 127, 159 and 140 [21,22]. Although the same HBV genotypes can encode different serotypes and one serotype can belong to different genotypes, the distinct variants associated with viral serotypes are consistently found at the genomic level [21,23].
The genetic variability of HBV plays an important role in clinical outcomes and planning HBV prevention strategies. Acute genotype A infections persist longer than genotype D. Genotypes A and D surpass B and C in viral chronicity, possibly due to their different roles in host-viral interactions [24,25,26]. Genotype C infections have an increased risk of higher viral loads and development of cirrhosis and HCC than genotype B [24,27]. Interferon-based treatment was shown to be more effective for genotypes A and B than genotypes C and D, demonstrated through lower expression of HBsAg, HBeAg and decreased HBV DNA levels [28].
Most HBV vaccines are generated from genotype A2 or C strains [29,30]. The vaccines are protective against non-A/C genotype infections by largely boosting the production of antibodies targeted against determinant a at amino acid positions 121–149 which corresponds to the first loop of B-cell epitope of HBsAg for all HBV genotypes and serotypes [3,21,30,31]. However, cases of cross-genotype HBV infection of vaccinated individuals have been recorded, raising questions as to the complete protection of HBV immunoprohylaxis, especially in countries where genotypes A and C are not prevalent [32,33,34].
Several HBV mutations within particular coding regions are associated with important clinical manifestations. Mutations across the major hydrophilic region (MHR) of HBsAg, where determinant a is located, disrupt the conformation of surface antigen, affect protein hydrophilicity and alter the binding of neutralizing antibodies. This allows HBV to escape host immune responses for clearance [35,36], resulting in failed immunoprophylaxis [37,38,39,40,41,42]. RT mutations are involved in the development of drug-resistant viral strains and loss of efficacy of nucleos(t)ide analogs and anti-viral drugs. These include p.M204IVS, p.V173L, p.A181T and p.L80VI for lamivudine, p.N236T and p.181V for adenovir, p.A194T for tenofovir, and p.M250V/p.I169T and p.T184G/p.S202I for entecavir [43,44]. Mutations within the PC region acquired after HBeAg seroconversion (c.1896G>A and p.P130T) can lead to defective HBV replication, decreased or lost secretion of HBeAg, HBeAg-negative CHB and even HCC with poorer prognosis [5,7,45,46,47]. The two HBcAg mutations p.I/F97L and p.P130T are frequently observed concurrently in CHB patients [44], leading to the secretion of virions containing immature genomes and hypermaturations, respectively [48]. Meanwhile, double mutations c.1762A>T / c.1764G>A in the BCP region and mutation c.1613G>A within NRE, are associated with the suppressed expression of HBeAg, increased viral load in HBV carriers and increased risk of liver carcinogenesis [6,49,50]. A study on HBV X mutations reported the presence of the mutant p.V5M/L amongst Korean patients with severe liver diseases such as cirrhosis and HCC [51].
The rate of international travel to Southeast Asia, Australia, New Zealand has increased in recent years. Given the geographically distinct distribution of HBV genotypes and serotypes, this trend could influence the prevalence of HBV strains currently circulating in this region. Moreover, there have been conflicting reports of the predominant genotype in a number of countries within this area. A study in 2004 [13] identified only HBV genotypes B and C in the region, except for Australia, in which genotypes D and C were found. Meanwhile, a recent systematic literature review [15] reported genotypes B, C, D and A circulating in these countries. This study therefore aimed to robustly update the situation of HBV types in this region. Moreover, given the potential complications arising from aforementioned HBV mutations, we sought to estimate a number of mutations across the viral genome by genotypes. We also investigated the nucleotide variability of HBV genotypes in the same region to explore whether there is an association between HBV heterogeneity and mutation rates for each genotype.

2. Materials and Methods

2.1. Data Collection from NCBI Nucleotide Database

A search for human HBV complete genomes listed on the NCBI nucleotide database was conducted in August 2019, firstly using the term [(“HBV” OR “hepatitis B virus”) AND “complete genome”]. We then applied the following filters: viruses for species, gDNA/RNA for molecule types, International Nucleotide Sequence Database Collaboration (INSDC) (GenBank) and [country name]. The studied countries were located in Southeast Asia, Australia and New Zealand, with names as described by the United Nations (https://unstats.un.org/unsd/methodology/m49/). Each genome was manually checked for inclusion, including HBV complete genome, Homo sapiens host, country area, and unique sequences. The country of origin of each genome was checked either by using the information given on NCBI or listed in publications.

2.2. Analysis of HBV Genotypes

For each sequence, we used NCBI HBV genotyping [52] to assess the agreement between genotypes submitted on GenBank and results provided on the HBVdb genotyping tool [53]. We included sequences KF214650|I, EU833891|I, JF899337|I, and AB486012|J for genotypes I and J as references. A genotype was assigned based on (1) concurrence of genotype assigned by three sources (Genbank submission, NCBI HBV genotyping and HBVdb genotyping tool) or (2) the first E-value given by NCBI BLAST < 1 × 10−75 in the case of genotype assignment disagreement from one or more sources/tools. We nominated unverified genotypes or potential recombinants by phylogenetic analysis using FastTree 2.1.11 [54,55] and RAxML 8.2.11 [56]. For FastTree, the default tree building setting was used. For RaxML, we used a general time-reversible (GTR) GAMMA I model, rapid bootstrapping and 1000 replicates. Figure 1 illustrates the steps of genotyping the collected sequences. Sequences were aligned using MAFFT [57,58] and Geneious Prime 2019.2.1 for the whole study.

2.3. HBV Recombination Analysis

We utilised the Recombination Detection Program (RDP) 4 [59] to detect potential recombination events within our data. Five built-in methods (RDP [60], Geneconv [61], Maxchi [62], Chimaera [63] and 3SEQ [64]) were run iteratively with their default settings, to identify potential recombinants by each method. Sequences detected as potential recombinants with overlapping breakpoints by all 5 methods were then pooled together with the sequences of unverified genotypes/recombinants for the phylogenetic analysis as outlined in 2.2. Reference sequences were included (Table S1).

2.4. Representative Phylogenetic Analysis

A number of sequences were randomly selected from the complete dataset for the identified genotypes to establish a representative phylogeny. The number of sequences chosen for the representative phylogeny was proportional to the number of sequences of each genotype in our dataset: 5 sequences for genotype A, 20 for B, 20 for C, 10 for D, 1 for G and 5 for I. Potential recombinants were also included in the population. Approximated maximum-likelihood trees were built by using FastTree 2.1.11 [54,55], RAxML 8.2.11 [56] and PHYML 3.3.20180621 [65]. Model parameters for FastTree and RAxML were as described in 2.2, Model parameters for PHYML were established by using jmodeltest: GTR+I+G, proportion of invariable sites 0.463, gamma distribution of 0.854 and 1000 bootstraps. Reference sequences were included (Table S1).

2.5. HBV Serotype Prediction

Serotypes for each HBV genome were predicted using the decision tree developed by Purdy et al. (2007) [22]. Translated HBsAg sequences were analysed with the mutation reporter tool [66] to determine serotypes. We used FLLVLLDY as an input for anchor motif at amino acid position 93. We confirmed our predictions by using the HBV serotyper tool (http://hvdr.bioinf.wits.ac.za/SmallGenomeTools/) [67] or manually. Fourteen sequences that were not predicted to any serotypes by either of the two methods above were individually investigated for their amino acid composition.

2.6. Nucleotide Diversity

We used DnaSP 6 for a comprehensive analysis of sequence variation by each genotype [68]. All degenerate nucleotides were converted to Ns prior to analysis as required by the software. Sites with gaps in the complete data file were excluded from the analysis. To evaluate the degree of polymorphism within genotypes, we estimated the average number of nucleotide differences per site between two sequences, called nucleotide diversity Pi (μ), with window length of 100 bp and step size of 25 bp.

2.7. Analysis of Nucleotide and Amino Acid Variations

Analysis of HBV variations was performed across five regions—NRE, BCP, PC, C, HBsAg, polymerase/RT, and X. Variations selected for analysis were all reported to be associated with viral life cycle and/or clinical complications (Figure 2) [69,70]. Drug resistance mutations over the RT ORF and escape mutations within HBsAg ORF were identified by HIV-grade HBV drug resistance interpretation (DRI) [70]. Other variants were determined based on nucleotides or amino acids substituted at the selected positions along nucleotide sequences or translated sequences using Geneious Prime 2019.2.1.

3. Results

3.1. Data Collection from NCBI Nucleotide Database

Of the 9182 complete HBV genomes identified on NCBI nucleotide database, 930 genomes met the inclusion criteria (File S1 and Figure 3). Five pairs of HBV genomes, despite having different GenBank accession numbers, were identical, therefore, one sequence of each pair was excluded. No human HBV complete genomes were found for the following geographical locations: Brunei Darussalam, Singapore, Timor-Leste, Christmas Island, Cocos Islands, Norfolk Island, Heard Island and McDonald Island. No relevant information regarding clinical condition was available for any of the sequences collected.

3.2. Analysis of HBV Genotypes

Over 50% of sequences originated from Malaysia and Vietnam, with 296 and 180 sequences, respectively. We identified six HBV genotypes (A, B, C, D, G and I) circulating in Southeast Asia, Australia and New Zealand (Table 1 and Figure 4). The frequencies of each HBV genotype varied widely between countries. In the studied region, genotypes C and B were the most prevalent, constituting 46.2% and 44.1%. Genotype I (2.3%) was observed only in Laos, Vietnam and Thailand, whilst genotype D (5.3%) was found in New Zealand, Indonesia and Malaysia. Only one sequence was genotyped G and originated in Thailand. Countries with the widest range of circulating HBV genotypes were Malaysia (A, B, C and D) and Thailand (B, C, G and I), followed by Vietnam (B, C and I), Philippines (A, B and C) and Indonesia (B, C and D).

3.3. HBV Recombination Analysis

We ran five automatic detection methods in RDP4 to detect potential recombination within our data (Table 2). In total, 65 genomes were identified as recombinant by all five methods; 35 of them had overlapping breakpoint positions, including 10 with the same major parents who contribute greater proportion of sequence. The 65 suspected recombinants were pooled together with sequences of ambiguous genotype or suspected recombination for further analysis with FastTree and RaxML (Figures S1 and S2). According to these phylogenies, 34 of the 65 sequences were clustered in genotype B, particularly B2 and B3, whilst 13 clustered in genotype C and 12 in genotype A. Of the seven sequences that were initially suspected as B/BC, C/CB or G/CG recombinants (Table 1) by HBV genotyping, two C/CB were located between the clusters of genotypes B2 and C, two G/GC between G and C, while the three B/BC were positioned within the cluster of B2 and B3. Results of recombination and phylogeny analyses are summarised in Table S2. They all shared approximately 92% similarity with their corresponding reference sequences.

3.4. Representative Phylogenetic Analysis

Generally, all three trees built with FastTree, RAxML and PHYML (Figures S3–S5) were consistent in clustering the sequences of the same genotypes identified in 3.2, supporting our genotyping results. The divisions into genotypes A, B, D and I were highly supported by all three methods. Lower support values were calculated for genotype G (82% by RAxML and 78.1% by PHYML) and C (62% by RAxML and 53.5% by PHYML). PHYML and RAxML trees were generally congruent, except for five sequences, one of genotype B and four of genotype C, though still clustering within clades of their corresponding genotypes. Differently, for FastTree tree, genotype roots were relocated and a bigger number of sequences of genotypes B, C, D and F changed their arrangement within the same clade.

3.5. HBV Serotype Prediction

We predicted serotypes for the 930 HBV genomes using the decision tree of Purdy et al. [22]. Table 3 summarises the distribution of the predicted HBV serotypes by genotypes. Serotypes adr, adw2 and ayw1 were most prevalent in the studied region, accounting for 36%, 29% and 19.9%, respectively. They were predominantly found with HBV genotypes B and C, which are common in Southeast Asia and Australia. All sequences of serotype adr and ayr were within genotype C. Serotypes adw2 and ayw1 were found in a wider range of genotypes than the other serotypes. Serotypes adw3, ayw2 and ayw3 were scattered among genotypes B, C and D with no discernible distribution pattern.

3.6. Nucleotide Diversity

Nucleotide diversity analysis indicates a wide range of genetic variability for whole genome (Table 4) and along the HBV genome length (Figure 5). Genotype G was not included because only one sequence of this genotype was identified. Genotypes B and C demonstrated a higher number of segregating sites, Pi values and average number of nucleotide differences, indicating greater genetic diversity compared to other genotypes (Table 4). Despite the lowest number of genomes included in the dataset, genotype A demonstrated Pi and k values greater than genotypes D and I. This suggests that genotype D and I were the most conserved in their overall nucleotide sequences.
Moreover, we found genetic differences appeared to be unevenly distributed along the genome length (Figure 5). At a genotypic level, genotypes B and C were the most diverse within the overlapping RT and pre-S/S coding genes, while genotype A expressed more genetic variations located near or within X gene and PC/C gene. Genotypes D and I were more conserved than the others over the whole genome, except for a distinct locus in RT ORF.

3.7. Analysis of Single Nucleotide and Amino Acid Variations

Our dataset was investigated for the frequency of the selected variations by genotype (Table 5). We observed the p.V5L variation was present in almost all sequences analysed (≥83% prevalence for all genotypes). c.1896G>A was the second most common variant, particularly with HBV genotypes B, D, I and C, while c.1762A>T/c.1764G>A ranked third but was less genotype-specific. It was noted that the compound nucleotide substitution at positions 1762 and 1764 within BCP was more likely to occur together, rather than to have either substitution singularly. Escape mutations in the viral surface coding gene (14.4%) and c.1613G>A in NRE (12.8%) also occurred frequently, predominantly in genotypes B and C. Additionally, p.P130T in core gene and p.V5M in the X gene were mainly acquired in genotypes B and C, while drug resistance mutations occurred sporadically for genotypes A–D.

4. Discussion

This in silico study of the distribution of HBV genotypes and recombinants in Southeast Asia, Australia and New Zealand was performed through the phylogenetic, recombination and comparative genomic analysis of 930 human HBV complete genomes. We observed a high prevalence of genotypes C and B in the studied region. This observation is consistent with previous studies [13,15]. The exception is Laos, where we identified genotype I as the dominant genotype, rather than genotypes B and C as reported by Norder [13] and Velkov [15]. We also detected frequent recombinants between genotypes B and C, all from Thailand and Malaysia.
This study also investigated the distribution of HBsAg subtypes in the study region predicted using a decision tree [22]. The three most prevalent serotypes were adr (36%), adw2 (29%) and ayw1 (19.9%). We found serotypes adr, ayr and ayw3 most commonly associated with genotype C, adw2 for B and C, adw3 and ayw1 for B, and ayw2 for D. These findings differ to Norder’s study [13], which reported that the serotypes adw2, ayw1 and ayw3 were more pronounced amongst genotypes A, B and D. Moreover, our observations of HBsAg subtype distribution for genotypes B and D are dissimilar to the finding of Chen et al. [71] who reported that adr and ayw2 were prevalent among genotype B strains while most of genotype D sequences were assigned to adr [71]. This is likely due to the geographical selection of HBV sequences used in this study. We targeted Southeast Asia, Australia and New Zealand, while the Norder study [13] was worldwide and the Chen study [71] focused on China.
To our knowledge, this study is also the first to investigate nucleotide diversity between HBV genotypes. We found that levels of nucleotide diversity varied amongst the different genotypes. Genotypes C and B showed an overall higher genetic diversity and more mutation types than genotypes A, D and I. Furthermore, we found that each genotype demonstrated varying genetic diversity within different ORFs (Figure 5). Specifically, genotypes B and C displayed the highest heterogeneity in the polymerase and pre-S/S regions while genotype A showed the highest diversity within the X and PC/C genes.
Information on the distribution of HBV mutations along genome length amongst genotypes is limited. Our study showed that the frequency of these mutants varied with genotypes (Table 5). This observation was also reported by Bayliss et al. [69]. We found that sequences with mutations, except for c.1896G>A, were mainly associated with to genotype C. However, we observed that p.V5L, c.1896G>A and c.1762A>T/c.1764G>A were the three most prevalent variants, contrasting to c.G1613A, c.1762A>T/c.1764G>A and c.1896G>A in the Bayliss study. These differences are likely attributable to sample selection criteria. In the Bayliss study, only HBeAg-positive patients with CHB were included, which may favour the presence of particular mutations. In our study, there was very little information linked to clinical diseases available on the NCBI nucleotide database for the collected genomes. Interestingly, in our study, escape mutations associated with risk of failed immunoprophylaxis [37,38,39,40,41,42] occurred significantly for genotypes A and C (totally 18%), compared to non-A/C genotypes (totally 10.6%) (X2 = 10.588, df = 1, p = 0.001138), given that most currently used HBV vaccines are generated from genotype A or C strains [29,30]. Due to the unavailability of relevant clinical information for these entries in GenBank, vaccine status for these sequences is unknown.
In considering whether an association between genetic hypervariability and mutation rate exists for each HBV genotype, we observed that the PC/C ORF of genotype A demonstrated high nucleotide diversity had a high proportion of c.1762A>T/c.1764G>A (42%) and c.1613G>A (16%) mutations. By contrast, although the polymerase and pre-S/S genes of genotypes B and C demonstrated the highest diversity compared with other ORFs, the percentage of mutations within these regions, including drug resistance and escape mutations, were much lower than all other mutations within the PC/C region combined (Table 5). This observation may be explained by a finding of Osiowy et al. [10], that the RT region has a higher ratio of synonymous to non-synonymous substitutions (dS/dN) than the core coding region. In other words, nucleotide subtitutions in the RT ORF are less likely to alter amino acid sequence, and therefore are less likely to be associated with clinically significant mutations than variation in the core region. This feature may allow HBV to play critical roles of the overlapping pre-S/S and RT regions on maintaining HBV transcription and HBsAg functionality against host immune system [3].
We recognise that our study has limitations. Our sequence selection was regionally focused, and HBV strains circulating in countries outside our studied area could account for genetic diversity of genotypes and subgenotypes which are absent or less frequent in Southeast Asia, Australia and New Zealand. There is a need for a global study of HBV genetic variability, to better understand contemporary distribution pattern—particularly considering the rise in human migration and ease of global travel.
In conclusion, this investigation is an up-to-date, thorough report on the distribution of HBV types circulating in Southeast Asia, Australia and New Zealand. We highlighted HBV nucleotide variability, and the prevalence of different mutations within genotypes and ORFs. Given that several HBV variants are associated with failure of immunoprophylaxis, drug resistance and liver disease progression, the understanding of variability and mutations within genotypes highlighted in our study may guide clinicians in predicting the risks associated with failed protection and unsuccessful treatment.

Supplementary Materials

The following are available online at https://0-www-mdpi-com.brum.beds.ac.uk/1999-4915/12/4/427/s1, Figure S1: FastTree maximum likelihood tree built from sequences of ambiguous genotypes and potential recombinants identified by RDP4, Figure S2: RAxML maximum likelihood tree built from sequences of ambiguous genotypes and potential recombinants identified by RDP4, Figure S3: FastTree maximum likelihood tree built from representative population, Figure S4: RAxML maximum likelihood tree built from representative population, Figure S5: PHYML maximum likelihood tree built from representative population, Table S1: Reference sequences used in this study, Table S2: Features of finally identified potential recombinants, File S1: Summarised results of 930 human HBV genomes from Southeast Asia, Australia and New Zealand.

Author Contributions

Conceptualization, N.M.H.P., E.R., H.F.; methodology, N.M.H.P. and E.R.; investigation, N.M.H.P. and E.R.; data curation, N.M.H.P. and E.R.; writing—original draft preparation, N.M.H.P.; writing—review and editing, all authors; visualization, N.M.H.P., E.R.; supervision, E.R., H.F., R.F. and K.S.; project administration, H.F., R.F. and E.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding. Australian governments fund Australian Red Cross Lifeblood for the provision of blood, blood products and services to the Australian community.

Acknowledgments

Computational and data visualization resources and services used in this work were provided by HPC and Research Support Group, Queensland University of Technology, Brisbane, Australia as well as by ICT Services Desk, Australian Red Cross Blood Service, Brisbane, Australia. The authors gratefully acknowledge Elvina Viennet for her advice on using R mapping.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. World Health Organization. Global Hepatitis Report 2017; World Health Organization: Geneva, Switzerland, 2017. [Google Scholar]
  2. McNaughton, A.L.; D’arienzo, V.; Ansari, M.A.; Lumley, S.F.; Littlejohn, M.; Revill, P.; McKeating, J.A.; Matthews, P.C. Insights from deep sequencing of the hbv genome—Unique, tiny, and misunderstood. Gastroenterology 2019, 156, 384–399. [Google Scholar] [CrossRef] [Green Version]
  3. Cento, V.; Mirabelli, C.; Dimonte, S.; Salpini, R.; Han, Y.; Trimoulet, P.; Bertoli, A.; Micheli, V.; Gubertini, G.; Cappiello, G.; et al. Overlapping structure of hepatitis b virus (hbv) genome and immune selection pressure are critical forces modulating hbv evolution. J. Gen. Virol. 2013, 94, 143. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Schödel, F.; Peterson, D.; Zheng, J.; Jones, J.E.; Hughes, J.L.; Milich, D.R. Structure of hepatitis b virus core and e-antigen. A single precore amino acid prevents nucleocapsid assembly. J. Biol. Chem. 1993, 268, 1332. [Google Scholar] [PubMed]
  5. Kim, H.; Lee, S.A.; Do, S.Y.; Kim, B.J. Precore/core region mutations of hepatitis b virus related to clinical severity. World J. Gastroenterol. 2016, 22, 4287–4296. [Google Scholar] [CrossRef] [PubMed]
  6. Li, M.S.; LauTerrence, C.K.; ChanSophie, K.P.; Wong, C.H.; NgPatrick, K.S.; SungJoseph, J.Y.; ChanHenry, L.Y.; TsuiStephen, K.W. The g1613a mutation in the hbv genome affects hbeag expression and viral replication through altered core promoter activity. PLoS ONE 2011, 6, e21856. [Google Scholar] [CrossRef] [PubMed]
  7. Zhang, Z.; Wu, C.; Chen, X.; Li, X.; Li, J.; Lu, M. Genetic variation of hepatitis b virus and its significance for pathogenesis. World J. Gastroenterol. 2016, 22, 126–144. [Google Scholar] [CrossRef]
  8. Wang, W.; Shi, Y.E.; Bai, G.; Tang, Y.; Yuan, Y.; Zhang, T.; Li, C. Hbxag suppresses apoptosis of human placental trophoblastic cell lines via activation of the pi3k/akt pathway. Cell Biol. Int. 2016, 40, 708. [Google Scholar] [CrossRef]
  9. Cha, M.-Y.; Kim, C.-M.; Park, Y.-M.; Ryu, W.-S. Hepatitis b virus x protein is essential for the activation of wnt/beta-catenin signaling in hepatoma cells. Hepatol. (Baltim. Md.) 2004, 39, 1683. [Google Scholar] [CrossRef]
  10. Osiowy, C.; Giles, E.; Tanaka, Y.; Mizokami, M.; Minuk, G.Y. Molecular evolution of hepatitis b virus over 25 years. J. Virol. 2006, 80, 10307. [Google Scholar] [CrossRef] [Green Version]
  11. Park, S.G.; Kim, Y.; Park, E.; Ryu, H.M.; Jung, G. Fidelity of hepatitis b virus polymerase. Eur. J. Biochem. 2003, 270, 2929–2936. [Google Scholar] [CrossRef] [Green Version]
  12. Nowak, M.A.; Bonhoeffer, S.; Hill, A.M.; Boehme, R.; Thomas, H.C.; Mcdade, H. Viral dynamics in hepatitis b virus infection. Proc. Natl. Acad. Sci. USA 1996, 93, 4398. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Norder, H.; Couroucé, A.-M.; Coursaget, P.; Echevarria, J.M.; Lee, S.-D.; Mushahwar, I.K.; Robertson, B.H.; Locarnini, S.; Magnius, L.O. Genetic diversity of hepatitis b virus strains derived worldwide: Genotypes, subgenotypes, and hbsag subtypes. Intervirology 2004, 47, 289–309. [Google Scholar] [CrossRef] [PubMed]
  14. Okamoto, H.; Tsuda, F.; Sakugawa, H.; Sastrosoewignjo, R.I.; Imai, M.; Miyakawa, Y.; Mayumi, M. Typing hepatitis b virus by homology in nucleotide sequence: Comparison of surface antigen subtypes. J. Gen. Virol. 1988, 69, 2575–2583. [Google Scholar] [CrossRef] [PubMed]
  15. Velkov, S.; Ott, J.; Protzer, U.; Michler, T. The global hepatitis b virus genotype distribution approximated from available genotyping data. Genes 2018, 9, 495. [Google Scholar] [CrossRef] [Green Version]
  16. Tatematsu, K.; Tanaka, Y.; Kurbanov, F.; Sugauchi, F.; Mano, S.; Maeshiro, T.; Nakayoshi, T.; Wakuta, M.; Miyakawa, Y.; Mizokami, M. A genetic variant of hepatitis b virus divergent from known human and ape genotypes isolated from a japanese patient and provisionally assigned to new genotype j. J. Virol. 2009, 83, 10538. [Google Scholar] [CrossRef] [Green Version]
  17. Pérez-Losada, M.; Arenas, M.; Galán, J.C.; Palero, F.; González-Candelas, F. Recombination in viruses: Mechanisms, methods of study, and evolutionary consequences. Infect. Genet. Evol. 2015, 30, 296–307. [Google Scholar] [CrossRef] [Green Version]
  18. Shi, W.; Carr, M.J.; Dunford, L.; Zhu, C.; Hall, W.W.; Higgins, D.G. Identification of novel inter-genotypic recombinants of human hepatitis b viruses by large-scale phylogenetic analysis. Virology 2012, 427, 51–59. [Google Scholar] [CrossRef] [Green Version]
  19. Liao, H.; Li, X.; Liu, Y.; Xu, Z.; Huang, P.; Nian, X.; Liu, X.; Xu, D. Intergenotype recombinant analysis of full-length hepatitis b virus genomes from 516 chinese patients with different illness categories. J. Med. Virol. 2017, 89, 139–145. [Google Scholar] [CrossRef]
  20. Boyce, C.; Ganova-Raeva, L.; Archampong, T.; Lartey, M.; Sagoe, K.; Obo-Akwa, A.; Kenu, E.; Kwara, A.; Blackard, J. Identification and comparative analysis of hepatitis b virus genotype d/e recombinants in africa. Virus Genes 2017, 53, 538–547. [Google Scholar] [CrossRef] [Green Version]
  21. Norder, H.; Courouce, A.M.; Magnius, L.O. Molecular basis of hepatitis b virus serotype variations within the four major subtypes. J. Gen. Virol. 1992, 73, 3141–3145. [Google Scholar] [CrossRef]
  22. Purdy, M.A.; Talekar, G.; Swenson, P.; Araujo, A.; Fields, H. A new algorithm for deduction of hepatitis b surface antigen subtype determinants from the amino acid sequence. Intervirology 2007, 50, 45–51. [Google Scholar] [CrossRef] [PubMed]
  23. Norder, H.; Hammas, B.; Lofdahl, S.; Courouce, A.M.; Magnius, L.O. Comparison of the amino acid sequences of nine different serotypes of hepatitis b surface antigen and genomic classification of the corresponding hepatitis b virus strains. J. Gen. Virol. 1992, 73, 1201–1208. [Google Scholar] [CrossRef]
  24. Lin, C.-L.; Kao, J.-H. Hepatitis b virus genotypes and variants. Cold Spring Harb. Perspect. Med. 2015, 5, a021436. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Ito, K.; Yotsuyanagi, H.; Yatsuhashi, H.; Karino, Y.; Takikawa, Y.; Saito, T.; Arase, Y.; Imazeki, F.; Kurosaki, M.; Umemura, T.; et al. Risk factors for long-term persistence of serum hepatitis b surface antigen following acute hepatitis b virus infection in japanese adults. Hepatology 2014, 59, 89–97. [Google Scholar] [CrossRef] [PubMed]
  26. Mayerat, C.; Mantegani, A.; Frei, P.C. Does hepatitis b virus (hbv) genotype influence the clinical outcome of hbv infection? J. Viral Hepat. 1999, 6, 299–304. [Google Scholar] [CrossRef] [PubMed]
  27. Yu, M.-W.; Yeh, S.-H.; Chen, P.-J.; Liaw, Y.-F.; Lin, C.-L.; Liu, C.-J.; Shih, W.-L.; Kao, J.-H.; Chen, D.-S.; Chen, C.-J. Hepatitis b virus genotype and DNA level and hepatocellular carcinoma: A prospective study in men. J. Natl. Cancer Inst. 2005, 97, 265–272. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. Zhang, Y.; Wu, Y.; Ye, S.; Wang, T.; Zhao, R.; Chen, F.; Abe, K.; Jin, X. The response to interferon is influenced by hepatitis b virus genotype in vitro and in vivo. Virus Res. 2013, 171, 65–70. [Google Scholar] [CrossRef]
  29. Cassidy, A.; Mossman, S.; Olivieri, A.; Ridder, M.D.; Leroux-Roels, G. Hepatitis b vaccine effectiveness in the face of global hbv genotype diversity. Expert Rev. Vaccines 2011, 10, 1709–1715. [Google Scholar] [CrossRef] [Green Version]
  30. Hamada-Tsutsumi, S.; Iio, E.; Watanabe, T.; Murakami, S.; Isogawa, M.; Iijima, S.; Inoue, T.; Matsunami, K.; Tajiri, K.; Ozawa, T.; et al. Validation of cross-genotype neutralization by hepatitis b virus-specific monoclonal antibodies by in vitro and in vivo infection. PLoS ONE 2015, 10, e0118062. [Google Scholar] [CrossRef] [Green Version]
  31. Shokrgozar, M.A.; Shokri, F. Subtype specificity of anti-hbs antibodies produced by human b-cell lines isolated from normal individuals vaccinated with recombinant hepatitis b vaccine. Vaccine 2002, 20, 2215–2220. [Google Scholar] [CrossRef]
  32. Stramer, S.L.; Wend, U.; Candotti, D.; Foster, G.A.; Hollinger, F.B.; Dodd, R.Y.; Allain, J.-P.; Gerlich, W. Nucleic acid testing to detect hbv infection in blood donors. N. Engl. J. Med. 2011, 364, 236–247. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Griffiths, E.; Reeve, C.; Marley, J.V. Hepatitis b notifications in a vaccinated cohort of aboriginal people in the kimberley region. Med. J. Aust. 2014, 201, 343–346. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Seed, C.R.; Jones, N.T.; Pickworth, A.M.; Graham, W.R. Two cases of asymptomatic hbv “vaccine breakthrough” infection detected in blood donors screened for hbv DNA. Med. J. Aust. 2012, 196, 651–652. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Purdy, M. Hepatitis b virus s gene escape mutants. Asian J. Transfus. Sci. 2007, 1, 62–70. [Google Scholar] [CrossRef]
  36. Xiao, Y.; Sun, K.; Duan, Z.; Liu, Z.; Li, Y.; Yan, L.; Song, Y.; Zou, H.; Zhuang, H.; Wang, J.; et al. Quasispecies characteristic in “a” determinant region is a potential predictor for the risk of immunoprophylaxis failure of mother-to-child-transmission of sub-genotype c2 hepatitis b virus: A prospective nested case–control study. Gut 2020, 69, 933–941. [Google Scholar] [CrossRef]
  37. Huang, X.; Lu, D.; Ji, G.; Sun, Y.; Ma, L.; Chen, Z.; Zhang, L.; Huang, J.; Yu, L. Hepatitis b virus (hbv) vaccine-induced escape mutants of hbv s gene among children from qidong area, china. Virus Res. 2004, 99, 63–68. [Google Scholar] [CrossRef]
  38. Oon, C.J.; Chen, W. Current aspects of hepatitis b surface antigen mutants in singapore. J. Viral Hepat. 1998, 5, 17–23. [Google Scholar] [CrossRef]
  39. Kato, H.; Sugiyama, M.; Mizokami, M. Hepatitis b virus genotypes. In Hepatitis b Virus in Human Diseases, 1st ed.; Liaw, Y.-F., Zoulim, F., Eds.; Springer International Publishing: Cham, Switzerland, 2016. [Google Scholar]
  40. Ngui, S.L.; O’Connell, S.; Eglin, R.P.; Heptonstall, J.; Teo, C.G. Low detection rate and maternal provenance of hepatitis b virus s gene mutants in cases of failed postnatal immunoprophylaxis in england and wales. J. Infect. Dis. 1997, 176, 1360–1365. [Google Scholar] [CrossRef] [Green Version]
  41. Haruki, K.; Ayano, I.; Shuichiro, U.; Tomoyuki, T.; Tsuyoshi, S.; Yasuhiro, K.; Tomoo, F. Evaluation of the g145r mutant of the hepatitis b virus as a minor strain in mother-to-child transmission. PLoS ONE 2016, 11, e0165674. [Google Scholar]
  42. Mokaya, J.; McNaughton, A.L.; Hadley, M.; Beloukas, A.; Geretti, A.; Goedhals, D.; Matthews, P.C. A systematic review of hepatitis b virus (hbv) drug and vaccine escape mutations in africa: A call for urgent action. PLoS Negl. Trop. Dis. 2018, 12, e0006629. [Google Scholar] [CrossRef] [Green Version]
  43. Shaw, T.; Bartholomeusz, A.; Locarnini, S. Hbv drug resistance: Mechanisms, detection and interpretation. J. Hepatol. 2006, 44, 593–606. [Google Scholar] [CrossRef] [PubMed]
  44. Caligiuri, P.; Cerruti, R.; Icardi, G.; Bruzzone, B. Overview of hepatitis b virus mutations and their implications in the management of infection. World J. Gastroenterol. 2016, 22, 145. [Google Scholar] [CrossRef] [PubMed]
  45. Colombatto, P.; Barbera, C.; Bortolotti, F.; Maina, A.M.; Moriconi, F.; Cavallone, D.; Calvo, P.; Oliveri, F.; Bonino, F.; Brunetto, M.R. Hbv pre-core mutant in genotype-d infected children is selected during hbeag/anti-hbe seroconversion and leads to hbeag negative chronic hepatitis b in adulthood. J. Med. Virol. 2018, 90, 1232–1239. [Google Scholar] [CrossRef] [PubMed]
  46. Funk, M.L.; Rosenberg, D.M.; Lok, A.S.F. World-wide epidemiology of hbeag-negative chronic hepatitis b and associated precore and core promoter variants. J. Viral Hepat. 2002, 9, 52–61. [Google Scholar] [CrossRef] [Green Version]
  47. Xie, Y.; Liu, S.; Zhao, Y.; Zhang, L.; Liu, B.; Guo, Z. Precore/core region mutations in hepatitis b virus DNA predict postoperative survival in hepatocellular carcinoma. PLoS ONE 2015, 10, e0133393. [Google Scholar] [CrossRef]
  48. Wu, S.; Chang, Y.; Chu, T.; Shih, C. Persistence of hepatitis b virus DNA and the tempos between virion secretion and genome maturation in a mouse model. J. Virol. 2019, 93. [Google Scholar] [CrossRef]
  49. Nie, H.; Evans, A.A.; London, W.T.; Block, T.M.; Ren, X.D. Quantitative dynamics of hepatitis b basal core promoter and precore mutants before and after hbeag seroconversion. J. Hepatol. 2012, 56, 795–802. [Google Scholar] [CrossRef] [Green Version]
  50. Yang, Z.; Zhuang, L.; Lu, Y.; Xu, Q.N.; Tang, B.; Chen, X. Naturally occurring basal core promoter a1762t/g1764a dual mutations increase the risk of hbv-related hepatocellular carcinoma: A meta-analysis. Oncotarget 2016, 7, 12525–12536. [Google Scholar] [CrossRef] [Green Version]
  51. Kim, H.J.; Park, J.H.; Jee, Y.; Lee, S.A.; Kim, H.; Song, B.C.; Yang, S.; Lee, M.; Yoon, J.H.; Kim, Y.J.; et al. Hepatitis b virus x mutations occurring naturally associated with clinical severity of liver disease among korean patients with chronic genotype c infection. J. Med. Virol. 2008, 80, 1337–1343. [Google Scholar] [CrossRef]
  52. Rozanov, M.; Plikat, U.; Chappey, C.; Kochergin, A.; Tatusova, T. A web-based genotyping resource for viral sequences. Nucleic Acids Res. 2004, 32, W654–W659. [Google Scholar] [CrossRef] [Green Version]
  53. Hayer, J.; Jadeau, F.; Deléage, G.; Kay, A.; Zoulim, F.; Combet, C. Hbvdb: A knowledge database for hepatitis b virus. Nucleic Acids Res. 2013, 41, D566. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. Price, M.N.; Dehal, P.S.; Arkin, A.P. Fasttree: Computing large minimum evolution trees with profiles instead of a distance matrix. Mol. Biol. Evol. 2009, 26, 1641–1650. [Google Scholar] [CrossRef] [PubMed]
  55. Price, M.N.; Dehal, P.S.; Arkin, A.P. Fasttree 2—Approximately maximum-likelihood trees for large alignments (fasttree 2). PLoS ONE 2010, 5, e9490. [Google Scholar] [CrossRef]
  56. Stamatakis, A. Raxml-vi-hpc: Maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics 2006, 22, 2688–2690. [Google Scholar] [CrossRef] [PubMed]
  57. Katoh, K.; Standley, D.M. Mafft multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 2013, 30, 772–780. [Google Scholar] [CrossRef] [Green Version]
  58. Katoh, K.; Misawa, K.; Kuma, K.-I.; Miyata, T. Mafft: A novel method for rapid multiple sequence alignment based on fast fourier transform. Nucleic Acids Res. 2002, 30, 3059. [Google Scholar] [CrossRef] [Green Version]
  59. Martin, D.P.; Murrell, B.; Golden, M.; Khoosal, A.; Muhire, B. Rdp4: Detection and analysis of recombination patterns in virus genomes. Virus Evol. 2015, 1, vev003. [Google Scholar] [CrossRef] [Green Version]
  60. Martin, D.; Rybicki, E. Rdp: Detection of recombination amongst aligned sequences. Bioinformatics 2000, 16, 562. [Google Scholar] [CrossRef]
  61. Padidam, M.; Sawyer, S.; Fauquet, C.M. Possible emergence of new geminiviruses by frequent recombination. Virology 1999, 265, 218–225. [Google Scholar] [CrossRef] [Green Version]
  62. Smith, J. Analyzing the mosaic structure of genes. J. Mol. Evol. 1992, 34, 126–129. [Google Scholar] [CrossRef]
  63. David, P.; Keith, A.C. Evaluation of methods for detecting recombination from DNA sequences: Computer simulations. Proc. Natl. Acad. Sci. USA 2001, 98, 13757. [Google Scholar]
  64. Lam, H.M.; Ratmann, O.; Boni, M.F. Improved algorithmic complexity for the 3seq recombination detection algorithm. Mol. Biol. Evol. 2018, 35, 247–251. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  65. Guindon, S.; Dufayard, J.-F.; Lefort, V.; Anisimova, M.; Hordijk, W.; Gascuel, O. New algorithms and methods to estimate maximum-likelihood phylogenies: Assessing the performance of phyml 3.0. Syst. Biol. 2010, 59, 307–321. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  66. Bell, T.G.; Kramvis, A. Mutation reporter tool: An online tool to interrogate loci of interest, with its utility demonstrated using hepatitis b virus. Virol. J. 2013, 10, 62. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  67. Bell, T.G.; Kramvis, A. Bioinformatics tools for small genomes, such as hepatitis b virus. Viruses 2015, 7, 781–797. [Google Scholar] [CrossRef] [Green Version]
  68. Rozas, J.; Ferrer-Mata, A.; Sánchez-DelBarrio, J.C.; Guirao-Rico, S.; Librado, P.; Ramos-Onsins, S.E.; Sánchez-Gracia, A. Dnasp 6: DNA sequence ppolymorphism analysis of large data sets. Mol. Biol. Evol. 2017, 34, 3299–3302. [Google Scholar] [CrossRef]
  69. Bayliss, J.; Yuen, L.; Rosenberg, G.; Wong, D.; Littlejohn, M.; Jackson, K.; Gaggar, A.; Kitrinos, K.M.; Subramanian, G.M.; Marcellin, P.; et al. Deep sequencing shows that hbv basal core promoter and precore variants reduce the likelihood of hbsag loss following tenofovir disoproxil fumarate therapy in hbeag-positive chronic hepatitis b. Gut 2017, 66, 2013. [Google Scholar] [CrossRef]
  70. Neumann-Fraune, M.; Beggel, B.; Kaiser, R.; Obermeier, M. Hepatitis B virus drug resistance tools: One sequence, two predictions. Intervirology 2014, 57, 232–236. [Google Scholar] [CrossRef]
  71. Chen, X.; Gao, J.; Ji, Z.; Zhang, W.; Zhang, L.; Xu, R.; Zhang, J.; Li, F.; Li, S.; Hu, S.; et al. A description of the hepatitis b virus genomic background in a high-prevalence area in China. Virol. J. 2014, 11, 101. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Flowchart of genotype assignment for each sequence retrieved from the NCBI nucleotide database.
Figure 1. Flowchart of genotype assignment for each sequence retrieved from the NCBI nucleotide database.
Viruses 12 00427 g001
Figure 2. Selected mutations across RT, HBsAg, NRE, BCP, PC, C and X ORFs for analysis. Nucleotide position 1 on an HBV genome corresponds to the HBV EcoR1 restriction site.
Figure 2. Selected mutations across RT, HBsAg, NRE, BCP, PC, C and X ORFs for analysis. Nucleotide position 1 on an HBV genome corresponds to the HBV EcoR1 restriction site.
Viruses 12 00427 g002
Figure 3. Number of HBV complete genomes retrieved through each step of data collection.
Figure 3. Number of HBV complete genomes retrieved through each step of data collection.
Viruses 12 00427 g003
Figure 4. Distribution of HBV genotypes and recombinants by country in Southeast Asia, Australia and New Zealand. Pie charts indicate the percentage of HBV genotypes with the diameters. Countries with human HBV complete genomes collectable from NCBI nucleotide database were highlighted in bold and uppercase.
Figure 4. Distribution of HBV genotypes and recombinants by country in Southeast Asia, Australia and New Zealand. Pie charts indicate the percentage of HBV genotypes with the diameters. Countries with human HBV complete genomes collectable from NCBI nucleotide database were highlighted in bold and uppercase.
Viruses 12 00427 g004
Figure 5. Nucleotide diversity (Pi) over HBV genome. The MAFF-aligned genomes had 3434 bp in length; genotypes were colour-coded (sites with gaps in the complete data file were excluded; window length 100 bp, step size 25 bp).
Figure 5. Nucleotide diversity (Pi) over HBV genome. The MAFF-aligned genomes had 3434 bp in length; genotypes were colour-coded (sites with gaps in the complete data file were excluded; window length 100 bp, step size 25 bp).
Viruses 12 00427 g005
Table 1. Identified HBV genotypes by country.
Table 1. Identified HBV genotypes by country.
GenotypeABCDGIB/BC or C/CBG/GCTotal
Country
Australia1 66 67 (7.2%)
Cambodia 424 28 (3%)
Indonesia 82519 142 (15.3%)
Laos 14 14 (1.5%)
Malaysia21701164 4 296 (31.8%)
Myanmar 18 18 (1.9%)
New Zealand 636 42 (4.5%)
Philippines9710 26 (2.8%)
Thailand 1696 1112117 (12.6%)
Vientam 13143 6 180 (19.4%)
Total12 (1.3%)410 (44.1%)430 (46.2%)49 (5.3%)1 (0.1%)21 (2.3%)5 (0.5%)2 (0.2%)930 (100%)
Table 2. HBV recombination detection by five automatic methods built in RDP4.
Table 2. HBV recombination detection by five automatic methods built in RDP4.
Detection MethodRDP3SEQChimaeraGeneconvMaxchi
Events/Recombinants
Recombination events8380498741
Potential recombinants (possibly present in different events)46761374411991355
Potential recombinants (after removing replicate recombinants)380566510857548
Recombinants identified by all methods: 83 (65 sequences from our data and 18 reference sequences)
+ with overlapping breakpoint positions: 35 (31 sequences from our data and 4 reference sequences)
+ with at least one same major parent: 10 (8 sequences from our data and 2 reference sequences)
Table 3. Predicted HBV serotypes by genotype and country in Southeast Asia, Australia and New Zealand.
Table 3. Predicted HBV serotypes by genotype and country in Southeast Asia, Australia and New Zealand.
Serotype adr
(n = 335, 36%)
adw2
(n = 270, 29%)
adw3
(n = 7, 0.8%)
ayr
(n = 6, 0.7%)
ayw1
(n = 185, 19.9%)
ayw2
(n = 45, 4.8%)
ayw3
(n = 70, 7.5%)
Unassigned (n = 12, 1.3%)
Genotype/
Country
A 9 2 1
Australia 1
Malaysia 1 1
Philippines 7 2
B 2185 172168
Cambodia 2 2
Indonesia 461 34 1
Malaysia 1414 201 4
Philippines 1 6
Thailand 12 4
Vietnam 16 106 63
C3312916 603
Australia6 60
Cambodia231
Indonesia453 3
Malaysia10751 3
Myanmar171
New Zealand6
Philippines19
Thailand867 3
Vietnam403
D 1444
Indonesia 18
Malaysia 4
New Zealand 36
G 1
Thailand 1
I 11 10
Laos 4 10
Thailand 1
Vietnam 6
B/BC111
Thailand 1
Malaysia11
C/CB11
Malaysia11
G/GC2
Thailand2
Table 4. DNA polymorphism of HBV genotypes identified in Southeast Asia, Australia and New Zealand.
Table 4. DNA polymorphism of HBV genotypes identified in Southeast Asia, Australia and New Zealand.
GenotypeNumber of Polymorphic (Segregating) SitesAverage Number of Nucleotide Differences (k)Nucleotide Diversity (Pi)
Genotype A (n = 12)4410.7880.00775
Genotype B (n = 410)45411.5150.00827
Genotype C (n = 430)44612.5480.00901
Genotype D (n = 49)687.7700.00558
Genotype I (n = 21)378.1480.00585
Total N = 930 (including genotype G and recombinants)67620.9440.01505
Table 5. Number and percentage of selected mutations by HBV genotype 1.
Table 5. Number and percentage of selected mutations by HBV genotype 1.
Mutation Type (Region)A (n = 12)B (n = 410)C (n = 430)D (n = 49)G (n = 1)I (n = 21)B/BC, C/CB, G/GC (n = 7)Total (n = 930)
Drug resistance mutations (polymerase)1 (8%)15 (3.7%)22 (5%)2 (4%)00040 (4.3%)
Escape mutations (pre-S/S)2 (16%)41 (10%)79 (18%)8 (16%)02 (10%)2 (27%)134 (14.4%)
c.1613G>A (NRE)2 (16%)44 (11%)67 (15%)01 (100%)4 (19%)1 (14%)119 (12.8%)
Only c.1762A>T (BCP)07 (2%)3 (0.7%)000010 (1.1%)
Only c.1764G>A (BCP)1 (8%)3 (0.7%)20 (5%)1 (2%)00025 (2.7%)
Double c.1762A>T/ c.1764G>A (BCP)5 (42%)60 (15%)159 (37%)7 (14%)1 (100%)5 (24%)2 (27%)239 (25.7%)
c.1896G>A (PC)0193 (47%)77 (18%)13 (27%)04 (19%)2 (27%)289 (31.1%)
p.P130T (C)028 (6.8%)60 (14%)01 (100%)0089 (9.6%)
p.V5M (X)08 (2%)11 (2.6%)0001 (14%)20 (2.2%)
p.V5L (X)10 (83%)401 (97%)395 (92%)48 (97%)1 (100%)21 (100%)6 (86%)882 (95%)
1 Note: One genome can acquire one or more variations.

Share and Cite

MDPI and ACS Style

Phan, N.M.H.; Faddy, H.; Flower, R.; Spann, K.; Roulis, E. In Silico Analysis of Genetic Diversity of Human Hepatitis B Virus in Southeast Asia, Australia and New Zealand. Viruses 2020, 12, 427. https://0-doi-org.brum.beds.ac.uk/10.3390/v12040427

AMA Style

Phan NMH, Faddy H, Flower R, Spann K, Roulis E. In Silico Analysis of Genetic Diversity of Human Hepatitis B Virus in Southeast Asia, Australia and New Zealand. Viruses. 2020; 12(4):427. https://0-doi-org.brum.beds.ac.uk/10.3390/v12040427

Chicago/Turabian Style

Phan, Ngoc Minh Hien, Helen Faddy, Robert Flower, Kirsten Spann, and Eileen Roulis. 2020. "In Silico Analysis of Genetic Diversity of Human Hepatitis B Virus in Southeast Asia, Australia and New Zealand" Viruses 12, no. 4: 427. https://0-doi-org.brum.beds.ac.uk/10.3390/v12040427

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop