Next Article in Journal
Molecular Characteristic, Protein Distribution and Potential Regulation of HSP90AA1 in the Anadromous Fish Coilia nasus
Previous Article in Journal
Genetics of Congenital Heart Defects: The NKX2-5 Gene, a Key Player
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Complete Genome of Brucella Suis 019 Provides Insights on Cross-Species Infection

1
College of Medicine, Shihezi University, Xinjiang 832000, China
2
Co-Innovation Center for Zoonotic Infectious Diseases in the western region, Shihezi University, Xinjiang 832000, China
3
College of Animal Science and Technology, Shihezi University, Xinjiang 832000, China
4
College of Life Sciences, Nankai University, Tianjin 300071, China
5
College of Life Sciences, Shihezi University, Xinjiang 832000, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Submission received: 29 October 2015 / Revised: 13 January 2016 / Accepted: 19 January 2016 / Published: 26 January 2016

Abstract

:
Brucella species are the most important zoonotic pathogens worldwide and cause considerable harm to humans and animals. In this study, we presented the complete genome of B. suis 019 isolated from sheep (ovine) with epididymitis. B. suis 019 has a rough phenotype and can infect sheep, rhesus monkeys and possibly humans. The comparative genome analysis demonstrated that B. suis 019 is closest to the vaccine strain B. suis bv. 1 str. S2. Further analysis associated the rsh gene to the pathogenicity of B. suis 019, and the WbkA gene to the rough phenotype of B. suis 019. The 019 complete genome data was deposited in the GenBank database with ID PRJNA308608.

1. Introduction

Brucella is a genus of Gram-negative bacteria. They are small (0.5 to 0.7 by 0.6 to 1.5 µm), non-encapsulated, flagellated, facultatively intracellular coccobacilli [1]. Brucella causes the brucellosis in wild and domestic animals, even when transmitted from human to human. The brucellosis can have a considerable impact on human and animal health, as well as on economics, especially in developing countries where rural income relies largely on livestock breeding and dairy products [2]. The genus Brucella is generally classified into 10 species, which are Brucella abortus, Brucella melitensis, Brucella suis, Brucella ovis, Brucella canis, Brucella neotomae, Brucella pinnipedialis, Brucella ceti, Brucella microti, and Brucella inopinata, based on host preference and phenotypic characteristics [3,4]. Among these species, B. melitensis, B. suis and B. canis distribute more widely and virulently.
The strain named B. suis 019 infected sheep (ovine), rhesus monkeys and possibly humans. The 019 strain was first discovered in the 1980’s when the sheep epididymitis, usually caused by the B. ovis, broke out widely in the province of Xinjiang, China. After that, the 019 strain was isolated from the semen of sick sheep (ovine) and initially identified as a B. ovis strain by the serological and bacteriological tests [5]. Then, this identification was confirmed by the biochemical tests [6]. Later, the significant differences between the 019 strain and the other B. ovis strains were found through a series of experiments. The animal experiments proved the 019 strain infected rhesus monkeys and caused damage to many organs [7]. The molecular biological experiments showed some featured genes of the 019 strain were quite different from those of B. ovis [8]. In 2010, Wang et al. revealed that there were significant differences between the 019 strain and the 63/290 reference strain on both DNA and amino acid levels and concluded that the 019 strain was a unique local strain to Xinjiang [9]. However, the taxonomic status and infection mechanism of the 019 strain were still confusing.
In 2013, we assembled the draft genome of B. suis 019 using 90 bp Next-generation sequencing (NGS) technology and performed the comparative genomic analysis to reveal that the 019 strain belongs to B. suis and is far from B. ovis or B. melitensis. Although the B. suis 019 draft genome made effective progress, the draft genome missed some important information, e.g., genomic structure variation or rearrangement. Since pathogenic bacteria often exhibit a high degree of genomic rearrangement [10], we assembled the complete genome of B. suis 019 using the 250 bp NGS technology with Sanger sequencing confirmation. We also compared the B. suis 019 complete genome with the other 15 Brucella complete genomes to reach two research goals: 1) to confirm the taxonomic status of B. suis 019 strain based on the complete genome analysis; 2) to associate B. suis 019 strain’s rough phenotype and pathogenicity to some sequence features on the genome level.

2. Results and Discussion

2.1. Complete Genome Sequencing, Assembly and Annotation

The raw NGS data contained 2 × 688,568 paired reads with the length of 251 bp. After removing low quality regions, adapters and viral sequences, a total of 1,368,448 cleaned reads were produced for genome assembly. Using the cleaned reads, 14 and 6 scaffolds were assembled for chromosome 1 and 2. Then, we used the PCR plus Sanger sequencing to fill the gaps (Methods), producing the B. suis 019 complete genome (80× depth) containing two chromosomes with the length 2,098,391 bp and 1,204,433 bp, respectively (Supplementary file 1). The assembled B. suis 019 complete genome has a total sequence length of 3,302,824 bp, which is 3717 bp longer than the total length of the draft genome. This complete genome has the GC content 57.27%, which is very close to the GC content 57.28% of the draft genome. We predicted 1972 and 1119 proteins for 019 chromosome 1 and 2 (Supplementary file 2). Compared to the predicted 3529 ORFs using the draft genome, 3091 is closer to the total protein number of the other Brucella complete genomes (Table 1). All of the predicted proteins were annotated by the NCBI NR database and the Gene Ontology terms (Supplementary file 3). These proteins were predicted to involve 125 KEGG metabolism pathways (Supplementary file 4).
Table 1. 18 Brucella complete genomes.
Table 1. 18 Brucella complete genomes.
StrainChr1_IDLengthGen#Chr2_IDLengthGen#CG%
* B. abortus A13334NC_016795.12,123,7732086NC_016777.11,162,259110257.40
B. abortus bv.1 str 9-941NC_006932.12,124,2412082NC_006933.11,162,204110357.22
B. abortus S19NC_010742.12,122,4872089NC_010740.11,161,449110657.22
B. canis ATCC 23365NC_010103.12,105,9692022NC_010104.11,206,800113157.24
B. canis HSKA 52141NC_016778.12,107,0232019NC_016796.11,170,489109857.24
B. melitensis 16MNC_003317.12,117,1442040NC_003318.11,177,787110757.22
B. melitensis ATCC 23457NC_012441.12,125,7012059NC_012442.11,185,518111757.22
B. melitensis biovar Abortus 2308NC_007618.12,121,3592086NC_007624.11,156,948110457.22
B. melitensis M28NC_017244.12,126,1332058NC_017245.11,185,615111857.22
B. melitensis M5-90NC_017246.12,126,4512062NC_017247.11,185,778111857.22
B. melitensis NINC_017248.12,117,7172051NC_017283.11,176,758111257.23
B. microti CCM 4915NC_013119.12,117,0502024NC_013118.11,220,319113557.25
B. ovis ATCC 25840NC_009505.12,111,3702068NC_009504.11,164,220112257.19
B. pinnipedialis B2/94NC_015857.12,138,3422081NC_015858.11,260,926118857.20
B. suis 1330NC_017251.12,107,7832014NC_017250.11,207,380113057.25
* B. suis ATCC 23445NC_010169.11,923,7631848NC_010167.11,400,844132757.21
B. suis VBI22NC_016797.12,108,6372015NC_016775.11,207,451113257.25
B. suis 019CP013963.12,098,3911972CP013964.11,204,433111957.27
* These data were not used in this study. Chr1_len is the length of chromosome 1. Chr2_len is the length of chromosome 2. Gen# is the total gene number on this chromosome. Chr1_ID and Chr2_ID is the RefSeq or Genbank Accession Number.

2.2. Phylogenetic Analysis

Using 2,537 homologous genes from 51 Brucella genomes including the B. suis 019 draft genome (Methods), Phylogenetic Tree 1 was built to show six well-supported clades including B. melitensis, B. abortus, B. ovis, Brucella from marine mammals, B. suis and canis, and others (Figure 1A). These results confirmed that B. suis 019 belongs to the B. suis and is closest to B. suis bv. 1 str. S2 (Figure 1B). The vaccine strain B. suis S2, which had been developed in China in the 1970’s, was effective for oral vaccination of sheep, goats, cattle and pigs [11]. B. suis S2 has been widely used for prevention of animal brucellosis in China over many years.
Using chromosome 1 and 2 sequences from 16 Brucella complete genomes (Table 1), we built Phylogenetic Trees 2 and 3, separately (Figure 1C,D). Phylogenetic Trees 2 and 3 had congruence with Phylogenetic Tree 1 on three points. The first point was that B. suis 019 belongs to B. suis species rather than B. ovis. The second point was that the debated B. melitensis biovar Abotus 2308 belongs to the B. abortus and is not a biovar of B. melitensis. The last point was that the B. ovis is far from other species, which confirmed a previous study. In that study, Foster et al. used 20,154 SNPs from 13 Brucella genomes to show that most Brucella species had diverged from their common B. ovis ancestor in the past 86,000 to 296,000 years, which preceded the domestication of their livestock hosts [12].
Using 16 complete genomes, we found the discrepancy between Phylogenetic Trees 2 and 3. Phylogenetic Tree 2 revealed a well supported topology that placed B. ovis, B. microti, B. canis, B. suis, B. pinnipedialis, B. abortus and B. melitensis in different clades (Figure 1C). Meanwhile Phylogenetic Tree 3 classified B. ovis and B. pinnipedialis into one clade and did not classify B. suis and B. canis into two well separated groups as Phylogenetic Tree 2 did. The latter phenomenon, named the paraphyly of B. suis, was discovered in two previous studies [12]. The results in these study suggested the paraphyly of B. suis could be attributed to chromosome 2 (Phylogenetic Tree 3). To further investigate the relationship between different Brucella species, we conducted a collinear analysis of 16 complete genomes to provide more detailed information on the genomic regions.
Figure 1. The phylogenetic trees. A. Phylogenetic Tree 1 was built using homologous genes from 51 Brucella genomes (including 019 draft genome). B. A magnified view of Brucella suis and canis clades was from Phylogenetic Tree 1. C. Phylogenetic Tree 2 was built using chromosome 1 sequences from 15 Brucella complete genomes (including 019 complete genome). D. Phylogenetic Tree 3 was built using chromosome 2 sequences from 15 Brucella complete genomes (including 019 complete genome).
Figure 1. The phylogenetic trees. A. Phylogenetic Tree 1 was built using homologous genes from 51 Brucella genomes (including 019 draft genome). B. A magnified view of Brucella suis and canis clades was from Phylogenetic Tree 1. C. Phylogenetic Tree 2 was built using chromosome 1 sequences from 15 Brucella complete genomes (including 019 complete genome). D. Phylogenetic Tree 3 was built using chromosome 2 sequences from 15 Brucella complete genomes (including 019 complete genome).
Genes 07 00007 g001

2.3. Comparative Genome Analysis Using 16 Complete Genomes

Chromosome sequences from B. suis 019 and the other 15 Brucella species were aligned using the Mauve software (Methods). Mauve identified nine and seven locally collinear blocks (LCBs) for the 019 chromosome 1 and 2 (Supplementary file 5). LCBs are conserved segments that appear to be internally free from genome rearrangements [13]. Compared to the smaller LCB size in genomes of other genura (e.g., 78 LCBs in Yersinia [10] and 243 LCBs in Shewanella [14]), the large LCB size in Brucella genomes reflected the higher conservation in the Brucella genome structure. If not considering four LCBs with length 269, 300, 149 and 400 bp, five large LCBs with lengths 735,668, 465,416, 198,078, 6148 and 691,535 bp covered 99.63% (2,090,697/2,098,391) of the 019 chromosome 1 (Figure 2). Seven LCBs from the 019 chromosome 2 had the lengths 6490, 46,795, 63,955, 442,686, 3952, 319,146 and 321,153 bp (Figure 3). The total length of these seven LCBs was 1,204,177 bp, covering 99.98% (1,204,177/1,204,433) of the 019 chromosome 2. Overall, the Brucella chromosome 2 had a higher degree of genome rearrangements. A 46,795 bp LCB was almost absent on the B. ovis 25840 genome and inversed on the B. melitensis 23457, M28 and M5-90 genome. A 765,784 bp inversion including three LCBs was observed on the B. abortus S19, 2308 and 9-941 genome. A 3952 bp inversion was observed on the B. ovis 25840 genome. These results supported a previous conclusion that Brucella chromosome 2 is more dynamic, perhaps owing to its hypothesized origin as a plasmid [12].
Since B. suis 019 is closest to B. suis S2 among all the B. suis species (Figure 1A), we conducted a CDS syntenic analysis between the B. suis 019 complete genome and the B. suis S2 draft genome. The results showed a good conservation of synteny and collinearity between B. suis 019 and the B. suis S2 genome (Figure 4). Then, we blasted all the CDS sequences of B. suis S2 to the B. suis 019 complete genome. All of the 3230 CDS sequences of B. suis S2 could be covered by the hits from the B. suis 019 genome, 99.54% (3215/3230) of which have the identity 100% to the query sequences. We also blasted all the CDS sequences of B. suis 019 to the B. suis S2 draft genome. All of the 3091 CDS sequences of B. suis 019 were able to be covered by the hits from the B. suis S2 genome, 99.78% (3084/3091) of which have the identity 100% to the query sequences. The comparison between B. suis 019 and the B. suis S2 showed there were eight genes absent in the B. suis 019 (Table 2) and 19 genes with significant mutation between B. suis S2 and B. suis 019 (Supplementary file 6). Among 19 genes, four genes have a single copy in the B. suis 019 complete genome (Table 2). Particularly, we found that a 21 bp nucleotide deletion in the rsh gene resulted in a seven amino acid deletion QKRASGD. Based on search results from the NCBI website, we reported this as a new mutation of rsh gene.
Figure 2. Nine LCBs on Brucella chromosome 1. Nine LCBs on chromosome 1 were built using 16 Brucella complete genomes. The blocks in the same color are connected by lines. A phylogenetic map of the strains derived from Phylogenetic Tree 2 (topology only) is on the left side.
Figure 2. Nine LCBs on Brucella chromosome 1. Nine LCBs on chromosome 1 were built using 16 Brucella complete genomes. The blocks in the same color are connected by lines. A phylogenetic map of the strains derived from Phylogenetic Tree 2 (topology only) is on the left side.
Genes 07 00007 g002
The product of rsh named GTP pyrophosphokinase rsh (EC: 2.7.6.5) is a mediator of the stringent response that coordinates a variety of cellular activities in response to changes in nutritional abundance. Rsh is required for Brucella to express the type IV secretion system VirB, a major virulence factor of Brucella and therefore plays a role in adaptation to low-nutrient environments. This was evidenced using the Rsh deletion mutants in B. suis and B. melitensis in a previous study [15]. Comparative transcriptional analysis between B. suis 1330 wild-type and Δrsh mutant showed the Rsh-dependent up-regulation of 198 genes and down-regulation of 181 genes, which together account for 11.6% of the genome [16]. The Rsh protein (Uniprot: Q8CY42) with a length of 750 AA (amino acids) has four Pfam domains, HD_4 (residues 26-176), RelA_SpoT (residues 235-346), TGS (residues 392-451) and ACT_4 (residues 669-747). The seven amino acid deletion (residues 37-43) belongs to the HD_4 domain. We used the PredictProtein server (https://www.predictprotein.org) to analyze the rsh properties and predicted that all of the deleted seven amino acids had the coil secondary structures. These seven amino acids were predicted in the disorder regions. Moreover, the Glutamine (Q) on residue 37 was predicted as a protein binding region. Compared to the other three single copy genes (Table 2), the rsh gene with the seven amino acid deletion is more likely to be associated to the acquired pathogenicity of the B. suis 019.
Figure 3. Seven LCBs on Brucella chromosome 2. Seven LCBs on chromosome 2 were built using 16 Brucella complete genomes. The blocks in the same color are connected by lines. A phylogenetic map of the strains derived from Phylogenetic Tree 3 (topology only) is on the left side.
Figure 3. Seven LCBs on Brucella chromosome 2. Seven LCBs on chromosome 2 were built using 16 Brucella complete genomes. The blocks in the same color are connected by lines. A phylogenetic map of the strains derived from Phylogenetic Tree 3 (topology only) is on the left side.
Genes 07 00007 g003
Figure 4. The syntenic map between the B. suis S2 and 019. The syntenic map between the B. suis S2 and B. suis 019 strain was acquired on the CoGe website. The CDS sequences of B. suis S2 chromosome 1 and 2 used CP006961.1 and CP006962.1 from the GenBank database.
Figure 4. The syntenic map between the B. suis S2 and 019. The syntenic map between the B. suis S2 and B. suis 019 strain was acquired on the CoGe website. The CDS sequences of B. suis S2 chromosome 1 and 2 used CP006961.1 and CP006962.1 from the GenBank database.
Genes 07 00007 g004
Table 2. Significant different genes between B. suis 019 and S2.
Table 2. Significant different genes between B. suis 019 and S2.
Gene-IDChrLengthCopyProduct
BSS2_I051217951integrase catalytic subunit
BSS2_I051711119>1mannosyltransferase
BSS2_I051813691transposase
BSS2_I051913421IS5 family transposase orfB
BSS2_I08981165>1hypothetical protein
BSS2_I1794119291hypothetical protein
BSS2_I179516751hypothetical protein
BSS2_II052721965>1cell wall surface protein
chr1_239122321gtp pyrophosphokinase rsh
chr1_277111401cytochrome c-type biogenesis protein
chr1_995147821outer membrane autotransporter barrel domain-containing protein
chr1_1847177713-mercaptopyruvate sulfurtransferase
Gene-ID has two formats. The format “BSS2_xxxx” is the tag name in the GenBank data CP006961.1 (B. suis S2 chromosome 1) and CP006962.1 (B. suis S2 chromosome 2). The format “chrx_xxxx” is the gene ID of B. suis 019 complete genome (Supplementary files 1 and 2). The first eight genes are absent in the B. suis 019 complete genome. The other four genes have significant mutation from B. suis S2 to B. suis 019.

2.4. Beta-Ketoadipate Pathway and Lipopolysaccharide

One important characteristic of Brucella is that it shares two pathways with the soil microorganisms. These two pathways, beta-ketoadipate pathway and homoprotocatechuate pathway are widely distributed among diverse soil microorganisms and play a central role in the processing and degradation of plant-derived aromatic compounds. The B. suis 1330 genome (GenBank: AE014291.4 and AE014292.2) includes these two intact pathways, the genes of which are located on chromosome 2 [17]. The homoprotocatechuate pathway includes eight protein-coding genes (AE014292.2: BRA1155–BRA1162). These genes numbered from chr2_1076 to chr2_1084 (Table 3) were found in B. suis 019 in the same order on chromosome 2 with sequence identity 100%. The beta-ketoadipate pathway includes 12 protein-coding genes (AE014292.2:BRA0636–BRA0647). One previous study showed that at least 1 of the 12 genes carried by every Brucella genome except B. suis 1330 has become a pseudogene and 12 genes are completely missing in B. suis ATCC 23445 [18]. These 12 genes numbered from chr2_443 to chr2_454 (Table 3) were found in B. suis 019 in the same order on chromosome 2 with sequence identity 100%.
Table 3. Featured genes of Brucella.
Table 3. Featured genes of Brucella.
1330-IDChrStartEnd019-IDStartEndProduct
BRA06362617674618876chr2_454490638491840beta-ketoadipyl CoA thiolase
BRA06372618885619574chr2_4534899404906293-oxoadipate CoA-transferase, beta subunit
BRA06382619571620278chr2_4524892364899433-oxoadipate CoA-transferase, alpha subunit
BRA06392620462621232chr2_451488282489052transcriptional regulator PcaR, putative
BRA06402621239622156chr2_450 487358488275pobR protein
BRA06412622269623438chr2_449486076487245p-hydroxybenzoate hydroxylase
BRA06422623848624756chr2_448484758485666transcriptional regulator PcaQ
BRA06432624850625653chr2_447 4838614846643-oxoadipate enol-lactone hydrolase
BRA06442625657626073chr2_4464834414838604-carboxymuconolactone decarboxylase
BRA06452626070626810chr2_445482704483444protocatechuate 3,4-dioxygenase, beta subunit
BRA06462626812627429chr2_444 482085482702protocatechuate 3,4-dioxygenase, alpha subunit
BRA06472627433628497chr2_443 4810174820813-carboxy-cis,cis-muconate cycloisomerase, putative
BRA1155211552551156661chr2_108411572621158668aldehyde dehydrogenase family protein
BRA1156211568181157627chr2_1082115629611571052,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase
BRA1157211576941157804chr2_1081x11561191156229hypothetical protein
BRA1158211578451158648chr2_1081115527511560782-keto-4-pentenoate hydratase
BRA1159211586521159518chr2_108011544051155271fumarylacetoacetate hydrolase family protein
BRA1160211595871160567chr2_107811533561154336catechol 2,3-dioxygenase (pseudo)
BRA1161211606221162136chr2_1077115178711533015-carboxy-2-hydroxymuconate semialdehyde dehydrogenase
BRA1162211622061162598chr2_1076115132511517175-carboxymethyl-2-hydroxymuconate delta isomerase
BR005816482466455chr1_179 820216821847phosphoglucomutase
BR05101509856511724chr1_359 374961376829epimerase/dehydratase, putative
BR05111511711512718chr1_358373967374974glycosyl transferase, group 4 family protein
BR0517 *1516587517366chr1_353369319370098formyltransferase, putative
BR0519 *1518244519002chr1_351367683368441O-antigen export system ATP-binding protein RfbE
BR05201518999519781chr1_350366904367686O-antigen export system permease protein RfbD
BR05211519796520899chr1_349365786366889perosamine synthase, putative
BR05221520907521995chr1_348364690365778GDP-mannose 4,6-dehydratase
BR05291525257526375NA498887498927mannosyltransferase, putative
BR05371529702531039chr1_345361179362516phosphomannomutase, putative
BR0538 *1531064532488chr1_344359730361154mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase
BR0539 *1532521533693chr1_343358525359697mannose-6-phosphate isomerase
BR05401533776534885chr1_342357333358442glycosyl transferase, group 1 family protein
BR06151606884608995chr1_269283223285334membrane protein, putative
BR09811948161949393chr1_191520412372042469glycosyl transferase WboA
BR09821949390950949chr1_191420396812041240glycosyl transferase, group 1 family protein
BR1503 *114578021458866chr1_144115316641532728lipopolysaccharide core biosynthesis mannosyltransferase LpcC
BRA0347 *2326808328223chr2_723778336779751mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase
BRA03482328220329653chr2_722776906778339phosphoglucomutase, putative
* Three groups of featured genes are separated by a blank line. The first group of 12 genes involves in the beta-ketoadipate pathway. The second group of 8 genes involves in the homoprotocatechuate pathway. The third group of 19 genes are indicated as being important in producing smoothness. 1330-ID uses tags in the GenBank data AE014291.4 and AE014292.2. 019-ID uses gene ID of B. suis 019 genome (Supplementary files 1 and 2).
Lipopolysaccharide (LPS) is the major structural component of the outer membrane of gram-negative bacteria. It is composed of a lipid core, a core oligosaccharide, and a distal O-polysaccharide (O-PS) side chain [19]. The presence of the intact O-PS produces smooth phenotypes in B. melitensis, B. suis and B. abortus, while the absence or disruption of O-PS produces rough phenotypes in B. canis and B. ovis with the lipid core and the core oligosaccharide. An unexpected finding on the B. suis 019 was its rough morphology. Several studies indicated specific genes are important for the development of the smooth phenotype in Brucella [19]. Until now, 19 genes have been indicated as being important in producing smoothness. The disruption of 13 genes, Pgm (BR0058), WbkD (BR0510), WbkF (BR0511), Wzm (BR0520), Per (BR0521), Gmd (BR0522), WbkA (BR0529), ManB (BR0537), WbkE (BR0540), Wa** (BR0615), WboA (BR0981), WboB (BR0982) and ManBcore (BRA0348), resulted in a rough phenotype in B. melitensis and six genes, WbkC (BR0517), Wzt (BR0519), ManC (BR0538), ManA (BR0539), LpcC (BR1503) and ManCcore (BRA0347), were identified as playing roles that had not been fully determined (Table 3). Based on the alignment results, we found that the WbkA gene was disrupted in the B. suis 019 complete genome with the other 18 genes 100% identical to their orthologs in B. suis 1330. Previous studies have demonstrated that spontaneous excision of the WbkA glycosyltranferase gene was a cause of dissociation of smooth to rough Brucella [20]. Therefore, we proposed that the disruption of the WbkA gene resulted in the rough B. suis 019.

3. Materials and Methods

3.1. Sample Preparation, DNA-seq Library Construction

The B. suis 019 strain was obtained from the key laboratory of prevention and control of animal disease, Shihezi University. This bacteria had been originally isolated from the sperm of sheep in the province of Xinjiang in China in 1983, then processed by freeze-drying for long-term preservation. In this study, B. suis 019 was cultured at 37 °C for 72 h using the streak plate method. Single bacterial colonies were inoculated in the TS broth at 37 °C with shaking for 48 h. Bacteria were collected by centrifugation at 10,000 rpm for 1 min and washed twice with sterile deionized water. Total genomic DNA was extracted and purified by the GENEray™ Bacteria Whole Genome DNA Extraction Kit GK1072 (Generay Biotech, Shanghai, China). The DNA purity and concentration was measured by a NanoDrop™ spectrophotometer. DNA fragmentation was conducted using an ultrasound machine. DNA fragments of around 500 bp size were separated and collected using Agarose Gel Electrophoresis. Finally, one DNA library was constructed using the Illumina TruSeq™ (Illumina, San Diego, CA, USA) DNA Sample Prep Kits for the draft genome sequencing. The same procedure was conducted to construct another DNA library for the complete genome sequencing by a different experimenter one year later.

3.2. Draft Genome Sequencing, Assembly and Annotation

The DNA-seq library was sequenced using the Illumina HiSeq™ 2000 platform. De novo assembly of the B. suis 019 draft genome was performed using the SOAPdenovo 1.05 [21]. Gene prediction was performed using the software Glimmer 3.02 [22]. The raw NGS data contained paired reads with the left read length of 90 bp and right read length of 70 bp. We produced a total of 330 M bp cleaned NGS data, roughly covering 100 fold (100×) of the B. suis 019 draft genome. The assembled B. suis 019 draft genome has a total sequence length of 3,299,107 bp with the GC content 57.28%. This assembly produced 30 scaffolds and 722 contigs (Methods). The scaffold N50 and contig N50 is 259,978 bp and 7677 bp, respectively. We predicted 3,529 ORFs with the average length of 804 bp. This data was submitted to the GenBank WGS database with ID ANOZ00000000.

3.3. Complete Genome Sequencing, Assembly and Annotation

The DNA-seq library was sequenced using the Illumina HiSeq™ 2000 platform. After removing low quality regions, adapters and viral sequences, the cleaned reads were produced for genome assembly using the software Fastq_clean [23]. De novo assembly of the B. suis 019 genome was performed using the Celera Assembler version 8.1 [24] to produce scaffolds with the default setting. Blastn [25] searching was conducted against the NCBI bacterial genome database with the scaffolds to find the best matched genome B. ceti TE28753-12 as the reference genome. Based on the reference sequence NC_022907.1 and NC_022908.1, we applied the LASTZ and Chain/Net to order the scaffolds on two chromosomes, respectively. The gaps within and between the scaffolds were closed with the GapFiller [26]. Gene prediction was performed using the software Prodigal 2.60 [27]. All putative genes were annotated based on the NCBI NR database. Functional categorization by Gene Ontology (GO) terms and KEGG pathway annotation was carried out based on the best 20 blastx hits from the NR database using the Blast2GO software [24].

3.4. Phylogenetic Analysis Using Homologous Genes

Using all the annotated genes in the B. suis 1330 genome as a reference, we aligned the genes of the other 50 genomes to the reference genes using the blastn software. Taking the sequence identity 70% as threshold, we obtained a total of 2537 homologous genes from the alignment results. These homologous genes were linked into 51 super homologous sequences. The length of the super homologous sequences is about 2,226,048 bp covering more than 2/3 of the Brucella genome. The multiple alignment of 51 super homologous sequences was implemented using ClustalW 2.0 [28]. At last, a phylogenetic tree was built using the UPGMA (unweighted pair-group method with arithmetic means) method in the software MEGA 5.0 [29].

3.5. Comparative Genome Analysis Using 16 Complete Genomes

Removing two genomes due to their abnormal genome size (ATCC 23445) or GC content (A13334), we used B. suis 019 with 15 other complete genomes for the analysis (Table 1). The phylogenetic trees for chromosomes 1 and 2 were constructed using the software Mauve 2.4.0 with the default setting [13]. The collinear analysis and result display was also conducted using Mauve 2.4.0. To clearly display the collinear analysis result in Mauve, we rotated 15 genome sequences to roughly align the collinear regions to start from similar regions on the chromosome.
The syntenic analysis between B. suis 019 and B. suis S2 (v1, id25770) was conducted using the SynMap tool with default setting on the CoGe website. To be compatible with previous studies, the comparison between B. suis 019 and B. suis S2 used the GenBank data CP006961.1 (B. suis S2 chromosome 1) and CP006962.1 (B. suis S2 chromosome 2) (Table 2). The comparison between B. suis 019 and B. suis 1330 used the GenBank data AE014291.4 (B. suis 1330 chromosome 1) and AE014292.2 (B. suis 1330 chromosome 2) (Table 3).

4. Conclusions

In this study, we presented the complete genome of B. suis 019 and conducted comparative genome analysis. B. suis 019 was identified to be closest to the vaccine strain B. suis bv. 1 str. S2. Based on further analysis results, we associated the rsh gene to the pathogenicity of B. suis 019, and the WbkA gene to the rough phenotype of B. suis 019.

Supplementary Materials

The following are available online at www.mdpi.com/link.

Acknowledgments

This work was supported by grants from the International Science and Technology Cooperation Project of China (2013DFA32380), and Natural Science Foundation of China (U1303283, 31260596 and 31360610). The data analysis in this study was supported by National Scientific Data Sharing Platform for Population and Health Translational Cancer Medicine Specials.

Author Contributions

Chuangfu Chen and Shan Gao conceived the project and supervised this study. Shan Gao and Xin Chen wrote the main manuscript text. Ke Zhang analyzed the B. suis 019 draft genome. Shan Gao, Yuanzhi Wang and Xin Chen analyzed the B. suis 019 complete genome. Zhen Wang, Hui Zhang, Fei Guo, Hanping Feng and Wenyi Gu downloaded, managed and processed the data. Changxin Wu, Lei Ma, and Tiansen Li prepared the figures and tables.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
NGS
Next-generation sequencing
LCBs
Locally Collinear Blocks
LPS
Lipopolysaccharide
O-PS
O-polysaccharide
GO
Gene Ontology
UPGMA
unweighted pair-group method with arithmetic means

References

  1. Carvalho, N.A.; Mol, J.P.; Xavier, M.N.; Paixao, T.A.; Lage, A.P.; Santos, R.L. Pathogenesis of bovine brucellosis. Vet. J. 2010, 184, 146–155. [Google Scholar]
  2. Roth, F.; Zinsstag, J.; Orkhon, D.; Chimed-Ochir, G.; Hutton, G.; Cosivi, O.; Carrin, G.; Otte, J. Human health benefits from livestock vaccination for brucellosis: Case study. Bull. World Health Organ. 2003, 81, 867–876. [Google Scholar] [PubMed]
  3. O'Callaghan, D.; Whatmore, A.M. Brucella genomics as we enter the multi-genome era. Brief. Funct. Genomics 2011, 10, 334–341. [Google Scholar] [CrossRef] [PubMed]
  4. Wattam, A.R.; Foster, J.T.; Mane, S.P.; Beckstrom-Sternberg, S.M.; Beckstrom-Sternberg, J.M.; Dickerman, A.W.; Keim, P.; Pearson, T.; Shukla, M.; Ward, D.V. Comparative phylogenomics and evolution of the brucellae reveal a path to virulence. J. Bacteriol. 2014, 196, 920–930. [Google Scholar] [CrossRef] [PubMed]
  5. Liu, Z.; Miao, L. Brucella ovis was first isolated and identified in xinjiang province. Shihezi Sci.Technol. 1992, S1–S4. [Google Scholar]
  6. Liu, Z.; Ren, J. A study on brucella ovis in xinjiang. Endemic Dis. Bull. 1993, 8, 52–60. [Google Scholar]
  7. Zhang, G.; Zhang, Y.; Liu, Z.; Cheng, F.; Miao, L.; Guo, X.; Wang, C.; Bai, C.; Cai, Q. Artifical infection or rheus monkey with brucella ovis and pathological observation and isolation of pathogen. Chin. J. Zoonoses 1999, 15, 78–80. [Google Scholar]
  8. Liu, J.; Chen, C.; Tian, J. Cloning and sequence analysis of the omp25 gene of brucella ovis in xinjiang sheep. Chin. J. Vet. Med. 2006, 41, 6–8. [Google Scholar]
  9. Wang, Y.; Chen, C.; Cui, B.; Cao, X.; Zhang, H. Comparative study on omp gene sequences of brucella ovis 019 strain. Dis. Surveill. 2010, 25, 737–740. [Google Scholar]
  10. Darling, A.E.; Miklos, I.; Ragan, M.A. Dynamics of genome rearrangement in bacterial populations. PLoS Genet. 2008, 4, e1000128. [Google Scholar] [CrossRef] [PubMed]
  11. Xin, X. Orally administrable brucellosis vaccine: Brucella suis strain 2 vaccine. Vaccine 1986, 4, 212–216. [Google Scholar] [CrossRef]
  12. Foster, J.T.; Beckstrom-Sternberg, S.M.; Pearson, T.; Beckstrom-Sternberg, J.S.; Chain, P.S.; Roberto, F.F.; Hnath, J.; Brettin, T.; Keim, P. Whole-genome-based phylogeny and divergence of the genus brucella. J. Bacteriol. 2009, 191, 2864–2870. [Google Scholar] [CrossRef] [PubMed]
  13. Darling, A.C.; Mau, B.; Blattner, F.R.; Perna, N.T. Mauve: Multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 2004, 14, 1394–1403. [Google Scholar] [CrossRef] [PubMed]
  14. Dikow, R.B. Genome-level homology and phylogeny of shewanella (gammaproteobacteria: Lteromonadales: Shewanellaceae). BMC Genomics 2011, 12, 237–250. [Google Scholar] [CrossRef] [PubMed]
  15. Dozot, M.; Boigegrain, R.A.; Delrue, R.M.; Hallez, R.G.; Ouahrani-Bettache, S.; Danese, I.; Letesson, J.J.; De Bolle, X.; K hler, S. The stringent response mediator rsh is required for brucella melitensis and brucella suis virulence, and for expression of the type iv secretion system virb. Cell. Microbiol. 2006, 8, 1791–1802. [Google Scholar] [CrossRef] [PubMed]
  16. Hanna, N.; Ouahrani-Bettache, S.; Drake, K.L.; Adams, L.G.; Köhler, S.; Occhialini, A. Global rsh-dependent transcription profile of brucella suis during stringent response unravels adaptation to nutrient starvation and cross-talk with other stress responses. BMC Genomics 2013, 14, 459–474. [Google Scholar] [CrossRef] [PubMed]
  17. Paulsen, I.T.; Seshadri, R.; Nelson, K.E.; Eisen, J.A.; Heidelberg, J.F.; Read, T.D.; Dodson, R.J.; Umayam, L.; Brinkac, L.M.; Beanan, M.J. The brucella suis genome reveals fundamental similarities between animal and plant pathogens and symbionts. Proc. Natl. Acad. Sci. 2002, 99, 13148–13153. [Google Scholar] [CrossRef] [PubMed]
  18. Wattam, A.R.; Williams, K.P.; Snyder, E.E.; Almeida, N.F.; Shukla, M.; Dickerman, A.; Crasta, O.; Kenyon, R.; Lu, J.; Shallom, J. Analysis of ten brucella genomes reveals evidence for horizontal gene transfer despite a preferred intracellular lifestyle. J. Bacteriol. 2009, 191, 3569–3579. [Google Scholar] [CrossRef] [PubMed]
  19. Godfroid, F.; Cloeckaert, A.; Taminiau, B.; Danese, I.; Tibor, A.; de Bolle, X.; Mertens, P.; Letesson, J.-J. Genetic organisation of the lipopolysaccharide o-antigen biosynthesis region of brucella melitensis 16m (wbk). Res. Microbiol. 2000, 151, 655–668. [Google Scholar] [CrossRef]
  20. Mancilla, M.; Marín, C.M.; Blasco, J.M.; Zárraga, A.M.; López-Goñi, I.; Moriyón, I. Spontaneous excision of the o-polysaccharide wbka glycosyltranferase gene is a cause of dissociation of smooth to rough brucella colonies. J. Bacteriol. 2012, 194, 1860–1867. [Google Scholar] [CrossRef] [PubMed]
  21. Luo, R.; Liu, B.; Xie, Y.; Li, Z.; Huang, W.; Yuan, J.; He, G.; Chen, Y.; Pan, Q.; Liu, Y. Soapdenovo2: An empirically improved memory-efficient short-read de novo assembler. Gigascience 2012, 1, 18–23. [Google Scholar] [CrossRef] [PubMed]
  22. Delcher, A.L.; Harmon, D.; Kasif, S.; White, O.; Salzberg, S.L. Improved microbial gene identification with glimmer. Nucleic Acids Res. 1999, 27, 4636–4641. [Google Scholar] [CrossRef] [PubMed]
  23. Zhang, M.; Sun, H.; Fei, Z.; Zhan, F.; Gong, X.; Gao, S. Fastq_clean: An optimized pipeline to clean the illumina sequencing data with quality control. In Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Belfast, UK, November 2014; pp. 44–48.
  24. Myers, E.W.; Sutton, G.G.; Delcher, A.L.; Dew, I.M.; Fasulo, D.P.; Flanigan, M.J.; Kravitz, S.A.; Mobarry, C.M.; Reinert, K.H.; Remington, K.A. A whole-genome assembly of drosophila. Science 2000, 287, 2196–2204. [Google Scholar] [CrossRef] [PubMed]
  25. Altschul, S.F.; Gish, W.; Miller, W.; Myers, E.W.; Lipman, D.J. Basic local alignment search tool. J.Mol. Biol. 1990, 215, 403–410. [Google Scholar] [CrossRef]
  26. Nadalin, F.; Vezzi, F.; Policriti, A. Gapfiller: A de novo assembly approach to fill the gap within paired reads. BMC Bioinform. 2012, 13, S8–S23. [Google Scholar] [CrossRef] [PubMed]
  27. Hyatt, D.; Chen, G.L.; LoCascio, P.F.; Land, M.L.; Larimer, F.W.; Hauser, L.J. Prodigal: Prokaryotic gene recognition and translation initiation site identification. BMC Bioinform. 2010, 11, 119–129. [Google Scholar] [CrossRef] [PubMed]
  28. Larkin, M.A.; Blackshields, G.; Brown, N.P.; Chenna, R.; McGettigan, P.A.; McWilliam, H.; Valentin, F.; Wallace, I.M.; Wilm, A.; Lopez, R.; et al. Clustal w and clustal x version 2.0. Bioinformatics 2007, 23, 2947–2948. [Google Scholar] [CrossRef] [PubMed]
  29. Tamura, K.; Dudley, J.; Nei, M.; Kumar, S. Mega5: Molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol. Biol. Evolut. 2011, 28, 2731–2739. [Google Scholar] [CrossRef] [PubMed]

Share and Cite

MDPI and ACS Style

Wang, Y.; Wang, Z.; Chen, X.; Zhang, H.; Guo, F.; Zhang, K.; Feng, H.; Gu, W.; Wu, C.; Ma, L.; et al. The Complete Genome of Brucella Suis 019 Provides Insights on Cross-Species Infection. Genes 2016, 7, 7. https://0-doi-org.brum.beds.ac.uk/10.3390/genes7020007

AMA Style

Wang Y, Wang Z, Chen X, Zhang H, Guo F, Zhang K, Feng H, Gu W, Wu C, Ma L, et al. The Complete Genome of Brucella Suis 019 Provides Insights on Cross-Species Infection. Genes. 2016; 7(2):7. https://0-doi-org.brum.beds.ac.uk/10.3390/genes7020007

Chicago/Turabian Style

Wang, Yuanzhi, Zhen Wang, Xin Chen, Hui Zhang, Fei Guo, Ke Zhang, Hanping Feng, Wenyi Gu, Changxin Wu, Lei Ma, and et al. 2016. "The Complete Genome of Brucella Suis 019 Provides Insights on Cross-Species Infection" Genes 7, no. 2: 7. https://0-doi-org.brum.beds.ac.uk/10.3390/genes7020007

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop