Next Article in Journal
Ultrasound Imaging for Risk Assessment in Atherosclerosis
Next Article in Special Issue
Comparative Analysis of Anther Transcriptome Profiles of Two Different Rice Male Sterile Lines Genotypes under Cold Stress
Previous Article in Journal
New Coumarin Derivatives and Other Constituents from the Stem Bark of Zanthoxylum avicennae: Effects on Neutrophil Pro-Inflammatory Responses
Previous Article in Special Issue
Genome-Wide Identification and Analysis of Drought-Responsive Genes and MicroRNAs in Tobacco
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

De Novo Transcriptome Sequencing of Low Temperature-Treated Phlox subulata and Analysis of the Genes Involved in Cold Stress

1
College of Agriculture, Heilongjiang Bayi Agricultural University, Daqing 163000, China
2
Institute of Natural Resources and Ecology, Heilongjiang Academy of Sciences (HAS), Collaborative Innovation Center for Development and Utilization of Forest Resources, Harbin 150040, China
3
College of Horticulture, Northeast Agricultural University, Harbin 150030, China
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2015, 16(5), 9732-9748; https://0-doi-org.brum.beds.ac.uk/10.3390/ijms16059732
Submission received: 15 March 2015 / Revised: 21 April 2015 / Accepted: 22 April 2015 / Published: 29 April 2015
(This article belongs to the Special Issue Abiotic Stress and Gene Networks in Plants)

Abstract

:
Phlox subulata, a perennial herbaceous flower, can survive during the winter of northeast China, where the temperature can drop to −30 °C, suggesting that P. subulata is an ideal model for studying the molecular mechanisms of cold acclimation in plants. However, little is known about the gene expression profile of P. subulata under cold stress. Here, we examined changes in cold stress-related genes in P. subulata. We sequenced three cold-treated (CT) and control (CK) samples of P. subulata. After de novo assembly and quantitative assessment of the obtained reads, 99,174 unigenes were generated. Based on similarity searches with known proteins in public protein databases, 59,994 unigenes were functionally annotated. Among all differentially expressed genes (DEGs), 8302, 10,638 and 11,021 up-regulated genes and 9898, 17,876, and 12,358 down-regulated genes were identified after treatment at 4, 0, and −10 °C, respectively. Furthermore, 3417 up-regulated unigenes were expressed only in CT samples. Twenty major cold-related genes, including transcription factors, antioxidant enzymes, osmoregulation proteins, and Ca2+ and ABA signaling components, were identified, and their expression levels were estimated. Overall, this is the first transcriptome sequencing of this plant species under cold stress. Studies of DEGs involved in cold-related metabolic pathways may facilitate the discovery of cold-resistance genes.

1. Introduction

Low temperature and cold are major environmental stressors that can influence the distribution, growth, and development of plants. Cold acclimation is the main process involved in increasing cold and freezing tolerance of some temperate plant species under chilling and freezing conditions. During this process, extensive physiological and biochemical changes occur in plants, allowing them to withstand low temperature conditions [1]. These changes are mediated through the differential expression of many genes responding to cold stress [2]. Detection and identification of these genes are important for understanding the regulatory mechanisms of cold acclimation [3]. For example, in the model plant Arabidopsis thaliana, many transcription factors that are involved in the regulation of cold-related genes have been identified; the most well-studied pathways are regulated by the transcription factors DREB/CBF (dehydration-responsive-element-binding/C-repeat binding factors), which promote the expression of downstream genes by specific binding to the DRE/CRT (dehydration-responsive-element/C-repeat) cis-elements in the promoter regions of target genes [4]. Recent studies have also identified upstream regulators of DREB/CBF, including ICE1, MYB15, CAMTA and HOS1 [5,6,7,8].
Unlike model plants, many non-model plant species are able to grow in specific environments because of their specialized genetic resources and adaptive mechanisms to native adverse conditions. Moreover, with the advent of next-generation sequencing technology (e.g., Illumina/Solexa-based RNA-seq) [9], large-scale transcriptome data have become available in non-model species. Furthermore, such technologies allow for high accuracy and sensitivity, even for genes with low expression levels, and permit rapid identification and analysis of whole transcriptomes. Currently, Illumina/Solexa has been successfully applied for transcriptome sequencing in many plant species, including Cunninghamia lanceolata [10], Raphanus sativus [11], Ammopiptanthus mongolicus [12], Anthurium andraeanum [13] and Wolfiporia cocos [14].
Phlox subulata L. is a perennial herbaceous flower and a member of the Polemoniaceae family. This plant has strong resistance to salt, drought, heat, and cold stresses. P. subulata can survive safely during the winter at temperatures as low as −40 °C, maintaining a green phenotype at −12 °C. The ability to survive to such low temperatures suggests that P. subulata may have distinctive molecular mechanisms that adapt to cold stress conditions. However, to date, no previous genomic information has been reported in P. subulata plants.
In this study, we aimed to examine the genomic characteristics of P. subulata that confer resistance to cold. To this end, we sequenced and annotated the transcriptome of P. subulata under normal and cold treatment conditions using RNA-seq and publicly available databases. We also analyzed differentially expressed genes (DEGs) of cold-treated (CT; 4, 0 and −10 °C) and control (CK) plants, and identified numerous specific cold-related genes. Our results provided a foundation for understanding the cold response mechanism of P. subulata and provided a valuable resource for the development of cold-tolerant plants through genetic manipulation.

2. Results and Discussion

2.1. Phlox subulata (P. subulata) Transcriptome Sequencing and de Novo Assembly

P. subulata plants grew and blossomed in the spring and autumn (Figure 1A), producing a 10–15 cm high plant with old stem half-lignification (Figure 1B), needle-like and leathery leaves, and 2 cm pink flowers (Figure 1C–F). Sequence analysis and assembly were performed to investigate the transcriptome and gene expression profiles of P. subulata under normal and cold stress. Four cDNA samples from seedlings of CT (4, 0 and −10 °C, subsequently referred to as CT1, CT2 and CT3, respectively) and CK (20 °C) plants were sequenced using an Illumina HiSeq 2000 platform. In total, we obtained approximately 55–59 million raw reads for CT and CK samples. After removing the low-quality reads and reads containing adaptors, 21.3 × 107 clean reads consisting of 19.2 × 109 nucleotides (nt) were obtained with a Q20 percentage (an error probability of 0.01) of more than 97% for four samples (Table 1). All clean reads were deposited in the NCBI Sequence Read Archive (SRA, http://0-www-ncbi-nlm-nih-gov.brum.beds.ac.uk/Traces/sra) database with accession number SRP055942.
Figure 1. Phenotype characteristics of Phlox subulata. (A) Natural populations of P. subulata plants distribution in northeast China; (BF) Phenotypes of the P. subulata plant, and its roots, stems (B); leaves (C,E); and flowers (D,F); Scale bars = 1 cm (E,F).
Figure 1. Phenotype characteristics of Phlox subulata. (A) Natural populations of P. subulata plants distribution in northeast China; (BF) Phenotypes of the P. subulata plant, and its roots, stems (B); leaves (C,E); and flowers (D,F); Scale bars = 1 cm (E,F).
Ijms 16 09732 g001
Table 1. Statistics of the sequencing and assembly of cold-treated (CT) and control (CK) Phlox subulata plants.
Table 1. Statistics of the sequencing and assembly of cold-treated (CT) and control (CK) Phlox subulata plants.
SamplesCK (20 °C)CT1 (4 °C)CT2 (0 °C)CT3 (−10 °C)Total
Total Raw Reads59,580,72859,263,03255,328,17055,583,970
Total Clean Reads55,528,59255,319,07651,409,11451,736,252
Total Clean Nucleotides (nt)4,997,573,2804,978,716,8404,626,820,2604,656,262,680
Q20 percentage97.50%97.53%97.41%97.51%
N percentage0.01%0.01%0.01%0.01%
GC percentage46.40%46.32%46.85%46.95%
Contig
Total Number126,166125,583120,123140,329
Total Length (nt)48,095,63947,695,97642,622,57748,874,606
Mean Length (nt)381380355348
N50820857714705
Unigene
Total Number82,94881,05975,18185,49199,174
Total Length (nt)67,146,82363,741,99454,402,95960,415,66598,892,318
Mean Length (nt)810786724707997
N5014701434136913221622
Total Consensus Sequences82,94881,05975,18185,49199,174
Distinct Clusters30,54629,61523,87627,44842,007
Distinct Singletons52,40251,44451,30558,04357,167
Transcriptome de novo assembly was performed using Trinity program [15]. All high-quality clean reads of each sample were assembled into 125,583 (CT1), 120,123 (CT2), 140,329 (CT3) and 126,166 (CK) contigs, respectively (Table 1). In four samples, the average contig length exceeded 340 nt (length distributions of these contigs are shown in Figure S1). The contigs of each sample were then joined into unigenes, generating 81,059 (CT1), 75,181 (CT2), 85,491 (CT3) and 82,948 (CK) unigenes, respectively. After long-sequence clustering of four samples, a total of 99,174 unigenes were obtained for all samples. The total length was 98,892,318 nt, with a mean length of 997 nt and an N50 of 1622 nt (Table 1). The length distributions of unigenes of each sample are given in Figure 2.

2.2. Functional Annotation and Classification of the Assembled Unigenes

To validate and annotate the assembled unigenes, sequence similarity searches were conducted using sequence- and domain-based alignments. In total, 99,174 unigenes from all groups significantly matched a sequence in at least one of the public databases, including NCBI non-redundant protein (NR), Swiss-Prot protein, Kyoto Encyclopedia of Genes and Genomes (KEGG), Clusters of Orthologous Groups (COG), and Gene Ontology (GO) databases. There were 59,994 unigenes (60.49% of all unigenes) with homologous sequences in at least one of the aforementioned databases (Table 2). Among them, 55,996 (56.5%), 39,519 (39.8%), 36,150 (36.5%), 24,872 (25.1%) and 41,688 (42.0%) unigenes were found in NR, SwissProt, KEGG, COG, and GO databases, respectively. The remaining 39,180 (39.5%) unmapped unigenes were not identified (Table 2). After searching all unigene sequences against four protein databases (NR, SwissProt, KEGG, and COG) using BLASTx (E-value < 0.00001), we identified 57,044 coding sequences (CDSs) and predicted proteins. For unigenes with no BLAST hits, we used ESTScan to predict 6834 CDSs and predicted proteins. The distributions of the CDSs and predicted proteins are shown in Figure 3.
Figure 2. Length distributions of the unigenes from cold-treated (CT) and control (CK) samples. (AE) The length distributions of unigenes from CK (A); CT1 (B); CT2 (C); CT3 (D); and all samples (E).
Figure 2. Length distributions of the unigenes from cold-treated (CT) and control (CK) samples. (AE) The length distributions of unigenes from CK (A); CT1 (B); CT2 (C); CT3 (D); and all samples (E).
Ijms 16 09732 g002
Table 2. Number of functional annotations for all unigenes in public databases.
Table 2. Number of functional annotations for all unigenes in public databases.
Public Protein DatabaseNumber of UnigenesPercentage (%)
NR55,99656.5
Swiss-Prot39,51939.8
KEGG36,15036.5
COG24,87225.1
GO41,66842.0
All59,99460.5
Unmapped39,18039.5
Figure 3. Length distributions of the protein-coding region predictions (CDSs) (AD). The length distributions of CDSs using BLASTx (A); the length distributions of proteins using BLASTx (B); the length distributions of CDSs using ESTscan (C); the length distributions of proteins using ESTscan (D).
Figure 3. Length distributions of the protein-coding region predictions (CDSs) (AD). The length distributions of CDSs using BLASTx (A); the length distributions of proteins using BLASTx (B); the length distributions of CDSs using ESTscan (C); the length distributions of proteins using ESTscan (D).
Ijms 16 09732 g003
For the NR annotations, 55,996 of the unigenes were found to be matched in the database. For the E-value distribution, 44.4% of moderate homolog sequences ranged from 1.0 × 105 to 1.0 × 1045, while 55.6% of sequences had a threshold E-value less than 1.0 × 1045, showing strong homology (Figure 4A). The identity distribution pattern showed that 59% of the sequences had a similarity higher than 60%, while 41% showed similarity between 17% and 60% (Figure 4B). The majority of the annotated sequences corresponded to the known protein sequences of plant species. However, we found that 32.8% of the unigenes had similar sequences to proteins from Vitis vinifera, followed by Lycopersicon esculentum (11.1%), Amygdalus persica (9.5%), Ricinus communis (7.2%) and Populus balsamifera subsp. Trichocarpa (6.4%; Figure 4C). In addition, the other 24.2% unigenes had matches with other plant species, such as Cucumis sativus and Medicago truncatula. This result suggested that the sequences of the P. subulata transcripts generated in the present study were assembled and annotated properly.
Figure 4. Characteristics of homology search of query sequences aligned by BLASTx to the NCBI nonredundant (NR) database. (A) E-value distribution of BLAST hits for matched unigene sequences, using an E-value cutoff of 1.0 × 105; (B) Similarity distribution of top BLAST hits for each unigene; (C) Species distribution of the top BLAST hits.
Figure 4. Characteristics of homology search of query sequences aligned by BLASTx to the NCBI nonredundant (NR) database. (A) E-value distribution of BLAST hits for matched unigene sequences, using an E-value cutoff of 1.0 × 105; (B) Similarity distribution of top BLAST hits for each unigene; (C) Species distribution of the top BLAST hits.
Ijms 16 09732 g004
Based on the NR annotation, we used the GO annotation to classify the possible functions of the unigenes. GO annotation is an international classification system that provides controlled vocabulary for descriptions of gene functions [16]. A total of 41,688 (42.0% of all unigenes) unigenes were successfully assigned to at least one GO term. All the GO terms were classified into 56 functional groups, including three main categories: biological processes, cellular components, and molecular functions (Figure 5). Among biological processes, transcript sequences assigned to cellular (26,638), metabolic (25,871) and single-organism processes (18,247) were the most abundant. For cellular components, proteins were mostly assigned to cell (31,028), cell part (31,024) and organelle (24,793) categories. Within the molecular function category, the majority of the GO terms were predominantly assigned to binding (19,903), catalytic activity (21,442) and transporter activity (3303) (Figure 5). Similar results were found in many plant species [11,12,13], suggesting that the DEGs were responsible for fundamental biological metabolism and regulation.
Figure 5. Histogram of gene ontology (GO) classification. The results are summarized in three main categories: Biological processes (blue), cellular components (brown) and molecular functions (green). The y-axis on the right side indicates the percent of unigenes in a category, and the y-axis on the left side indicates the number of unigenes.
Figure 5. Histogram of gene ontology (GO) classification. The results are summarized in three main categories: Biological processes (blue), cellular components (brown) and molecular functions (green). The y-axis on the right side indicates the percent of unigenes in a category, and the y-axis on the left side indicates the number of unigenes.
Ijms 16 09732 g005
To further predict gene function and to evaluate the integrality of the P. Subulata transcriptome, all unigenes were searched against the COG database. Overall, 24,872 (25.1% of all unigenes) unigenes were assigned COG classifications (Figure 6). By classifying the possible functions of these unigenes, we grouped the unigenes into 25 functional categories. The largest category was “General function prediction only” (8804 of 24,872 unigenes, about 35.4%), followed by “Transcription” (4186 unigenes, 16.8%), “Replication, recombination and repair” (3820, 15.4%), “Translation, ribosomal structure, and biogenesis”, (3708, 14.9%), and “Signal transduction mechanisms” (3307, 13.3%). The categories of “Nuclear structure” (19, 0.07%), “Extracellular structures” (21, 0.08%) and “Cell motility” (320, 1.29%) had the fewest responding unigenes. Additionally, 2287 (9.10%) unigenes were annotated as “Function unknown” (Figure 6).
Figure 6. Clusters of orthologous groups (COG) functional classification. A total of 24,872 unigenes were assigned to 25 classifications. The capital letters along the x-axis indicate the COG categories as listed on the right of the histogram, and the y-axis indicates the number of unigenes.
Figure 6. Clusters of orthologous groups (COG) functional classification. A total of 24,872 unigenes were assigned to 25 classifications. The capital letters along the x-axis indicate the COG categories as listed on the right of the histogram, and the y-axis indicates the number of unigenes.
Ijms 16 09732 g006

2.3. Analysis of Potential Differentially Expressed Genes (DEGs)

To identify different expression levels of unigenes, we used the reads per kb per million reads (RPKM) method to calculate the expression levels of the unigenes from CT and CK samples. Screening of DEGs (FDR ≤ 0.001 and ratios larger than 2), the results showed that the numbers of both up- and down-regulated genes changed with reduction of the temperature. The distribution of transcript changes of unigenes is shown in Figure S2. Among all DEGs, 8302 were induced by cold, while 9898 were down-regulated after incubation at 4 °C (CT1; Figure 7 and Table S1). After incubation at 0 °C (CT2), 10,638 genes were up-regulated and 17,876 were down-regulated (Figure 7 and Table S2). After incubation at −10 °C (CT3), 11,021 genes were up-regulated, and 12,358 were down-regulated (Figure 7 and Table S3). We also identified a total of 3417 up-regulated unigenes expressed only in the CT samples (Table S4). Among them, 533, 1580 and 1922 unigenes were found in CT, CT2 and CT3, respectively (Figure 8). Some of these unigenes had no homologs in the NCBI database, suggesting that they may represent new cold-related transcripts that have not been reported in model plants.
Figure 7. Differentially expressed genes in P. subulata under low temperatures. The numbers of up-regulated and down-regulated genes in cold-treated (CT) samples are shown. We used “false discovery rate ≤0.001 and the absolute value of log2Ratio ≥1” as the threshold to judge the significance of differences in gene expression. Control (CK) plants were grown at 20 °C. CT1, CT2 and CT3 plants were incubated at 4, 0 and −10 °C, respectively.
Figure 7. Differentially expressed genes in P. subulata under low temperatures. The numbers of up-regulated and down-regulated genes in cold-treated (CT) samples are shown. We used “false discovery rate ≤0.001 and the absolute value of log2Ratio ≥1” as the threshold to judge the significance of differences in gene expression. Control (CK) plants were grown at 20 °C. CT1, CT2 and CT3 plants were incubated at 4, 0 and −10 °C, respectively.
Ijms 16 09732 g007
Figure 8. The numbers of up-regulated genes expressed only in the three CT samples (4, 0, and –10 °C).
Figure 8. The numbers of up-regulated genes expressed only in the three CT samples (4, 0, and –10 °C).
Ijms 16 09732 g008

2.4. Analysis of DEGs Related to Metabolism

To understand the biological function of DEGs, we performed metabolic pathway enrichment analysis using the KEGG database. A total of 21,155 up-regulated unigenes and 22,737 down-regulated unigenes were identified after cold treatment. All DEGs were related to 128 metabolic pathways (Table S5). Among the mapped pathways, 43, 33 and 61 pathways were significantly enriched (p ≤ 0.05) after cold treatment. We listed the 10 most significantly enriched pathways (Table 3). After incubation at 4 °C, the four most significantly enriched pathways were plant-pathogen interactions (Figure S3), ether lipid metabolism (Figure S4), biosynthesis of secondary metabolites and glycerophospholipid metabolism pathways (Figure S5). After incubation at 0 °C, the ribosome and plant-pathogen interaction pathways were significantly identified as the top two pathways. After incubation at −10 °C, we found that biosynthesis of secondary metabolites, metabolic pathways, phenylpropanoid biosynthesis, ether lipid metabolism and glycerophospholipid metabolism were significantly enriched (Table 3).
Table 3. The 10 most significantly enriched pathways of differentially expressed genes under cold treatment.
Table 3. The 10 most significantly enriched pathways of differentially expressed genes under cold treatment.
SamplePathway IDPathwayAll Genes with Pathway AnnotationDEGs Genes with Pathway Annotationp Value
CT1/CKko04626Plant-pathogen interaction2388 (6.61%)691 (9.14%)4.63 × 10−22
ko00565Ether lipid metabolism791 (2.19%)278 (3.68%)6.63 × 10−21
ko01110Biosynthesis of secondary metabolites4516 (12.49%)1171 (15.49%)2.76 × 10−18
ko00564Glycerophospholipid metabolism1034 (2.86%)332 (4.39%)9.88 × 10−18
ko00945Stilbenoid, diarylheptanoid and gingerol biosynthesis319 (0.88%)134 (1.77%)1.06 × 10−17
ko04144Endocytosis1097 (3.03%)346 (4.58%)3.95 × 10−17
ko00941Flavonoid biosynthesis391 (1.08%)153 (2.02%)1.23 × 10−16
ko01100Metabolic pathways8740 (24.18%)2088 (27.62%)4.49 × 10−15
ko04075Plant hormone signal transduction1752 (4.85%)499 (6.6%)8.37 × 10−15
ko03050Proteasome166 (0.46%)79 (1.04%)1.60 × 10−14
CT2/CKko03010Ribosome1440 (3.98%)593 (5.23%)6.57 × 10−16
ko04626Plant-pathogen interaction2388 (6.61%)881 (7.77%)1.47 × 10−9
ko00908Zeatin biosynthesis384 (1.06%)176 (1.55%)1.71 × 10−9
ko00460Cyanoamino acid metabolism355 (0.98%)163 (1.44%)5.71 × 10−9
ko00945Stilbenoid, diarylheptanoid and gingerol biosynthesis319 (0.88%)145 (1.28%)8.29 × 10−8
ko00940Phenylpropanoid biosynthesis812 (2.25%)320 (2.82%)5.48 × 10−7
ko00904Diterpenoid biosynthesis188 (0.52%)91 (0.8%)7.60 × 10−7
ko00900Terpenoid backbone biosynthesis284 (0.79%)128 (1.13%)8.04 × 10−7
ko00944Flavone and flavonol biosynthesis179 (0.5%)87 (0.77%)1.06 × 10−6
ko01110Biosynthesis of secondary metabolites4516 (12.49%)1552 (13.69%)1.97 × 10−6
CT3/CKko01110Biosynthesis of secondary metabolites4516 (12.49%)1521 (16.37%)9.61 × 10−38
ko01100Metabolic pathways8740 (24.18%)2704 (29.11%)3.08 × 10−37
ko00940Phenylpropanoid biosynthesis812 (2.25%)351 (3.78%)4.43 × 10−28
ko00565Ether lipid metabolism791 (2.19%)325 (3.5%)9.99 × 10−22
ko00564Glycerophospholipid metabolism1034 (2.86%)403 (4.34%)1.47 × 10−21
ko00500Starch and sucrose metabolism986 (2.73%)376 (4.05%)2.23 × 10−18
ko00941Flavonoid biosynthesis391 (1.08%)179 (1.93%)5.66 × 10−18
ko04144Endocytosis1097 (3.03%)386 (4.16%)9.57 × 10−13
ko00908Zeatin biosynthesis384 (1.06%)161 (1.73%)2.54 × 10−12
ko00945Stilbenoid, diarylheptanoid and gingerol biosynthesis319 (0.88%)137 (1.47%)1.38 × 10−11

2.5. Identification of Major Genes Involved in Cold Stress

Next, we concentrated on analysis of genes up-regulated in response to cold stress in P. subulata. Functional annotations of up-regulated unigenes showed that some unigenes were closely related to cold stress, including cold-inducible transcription factors, cold-induced proteins, antioxidant enzymes, osmoregulation proteins, and proteins related to Ca2+ and ABA signaling [12,13,17,18]. We selected 20 cold-related unigenes and estimated their expression levels by the RPKM method (Table 4). Among these genes, dehydration-responsive element binding protein (DREB), ethylene-responsive element binding factor (AP2/ERF), and the transcription factors MYB (v-myb avian myeloblastosis viral oncogene homolog) and ICE1 (inducer of CBF expression 1) were found to be significantly induced by cold stress in P. subulata. These transcription factors have been described as major factors involved in plant cold-stress responses [5,19,20]. Small groups of cold-related proteins, including low temperature-regulated (LT), cold-regulated (COR), and shock proteins were involved in responses to cold stimuli. Genes within the LT and COR groups have been previously reported to be involved in the cold stress response process [21,22]. Additionally, some antioxidant enzymes and osmoregulation proteins were identified, including superoxide dismutase (SOD), peroxidase (POD), catalase (CAT), glutathione peroxidase (GPX), water stress-induced protein, and sugar transporter. Under cold stress, reactive oxygen species (ROS) accumulate, and water becomes imbalanced in plants. These phenotypes are harmful to both the membrane and related biochemical reactions [23]. The accumulation of antioxidant enzymes and osmoregulation proteins contributes to the stabilization of membranes as these molecules protect membranes against cold injury [18,24]. Moreover, we also found that ABA-hydroxylase and calcium-dependent protein kinase (CDPK) were up-regulated (Table 4). These genes may be involved in the modification and processing of ABA and Ca2+ signaling molecules, which play important roles in signal transduction under cold stress [18,25]. Finally, many unigenes with no known homologs in the NR were also found. These unigenes may represent novel genes specifically expressed in P. subulata or may participate in specific regulatory mechanisms associated with cold adaptation in P. subulata. However, further studies are needed to verify and characterize these unigenes.
Table 4. Analysis of differences in the expression of genes up-regulated in response to cold stress in Phlox subulata.
Table 4. Analysis of differences in the expression of genes up-regulated in response to cold stress in Phlox subulata.
Gene IDAnnotationExpression Difference Analysis (Log2(CT_RPKM/CK_RPKM))
CT1/CKCT2/CKCT3/CK
Unigene933_AllDehydration-responsive element-binding protein 1B0.494.235.12
Unigene30882_AllDehydration-responsive element-binding protein 1D1.494.225.23
Unigene13555_AllAP2/ERF domain-containing transcription factor1.563.504.03
CL11425.Contig1_AllTranscription factor MYB1R11.910.471.49
CL10927.Contig2_AllTranscription factor ICE10.561.882.18
CL12801.Contig1_AllCold-inducible protein3.467.434.37
CL3749.Contig3_AllCold-inducible protein5.3310.299.68
CL10279.Contig1_AllCold acclimation protein COR413PM22.551.361.03
Unigene18542_AllCold acclimation protein COR413IM21.801.561.41
CL6095.Contig8_AllCold regulated COR182.414.132.24
Unigene52767_AllCold shock protein CS666.889.6712.23
CL10842.Contig2_AllCold-inducible RNA-binding protein3.565.613.58
CL12984.Contig6_AllLow-temperature-induced 65 kDa protein1.566.355.07
CL10225.Contig1_AllHeat shock protein0.601.262.75
Unigene52380_AllSuperoxide dismutase4.033.6511.15
Unigene9176_AllPeroxidase 521.022.481.26
Unigene701_AllPeroxidase 571.532.512.75
CL12499.Contig1_AllCatalase isozyme B1.473.215.69
CL6576.Contig2_AllGlutathione peroxidase GPX61.522.681.68
CL11774.Contig1_AllWater stress-induced protein ER5 3.466.194.78
Unigene8303_AllBidirectional sugar transporter N37.756.299.03
CL7748.Contig2_AllBidirectional sugar transporter SWEET110.662.492.04
CL1443.Contig6_AllAbscisic acid-hydroxylase0.803.332.93
Unigene58_AllCalcium-dependent protein kinase0.462.051.26

3. Experimental Section

3.1. Plant Materials and Cold Treatments

P. subulata L. plants were grown in the nursery for three months at the Heilongjiang Academy of the Sciences (China; 128.4°E, 45.0°N). Subsequently, control plants were grown in a growth chamber under a 12/12-h light/dark photoperiod (20–40 μm·s−1·m−2 light intensity) at 20 °C. For cold treatments, plants were transferred to 4, 0 and −10 °C under the same light source. Roots, stems, and leaves of three CT and CK plants were collected simultaneously after treatment for three days and immediately stored at −80 °C until required for RNA extraction.

3.2. RNA Extraction, cDNA Library Construction, and Transcriptome Sequencing

Total RNA from each sample was isolated using TRIzol reagent (Invitrogen, Carlsbad, CA, USA) according to the manufacturer’s instructions. The RNA samples were examined using a NanoDrop ND-8000 Spectrophotometer (NanoDrop, Wilmington, DE, USA); the A260/A280 ratios of four samples ranged from 1.8 to 2.0. The integrity of the RNA samples was assessed with an Agilent 2100 Bioanalyzer (Agilent, Palo Alto, CA, USA). The RNA sample (mixed RNA of root, stem, leaf according to 1:1 proportion) was sent to the Beijing Genomic Institute (BGI, Shenzhen, China) for RNA sequencing. Accordingly, cDNA library construction and Illumina transcriptome sequencing were performed following the methods described by Shu et al. [14].

3.3. De Novo Assembly and Functional Annotation

Raw reads produced from sequencing machines contained dirty reads with adapters or low-quality bases. These data negatively affect bioinformatics analysis. Therefore, dirty raw reads were discarded. De novo assembly of the transcriptome was achieved using the short-reads assembly program, Trinity [15]. Trinity combined reads with a certain length of overlap to form longer fragments, known as contigs. These contigs were subjected to sequence clustering to form longer sequences. Such sequences were defined as unigenes.
All of the assembled unigenes were compared with the publicly protein databases, including NR, Swiss-Prot protein, KEGG, COG and GO databases, using BLASTx analysis with a cut-off E-value of 10−5. The best alignments were used to identify the sequence direction of the assembled unigenes. Unigenes were first aligned to protein databases in the priority order of NR, Swiss-Prot, KEGG and COG. When unigenes could not be aligned using these databases, ESTScan software was introduced to determine the sequence direction [26]. For the NR annotations, the BLAST2GO program was used to obtain GO annotations of assembled unigenes for describing biological processes, molecular functions, and cellular components [27]. After obtaining GO annotations for assembled unigenes, WEGO software [28] was used to determine GO functional classifications for understanding the distribution of unigene functions. Unigenes were aligned to the COG database to predict and classify possible functions of the unigenes. In the final step, KEGG pathway [29] annotations were performed to analyze the metabolic pathways and functions of unigenes.

3.4. Differential Expression Analysis of Unigenes

RPKM [30] was used to calculate unigene expression levels, which eliminated the influence of gene length and sequencing level. The RPKM method formula was:
R P K M = 10 6 C N L / 10 3
where C is the number of reads that uniquely aligned to one unigene; N is the total number of reads that uniquely aligned to all unigenes; and L is the base number in the CDS of one unigene.
The FDR control method [31] was used in multiple hypothesis testing to correct for p values. After the FDR was obtained, the ratio of RPKMs was used to calculate the fold-change in the expression of unigenes in two samples simultaneously. The smaller the FDR and the larger the ratio, the larger was the difference in the expression level between the two samples. In our analysis, DEGs were screened with an FDR threshold of 0.001 or less and an absolute value of the log2 ratio of 1 or more [32]. For pathway enrichment analysis, we looked for significantly enriched metabolic pathways or signal transduction pathways containing DEGs by comparison with the whole genome. Finally, some cold stress-related genes were selected and listed.

4. Conclusions

In this study, we first reported the transcriptome data of P. subulata under cold stress using Illumina/Solexa. The total length of the reads was ~20 Gb. A total of 99,174 unigenes were assembled. The large number of unigenes and their functional annotations provide valuable resource for genetic and genomic studies in P. subulata plants. A large number of DEGs involved in cold stress were identified. Further, we identified a total of 3417 up-regulated unigenes expressed only in the cold-treated P. subulata plants. Studies of these up-regulated unigenes involved in cold-related metabolic pathways will facilitate the discovery of other cold-resistance genes.

Supplementary Materials

Acknowledgments

This work was supported by the Special Fund for Basic Application Technology of Institutes of Heilongjiang Province of China (STJB2015-06). We also thank the staff at Editage for copy editing the manuscript.

Author Contributions

Yanting Qu and Aimin Zhou conceived of the study, coordinated and performed experiments, and prepared the manuscript. Xing Zhang, Huanwei Tang, Ming Liang and Hui Han performed data analysis. Yuhu Zuo analyzed the data with suggestions by Yanting Qu and Aimin Zhou wrote the manuscript. All authors read and approved the final manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Liu, W.; Yu, K.; He, T.; Li, F.; Zhang, D.; Liu, J. The low temperature induced physiological responses of Avena nuda L., a cold-tolerant plant species. Sci. World J. 2013, 2013, 658793. [Google Scholar]
  2. Chinnusamy, V.; Zhu, J.; Zhu, J.K. Cold stress regulation of gene expression in plants. Trends Plant Sci. 2007, 12, 444–451. [Google Scholar] [CrossRef] [PubMed]
  3. Thomashow, M.F. Plant cold acclimation: Freezing tolerance genes and regulatory mechanisms. Annu. Rev. Plant Physiol. Plant Mol. Biol. 1999, 50, 571–599. [Google Scholar] [CrossRef] [PubMed]
  4. Maruyama, K.; Takeda, M.; Kidokoro, S.; Yamada, K.; Sakuma, Y.; Urano, K.; Fujita, M.; Yoshiwara, K.; Matsukura, S.; Morishita, Y.; et al. Metabolic pathways involved in cold acclimation identified by integrated analysis of metabolites and transcripts regulated by DREB1A and DREB2A. Plant Physiol. 2009, 150, 1972–1980. [Google Scholar] [CrossRef] [PubMed]
  5. Miura, K.; Jin, J.B.; Lee, J.; Yoo, C.Y.; Stirm, V.; Miura, T.; Ashworth, E.N.; Bressan, R.A.; Yun, D.J.; Hasegawa, P.M. SIZ1-mediated sumoylation of ICE1 controls CBF3/DREB1A expression and freezing tolerance in Arabidopsis. Plant Cell 2007, 19, 1403–1414. [Google Scholar] [CrossRef] [PubMed]
  6. Eckardt, N.A. CAMTA proteins: A direct link between calcium signals and cold acclimation? Plant Cell 2009, 21, 697. [Google Scholar] [CrossRef] [PubMed]
  7. Doherty, C.J.; van Buskirk, H.A.; Myers, S.J.; Thomashow, M.F. Roles for Arabidopsis CAMTA transcription factors in cold-regulated gene expression and freezing tolerance. Plant Cell 2009, 21, 972–984. [Google Scholar] [CrossRef] [PubMed]
  8. Dong, C.H.; Agarwal, M.; Zhang, Y.; Xie, Q.; Zhu, J.K. The negative regulator of plant cold responses, HOS1, is a RING E3 ligase that mediates the ubiquitination and degradation of ICE1. Proc. Natl. Acad. Sci. USA 2006, 103, 8281–8286. [Google Scholar] [CrossRef] [PubMed]
  9. Morozova, O.; Hirst, M.; Marra, M.A. Applications of new sequencing technologies for transcriptome analysis. Annu. Rev. Genomics Hum. Genet. 2009, 10, 135–151. [Google Scholar] [CrossRef] [PubMed]
  10. Huang, H.H.; Xu, L.L.; Tong, Z.K.; Lin, E.P.; Liu, Q.P.; Cheng, L.J.; Zhu, M.Y. De novo characterization of the Chinese fir (Cunninghamia lanceolata) transcriptome and analysis of candidate genes involved in cellulose and lignin biosynthesis. BMC Genomics 2012, 13, 648. [Google Scholar] [CrossRef] [PubMed]
  11. Wang, Y.; Pan, Y.; Liu, Z.; Zhu, X.; Zhai, L.; Xu, L.; Yu, R.; Gong, Y.; Liu, L. De novo transcriptome sequencing of radish (Raphanus sativus L.) and analysis of major genes involved in glucosinolate metabolism. BMC Genomics 2013, 14, 836. [Google Scholar] [CrossRef] [PubMed]
  12. Pang, T.; Ye, C.Y.; Xia, X.; Yin, W. De novo sequencing and transcriptome analysis of the desert shrub, Ammopiptanthus mongolicus, during cold acclimation using Illumina/Solexa. BMC Genomics 2013, 14, 488. [Google Scholar] [CrossRef] [PubMed]
  13. Tian, D.Q.; Pan, X.Y.; Yu, Y.M.; Wang, W.Y.; Zhang, F.; Ge, Y.Y.; Shen, X.L.; Shen, F.Q.; Liu, X.J. De novo characterization of the Anthurium. transcriptome and analysis of its digital gene expression under cold stress. BMC Genomics 2013, 14, 827. [Google Scholar] [CrossRef] [PubMed]
  14. Shu, S.; Chen, B.; Zhou, M.; Zhao, X.; Xia, H.; Wang, M. De novo sequencing and transcriptome analysis of Wolfiporia cocos to reveal genes related to biosynthesis of triterpenoids. PLoS ONE 2013, 8, e71350. [Google Scholar] [CrossRef] [PubMed]
  15. Grabherr, M.G.; Haas, B.J.; Yassour, M.; Levin, J.Z.; Thompson, D.A.; Amit, I.; Adiconis, X.; Fan, L.; Raychowdhury, R.; Zeng, Q.; et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 2011, 29, 644–652. [Google Scholar] [CrossRef] [PubMed]
  16. Harris, M.A.; Clark, J.; Ireland, A.; Lomax, J.; Ashburner, M.; Foulger, R.; Eilbeck, K.; Lewis, S.; Marshall, B.; Mungall, C.; et al. The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res. 2004, 32, D258–D261. [Google Scholar] [CrossRef] [PubMed]
  17. Winfield, M.O.; Lu, C.; Wilson, I.D.; Coghill, J.A.; Edwards, K.J. Plant responses to cold: Transcriptome analysis of wheat. Plant Biotechnol. J. 2010, 8, 749–771. [Google Scholar] [CrossRef] [PubMed]
  18. An, D.; Yang, J.; Zhang, P. Transcriptome profiling of low temperature-treated cassava apical shoots showed dynamic responses of tropical plant to cold stress. BMC Genomics 2012, 13, 64. [Google Scholar] [CrossRef] [PubMed]
  19. Cook, D.; Fowler, S.; Fiehn, O.; Thomashow, M.F. A prominent role for the CBF cold response pathway in configuring the low-temperature metabolome of Arabidopsis. Proc. Natl. Acad. Sci. USA 2004, 101, 15243–15248. [Google Scholar] [CrossRef] [PubMed]
  20. Thomashow, M.F. So what’s new in the field of plant cold acclimation? Lots! Plant Physiol. 2001, 125, 89–93. [Google Scholar] [CrossRef] [PubMed]
  21. Breton, G.; Danyluk, J.; Charron, J.B.; Sarhan, F. Expression profiling and bioinformatic analyses of a novel stress-regulated multispanning transmembrane protein family from cereals and Arabidopsis. Plant Physiol. 2003, 132, 64–74. [Google Scholar] [CrossRef] [PubMed]
  22. Xue, G.P. The DNA-binding activity of an AP2 transcriptional activator HvCBF2 involved in regulation of low-temperature responsive genes in barley is modulated by temperature. Plant J. 2003, 33, 373–383. [Google Scholar] [CrossRef] [PubMed]
  23. Airaki, M.; Leterrier, M.; Mateos, R.M.; Valderrama, R.; Chaki, M.; Barroso, J.B.; del Rio, L.A.; Palma, J.M.; Corpas, F.J. Metabolism of reactive oxygen species and reactive nitrogen species in pepper (Capsicum annuum L.) plants under low temperature stress. Plant Cell Environ. 2012, 35, 281–295. [Google Scholar] [CrossRef] [PubMed]
  24. Khaper, N.; Kaur, K.; Li, T.; Farahmand, F.; Singal, P.K. Antioxidant enzyme gene expression in congestive heart failure following myocardial infarction. Mol. Cell. Biochem. 2003, 251, 9–15. [Google Scholar] [CrossRef] [PubMed]
  25. Saijo, Y.; Hata, S.; Kyozuka, J.; Shimamoto, K.; Izui, K. Over-expression of a single Ca2+-dependent protein kinase confers both cold and salt/drought tolerance on rice plants. Plant J. 2000, 23, 319–327. [Google Scholar] [CrossRef] [PubMed]
  26. Iseli, C.; Jongeneel, C.V.; Bucher, P. ESTScan: A program for detecting, evaluating, and reconstructing potential coding regions in EST sequences. Proc. Int. Conf. Intell. Syst. Mol. Biol. 1999, 1, 138–148. [Google Scholar]
  27. Conesa, A.; Gotz, S.; Garcia-Gomez, J.M.; Terol, J.; Talon, M.; Robles, M. Blast2GO: A universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 2005, 21, 3674–3676. [Google Scholar] [CrossRef] [PubMed]
  28. Ye, J.; Fang, L.; Zheng, H.; Zhang, Y.; Chen, J.; Zhang, Z.; Wang, J.; Li, S.; Li, R.; Bolund, L.; et al. WEGO: A web tool for plotting GO annotations. Nucleic Acids Res. 2006, 34, W293–W297. [Google Scholar] [CrossRef] [PubMed]
  29. Kanehisa, M.; Goto, S.; Sato, Y.; Furumichi, M.; Tanabe, M. KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 2012, 40, D109–D114. [Google Scholar] [CrossRef] [PubMed]
  30. Mortazavi, A.; Williams, B.A.; McCue, K.; Schaeffer, L.; Wold, B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat. Methods 2008, 5, 621–628. [Google Scholar] [CrossRef] [PubMed]
  31. Reiner, A.; Yekutieli, D.; Benjamini, Y. Identifying differentially expressed genes using false discovery rate controlling procedures. Bioinformatics 2003, 19, 368–375. [Google Scholar] [CrossRef] [PubMed]
  32. Wu, J.; Zhang, Y.; Zhang, H.; Huang, H.; Folta, K.M.; Lu, J. Whole genome wide expression profiles of Vitis amurensis grape responding to downy mildew by using Solexa sequencing technology. BMC Plant Biol. 2010, 10, 234. [Google Scholar] [CrossRef] [PubMed]

Share and Cite

MDPI and ACS Style

Qu, Y.; Zhou, A.; Zhang, X.; Tang, H.; Liang, M.; Han, H.; Zuo, Y. De Novo Transcriptome Sequencing of Low Temperature-Treated Phlox subulata and Analysis of the Genes Involved in Cold Stress. Int. J. Mol. Sci. 2015, 16, 9732-9748. https://0-doi-org.brum.beds.ac.uk/10.3390/ijms16059732

AMA Style

Qu Y, Zhou A, Zhang X, Tang H, Liang M, Han H, Zuo Y. De Novo Transcriptome Sequencing of Low Temperature-Treated Phlox subulata and Analysis of the Genes Involved in Cold Stress. International Journal of Molecular Sciences. 2015; 16(5):9732-9748. https://0-doi-org.brum.beds.ac.uk/10.3390/ijms16059732

Chicago/Turabian Style

Qu, Yanting, Aimin Zhou, Xing Zhang, Huanwei Tang, Ming Liang, Hui Han, and Yuhu Zuo. 2015. "De Novo Transcriptome Sequencing of Low Temperature-Treated Phlox subulata and Analysis of the Genes Involved in Cold Stress" International Journal of Molecular Sciences 16, no. 5: 9732-9748. https://0-doi-org.brum.beds.ac.uk/10.3390/ijms16059732

Article Metrics

Back to TopTop