Estimating Phylogenies from Large Genomic Datasets

A special issue of Genes (ISSN 2073-4425). This special issue belongs to the section "Plant Genetics and Genomics".

Deadline for manuscript submissions: closed (30 November 2017) | Viewed by 25244

Special Issue Editors


E-Mail Website
Guest Editor
Department of Biological Sciences, University of New Orleans, New Orleans, LA 70148, USA
Interests: diversification; divergence times; molecular clocks; RADseq; phylogeny; phylogenetics; phylogenomics

E-Mail Website
Guest Editor
Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA
Interests: diversification; divergence times; molecular clocks; RADseq; phylogeny; phylogenetics; phylogenomics

Special Issue Information

Dear Colleagues,

With the explosion of new data from “next generation” sequencing technologies over the past decade, evolutionary biologists have been faced with a number of challenges in using this data for phylogenetic inference. The goal of this Special Issue is to provide a brief overview of recently proposed methods for phylogenetic inference, as well as novel techniques for the acquisition, curation, and implementation of large datasets. We are especially interested in a wide variety of studies (both theoretical and empirical), from overviews of recent software/techniques/pipelines that have been developed to infer phylogenies from large genomic datasets.

Dr. Charles Bell
Dr. Stephen A. Smith
Guest Editors

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Genes is an international peer-reviewed open access monthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2600 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • Genomics

  • Proteomics

  • Phylogenomics

  • Phylotranscriptomics

Published Papers (4 papers)

Order results
Result details
Select all
Export citation of selected articles as:

Research

Jump to: Review

18 pages, 7334 KiB  
Article
Genome-Wide Analysis of the PYL Gene Family and Identification of PYL Genes That Respond to Abiotic Stress in Brassica napus
by Feifei Di, Hongju Jian, Tengyue Wang, Xueping Chen, Yiran Ding, Hai Du, Kun Lu, Jiana Li and Liezhao Liu
Genes 2018, 9(3), 156; https://0-doi-org.brum.beds.ac.uk/10.3390/genes9030156 - 12 Mar 2018
Cited by 58 | Viewed by 6795
Abstract
Abscisic acid (ABA) is an endogenous phytohormone that plays important roles in the regulation of plant growth, development, and stress responses. The pyrabactin resistance 1-like (PYR/PYL) protein is a core regulatory component of ABA signaling networks in plants. However, no details regarding this [...] Read more.
Abscisic acid (ABA) is an endogenous phytohormone that plays important roles in the regulation of plant growth, development, and stress responses. The pyrabactin resistance 1-like (PYR/PYL) protein is a core regulatory component of ABA signaling networks in plants. However, no details regarding this family in Brassica napus are available. Here, 46 PYLs were identified in the B. napus genome. Based on phylogenetic analysis, BnPYR1 and BnPYL1-3 belong to subfamily I, BnPYL7-10 belong to subfamily II, and BnPYL4-6 and BnPYL11-13 belong to subfamily III. Analysis of BnPYL conserved motifs showed that every subfamily contained four common motifs. By predicting cis-elements in the promoters, we found that all BnPYL members contained hormone- and stress-related elements and that expression levels of most BnPYLs were relatively higher in seeds at the germination stage than those in other organs or at other developmental stages. Gene Ontology (GO) enrichment showed that BnPYL genes mainly participate in responses to stimuli. To identify crucial PYLs mediating the response to abiotic stress in B. napus, expression changes in 14 BnPYL genes were determined by quantitative real-time RT-PCR after drought, heat, and salinity treatments, and identified BnPYR1-3, BnPYL1-2, and BnPYL7-2 in respond to abiotic stresses. The findings of this study lay a foundation for further investigations of PYL genes in B. napus. Full article
(This article belongs to the Special Issue Estimating Phylogenies from Large Genomic Datasets)
Show Figures

Figure 1

22 pages, 3442 KiB  
Article
Testing for Polytomies in Phylogenetic Species Trees Using Quartet Frequencies
by Erfan Sayyari and Siavash Mirarab
Genes 2018, 9(3), 132; https://0-doi-org.brum.beds.ac.uk/10.3390/genes9030132 - 28 Feb 2018
Cited by 83 | Viewed by 7421
Abstract
Phylogenetic species trees typically represent the speciation history as a bifurcating tree. Speciation events that simultaneously create more than two descendants, thereby creating polytomies in the phylogeny, are possible. Moreover, the inability to resolve relationships is often shown as a (soft) polytomy. Both [...] Read more.
Phylogenetic species trees typically represent the speciation history as a bifurcating tree. Speciation events that simultaneously create more than two descendants, thereby creating polytomies in the phylogeny, are possible. Moreover, the inability to resolve relationships is often shown as a (soft) polytomy. Both types of polytomies have been traditionally studied in the context of gene tree reconstruction from sequence data. However, polytomies in the species tree cannot be detected or ruled out without considering gene tree discordance. In this paper, we describe a statistical test based on properties of the multi-species coalescent model to test the null hypothesis that a branch in an estimated species tree should be replaced by a polytomy. On both simulated and biological datasets, we show that the null hypothesis is rejected for all but the shortest branches, and in most cases, it is retained for true polytomies. The test, available as part of the Accurate Species TRee ALgorithm (ASTRAL) package, can help systematists decide whether their datasets are sufficient to resolve specific relationships of interest. Full article
(This article belongs to the Special Issue Estimating Phylogenies from Large Genomic Datasets)
Show Figures

Figure 1

23136 KiB  
Article
Genome-Wide Identification and Structural Analysis of bZIP Transcription Factor Genes in Brassica napus
by Yan Zhou, Daixiang Xu, Ledong Jia, Xiaohu Huang, Guoqiang Ma, Shuxian Wang, Meichen Zhu, Aoxiang Zhang, Mingwei Guan, Kun Lu, Xinfu Xu, Rui Wang, Jiana Li and Cunmin Qu
Genes 2017, 8(10), 288; https://0-doi-org.brum.beds.ac.uk/10.3390/genes8100288 - 24 Oct 2017
Cited by 62 | Viewed by 6032
Abstract
The basic region/leucine zipper motif (bZIP) transcription factor family is one of the largest families of transcriptional regulators in plants. bZIP genes have been systematically characterized in some plants, but not in rapeseed (Brassica napus). In this study, we identified 247 [...] Read more.
The basic region/leucine zipper motif (bZIP) transcription factor family is one of the largest families of transcriptional regulators in plants. bZIP genes have been systematically characterized in some plants, but not in rapeseed (Brassica napus). In this study, we identified 247 BnbZIP genes in the rapeseed genome, which we classified into 10 subfamilies based on phylogenetic analysis of their deduced protein sequences. The BnbZIP genes were grouped into functional clades with Arabidopsis genes with similar putative functions, indicating functional conservation. Genome mapping analysis revealed that the BnbZIPs are distributed unevenly across all 19 chromosomes, and that some of these genes arose through whole-genome duplication and dispersed duplication events. All expression profiles of 247 bZIP genes were extracted from RNA-sequencing data obtained from 17 different B. napus ZS11 tissues with 42 various developmental stages. These genes exhibited different expression patterns in various tissues, revealing that these genes are differentially regulated. Our results provide a valuable foundation for functional dissection of the different BnbZIP homologs in B. napus and its parental lines and for molecular breeding studies of bZIP genes in B. napus. Full article
(This article belongs to the Special Issue Estimating Phylogenies from Large Genomic Datasets)
Show Figures

Figure 1

Review

Jump to: Research

19 pages, 875 KiB  
Review
Delimiting Coalescence Genes (C-Genes) in Phylogenomic Data Sets
by Mark S. Springer and John Gatesy
Genes 2018, 9(3), 123; https://0-doi-org.brum.beds.ac.uk/10.3390/genes9030123 - 26 Feb 2018
Cited by 22 | Viewed by 4455
Abstract
Summary coalescence methods have emerged as a popular alternative for inferring species trees with large genomic datasets, because these methods explicitly account for incomplete lineage sorting. However, statistical consistency of summary coalescence methods is not guaranteed unless several model assumptions are true, including [...] Read more.
Summary coalescence methods have emerged as a popular alternative for inferring species trees with large genomic datasets, because these methods explicitly account for incomplete lineage sorting. However, statistical consistency of summary coalescence methods is not guaranteed unless several model assumptions are true, including the critical assumption that recombination occurs freely among but not within coalescence genes (c-genes), which are the fundamental units of analysis for these methods. Each c-gene has a single branching history, and large sets of these independent gene histories should be the input for genome-scale coalescence estimates of phylogeny. By contrast, numerous studies have reported the results of coalescence analyses in which complete protein-coding sequences are treated as c-genes even though exons for these loci can span more than a megabase of DNA. Empirical estimates of recombination breakpoints suggest that c-genes may be much shorter, especially when large clades with many species are the focus of analysis. Although this idea has been challenged recently in the literature, the inverse relationship between c-gene size and increased taxon sampling in a dataset—the ‘recombination ratchet’—is a fundamental property of c-genes. For taxonomic groups characterized by genes with long intron sequences, complete protein-coding sequences are likely not valid c-genes and are inappropriate units of analysis for summary coalescence methods unless they occur in recombination deserts that are devoid of incomplete lineage sorting (ILS). Finally, it has been argued that coalescence methods are robust when the no-recombination within loci assumption is violated, but recombination must matter at some scale because ILS, a by-product of recombination, is the raison d’etre for coalescence methods. That is, extensive recombination is required to yield the large number of independently segregating c-genes used to infer a species tree. If coalescent methods are powerful enough to infer the correct species tree for difficult phylogenetic problems in the anomaly zone, where concatenation is expected to fail because of ILS, then there should be a decreasing probability of inferring the correct species tree using longer loci with many intralocus recombination breakpoints (i.e., increased levels of concatenation). Full article
(This article belongs to the Special Issue Estimating Phylogenies from Large Genomic Datasets)
Show Figures

Figure 1

Back to TopTop