Access

You are not currently logged in.

Access your personal account or get JSTOR access through your library or other institution:

login

Log in to your personal account or through your institution.

Species Trees from Gene Trees: Reconstructing Bayesian Posterior Distributions of a Species Phylogeny Using Estimated Gene Tree Distributions

Liang Liu and Dennis K. Pearl
Systematic Biology
Vol. 56, No. 3 (Jun., 2007), pp. 504-514
Stable URL: http://www.jstor.org/stable/20143055
Page Count: 11
  • Download ($42.00)
  • Cite this Item
Species Trees from Gene Trees: Reconstructing Bayesian Posterior Distributions of a Species Phylogeny Using Estimated Gene Tree Distributions
Preview not available

Abstract

The desire to infer the evolutionary history of a group of species should be more viable now that a considerable amount of multilocus molecular data is available. However, the current molecular phylogenetic paradigm still reconstructs gene trees to represent the species tree. Further, commonly used methods of combining data, such as the concatenation method, are known to be inconsistent in some circumstances. In this paper, we propose a Bayesian hierarchical model to estimate the phylogeny of a group of species using multiple estimated gene tree distributions, such as those that arise in a Bayesian analysis of DNA sequence data. Our model employs substitution models used in traditional phylogenetics but also uses coalescent theory to explain genealogical signals from species trees to gene trees and from gene trees to sequence data, thereby forming a complete stochastic model to estimate gene trees, species trees, ancestral population sizes, and species divergence times simultaneously. Our model is founded on the assumption that gene trees, even of unlinked loci, are correlated due to being derived from a single species tree and therefore should be estimated jointly. We apply the method to two multilocus data sets of DNA sequences. The estimates of the species tree topology and divergence times appear to be robust to the prior of the population size, whereas the estimates of effective population sizes are sensitive to the prior used in the analysis. These analyses also suggest that the model is superior to the concatenation method in fitting these data sets and thus provides a more realistic assessment of the variability in the distribution of the species tree that may have produced the molecular information at hand. Future improvements of our model and algorithm should include consideration of other factors that can cause discordance of gene trees and species trees, such as horizontal transfer or gene duplication.

Page Thumbnails

  • Thumbnail: Page 
504
    504
  • Thumbnail: Page 
505
    505
  • Thumbnail: Page 
506
    506
  • Thumbnail: Page 
507
    507
  • Thumbnail: Page 
508
    508
  • Thumbnail: Page 
509
    509
  • Thumbnail: Page 
510
    510
  • Thumbnail: Page 
511
    511
  • Thumbnail: Page 
512
    512
  • Thumbnail: Page 
513
    513
  • Thumbnail: Page 
514
    514