Published in

Wiley, Journal of Systematics and Evolution, 1(51), p. 30-43, 2012

DOI: 10.1111/j.1759-6831.2012.00236.x

Wiley, Journal of Systematics and Evolution, p. n/a-n/a, 2012

DOI: 10.1002/jse.236

Links

Tools

Export citation

Search in Google Scholar

The unbearable uncertainty of Bayesian divergence time estimation

Journal article published in 2012 by Mario Dos Reis ORCID, Ziheng Yang
This paper was not found in any repository, but could be made available legally by the author.
This paper was not found in any repository, but could be made available legally by the author.

Full text: Unavailable

Green circle
Preprint: archiving allowed
Orange circle
Postprint: archiving restricted
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

Divergence time estimation using molecular sequence data relying on uncertain fossil calibrations is an unconventional statistical estimation problem. As the sequence data provide information about the distances only, estimation of absolute times and rates has to rely on information in the prior, so that the model is only semi-identifiable. In this paper, we use a combination of mathematical analysis, computer simulation, and real data analysis to examine the uncertainty in posterior time estimates when the amount of sequence data increases. The analysis extends the infinite-sites theory of Yang and Rannala, which predicts the posterior distribution of divergence times and rate when the amount of data approaches infinity. We found that the posterior credibility interval in general decreases and reaches a non-zero limit when the data size increases. However, for the node with the most precise fossil calibration (as measured by the interval width divided by the mid value), sequence data do not really make the time estimate any more precise. We propose a finite-sites theory which predicts that the square of the posterior interval width approaches its infinite-data limit at the rate 1/n, where n is the sequence length. We suggest a procedure to partition the uncertainty of posterior time estimates into that due to uncertainties in fossil calibrations and that due to sampling errors in the sequence data. We evaluate the impact of conflicting fossil calibrations on posterior time estimation and point out that narrow credibility intervals or overly precise time estimates can be produced by conflicting or erroneous fossil calibrations.