Performance of criteria for selecting evolutionary models in phylogenetics: A comprehensive study based on simulated datasets

Luo, Arong; Qiao, Huijie; Zhang, Yanzhou; Shi, Weifeng; Ho, Simon Y. W.; Xu, Weijun; Zhang, Aibing; Zhu, Chaodong

Published in

BioMed Central, BMC Evolutionary Biology, 1(10), 2010

DOI: 10.1186/1471-2148-10-242


Journal article published in 2010 by Arong Luo, Huijie Qiao, Yanzhou Zhang, Weifeng Shi, Simon Y. W. Ho, Weijun Xu, Aibing Zhang, Chaodong Zhu

This paper is made freely available by the publisher.


Abstract

Background

Explicit evolutionary models are required in maximum-likelihood and Bayesian inference, the two methods that are overwhelmingly used in phylogenetic studies of DNA sequence data. Appropriate selection of nucleotide substitution models is important because the use of incorrect models can mislead phylogenetic inference. To better understand the performance of different model-selection criteria, we used 33,600 simulated data sets to analyse the accuracy, precision, dissimilarity, and biases of the hierarchical likelihood-ratio test, Akaike information criterion, Bayesian information criterion, and decision theory.

Results

We demonstrate that the Bayesian information criterion and decision theory are the most appropriate model-selection criteria because of their high accuracy and precision. Our results also indicate that in some situations different models are selected by different criteria for the same dataset. Such dissimilarity was the highest between the hierarchical likelihood-ratio test and Akaike information criterion, and lowest between the Bayesian information criterion and decision theory. The hierarchical likelihood-ratio test performed poorly when the true model included a proportion of invariable sites, while the Bayesian information criterion and decision theory generally exhibited similar performance to each other.

Conclusions

Our results indicate that the Bayesian information criterion and decision theory should be preferred for model selection. Together with model-adequacy tests, accurate model selection will serve to improve the reliability of phylogenetic inference and related analyses.
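To make concrete how two of the criteria named in the abstract can select different models for the same dataset, the following is a minimal sketch using the standard definitions AIC = 2k - 2 lnL and BIC = k ln(n) - 2 lnL. It is not the authors' pipeline. The log-likelihood values, the alignment length of 1,000 sites used as the sample size n, and the choice to count only substitution-model parameters (ignoring branch lengths) are all illustrative assumptions, and the function names are hypothetical.

```python
import math


def aic(log_likelihood, num_params):
    """Akaike information criterion: 2k - 2 lnL (lower is better)."""
    return 2 * num_params - 2 * log_likelihood


def bic(log_likelihood, num_params, sample_size):
    """Bayesian information criterion: k ln(n) - 2 lnL (lower is better)."""
    return num_params * math.log(sample_size) - 2 * log_likelihood


def select_models(candidates, sample_size):
    """Return the (AIC-preferred, BIC-preferred) model names.

    candidates maps a model name to (log-likelihood, number of free parameters).
    """
    best_aic = min(candidates, key=lambda m: aic(*candidates[m]))
    best_bic = min(candidates, key=lambda m: bic(*candidates[m], sample_size))
    return best_aic, best_bic


# Hypothetical log-likelihoods for one alignment; not taken from the paper.
# Parameter counts cover the substitution model only (assumption).
candidates = {
    "JC": (-5230.0, 0),
    "HKY": (-5105.0, 4),
    "GTR+G": (-5098.0, 9),
}

# Assumption: sample size n is the alignment length in sites.
aic_choice, bic_choice = select_models(candidates, sample_size=1000)
print("AIC selects:", aic_choice)
print("BIC selects:", bic_choice)
```

With these made-up numbers, AIC favours the more parameter-rich model while BIC, whose penalty grows with ln(n), favours the simpler one. This is the kind of disagreement between criteria that the abstract calls dissimilarity.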
