Published in

Oxford University Press, Molecular Biology and Evolution, 7(30), p. 1675-1686, 2013

DOI: 10.1093/molbev/mst062

Links

Tools

Export citation

Search in Google Scholar

The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC

Journal article published in 2013 by Walid H. Gharib, Marc Robinson-Rechavi ORCID
This paper is made freely available by the publisher.
This paper is made freely available by the publisher.

Full text: Download

Green circle
Preprint: archiving allowed
Green circle
Postprint: archiving allowed
Green circle
Published version: archiving allowed
Data provided by SHERPA/RoMEO

Abstract

Positive selection is widely estimated from protein coding sequences by the non synonymous / synonymous ratio ω. Increasingly elaborate codon models are used in a likelihood framework for this estimation. While there is widespread concern about the robustness of the estimation of the ω ratio, more efforts are needed to estimate this robustness, especially in the context of complex models. Here we focused on the branch-site codon model. We investigated its robustness on a large set of simulated data. First, we investigated the impact of sequence divergence. We found evidence of under-estimation of the synonymous substitution rate for values as small as 0.5, with a slight increase in false positives for the branch-site test. When dS increases further, under-estimation of dS is worse, but false positives decrease. Interestingly, the detection of true positives follows a similar distribution, with a maximum for intermediary values of dS. Thus high dS is more of a concern for a loss of power (false negatives) than for false positives of the test. Second, we investigated the impact of GC content. We showed that there is no significant difference of false positives between high GC (up to ∼80%) and low GC (∼30%) genes. Moreover, shifts of GC content on a specific branch do not generate many false positives. Nor do major shifts in GC along the gene sequence. Our results confirm that the branch-site is a very conservative test.