Published in

American Society for Microbiology, Applied and Environmental Microbiology, 1(87), 2020

DOI: 10.1128/aem.02002-20

Links

Tools

Export citation

Search in Google Scholar

Nonsynonymous Polymorphism Counts in Bacterial Genomes: a Comparative Examination

This paper is made freely available by the publisher.
This paper is made freely available by the publisher.

Full text: Download

Green circle
Preprint: archiving allowed
Green circle
Postprint: archiving allowed
Orange circle
Published version: archiving restricted
Data provided by SHERPA/RoMEO

Abstract

Identifying selection from SNP data obtained from whole-genome sequencing studies is challenging. Some current measures used to identify and quantify selection acting on genomes rely on fixed differences; thus, these are inappropriate for SNP data where variants are not fixed. With the increase in whole-genome sequencing studies, it is important to consider SNP data in the context of evolutionary processes. How SNPs are counted and analyzed can help in understanding mutation accumulation and trajectories of strains. We developed a tool for identifying possible evidence of selection and for comparative analysis with other SNP data. We propose a model that provides a rule-of-thumb guideline and two new visualization techniques that can be used to interpret and compare SNP data. We quantify the expected proportion of nonsynonymous SNPs in coding regions under neutrality and demonstrate its use in identifying evidence of positive and negative selection from simulations and empirical data.