Strategies for selecting subsets of single-nucleotide polymorphisms to genotype in association studies.

Timothy Bishop, D.; Butler, Joe M.; Bishop, D. Timothy; Barrett, Jennifer H.

Published in

BioMed Central, BMC Genetics, S1(6), 2005

DOI: 10.1186/1471-2156-6-s1-s72

Tools

Export citation

Search in Google Scholar

Strategies for selecting subsets of single-nucleotide polymorphisms to genotype in association studies.

Journal article published in 2005 by D. Timothy Bishop, Joe M. Butler, D. Timothy Bishop, Jennifer H. Barrett

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

Abstract In genetic association studies, linkage disequilibrium (LD) within a region can be exploited to select a subset of single-nucleotide polymorphisms (SNPs) to genotype with minimal loss of information. A novel entropy-based method for selecting SNPs is proposed and compared to an existing method based on the coefficient of determination (R ²) using simulated data from Genetic Analysis Workshop 14. The effect of the size of the sample used to investigate LD (by estimating haplotype frequencies) and hence select the SNPs is also investigated for both measures. It is found that the novel method and the established method select SNP subsets that do not differ greatly. The entropy-based measure may thus have value because it is easier to compute than R ². Increasing the sample size used to estimate haplotype frequencies improves the predictive power of the subset of SNPs selected. A smaller subset of SNPs chosen using a large initial sample to estimate LD can in some instances be more informative than a larger subset chosen based on poor estimates of LD (using a small initial sample). An initial sample size of 50 individuals is sufficient in most situations investigated, which involved selection from a set of 7 SNPs, although to select a larger number of SNPs, a larger initial sample size may be required.

Published in

Links

Tools

Strategies for selecting subsets of single-nucleotide polymorphisms to genotype in association studies.

Abstract