Haplotype Inference in Random Population Samples

Lin, Shin; Cutler, David J.; Zwick, Michael E.; Chakravarti, Aravinda

Published in

Cell Press, American Journal of Human Genetics, 71(5), p. 1129-1137, 2002

DOI: 10.1086/344347

Springer Verlag, Lecture Notes in Computer Science, p. 134-134

DOI: 10.1007/978-3-540-24719-7_14

Journal article published in 2002 by Shin Lin, David J. Cutler, Michael E. Zwick, Aravinda Chakravarti

Abstract

Contemporary genotyping and sequencing methods do not provide information on linkage phase in diploid organisms. The application of statistical methods to infer and reconstruct linkage phase in samples of diploid sequences is a potentially time- and labor-saving method. The Stephens-Smith-Donnelly (SSD) algorithm is one such method, which incorporates concepts from population genetics theory in a Markov chain-Monte Carlo technique. We applied a modified SSD method, as well as the expectation-maximization and partition-ligation algorithms, to sequence data from eight loci spanning >1 Mb on the human X chromosome. We demonstrate that the accuracy of the modified SSD method is better than that of the other algorithms and is superior in terms of the number of sites that may be processed. Also, we find phase reconstructions by the modified SSD method to be highly accurate over regions with high linkage disequilibrium (LD). If only polymorphisms with a minor allele frequency >0.2 are analyzed and scored according to the fraction of neighbor relations correctly called, reconstructions are 95.2% accurate over entire 100-kb stretches and are 98.6% accurate within blocks of high LD.
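
The scoring mentioned in the abstract, "the fraction of neighbor relations correctly called", can be illustrated with a minimal Python sketch. It rests on assumptions that are not taken from the paper: alleles are coded 0/1, a "neighbor relation" is read as the relative phase between two adjacent heterozygous sites in one individual, and the function name neighbor_phase_accuracy is hypothetical. The paper's exact definition may differ.

```python
from typing import Sequence, Tuple

Haplotype = Sequence[int]  # alleles coded 0/1 at each site (assumed coding)


def neighbor_phase_accuracy(
    true_pair: Tuple[Haplotype, Haplotype],
    inferred_pair: Tuple[Haplotype, Haplotype],
) -> Tuple[int, int]:
    """Return (correct, total) neighbor relations for one individual.

    Only heterozygous sites carry phase information, so each pair of
    adjacent heterozygous sites is compared: the relation is counted as
    correct when the inferred haplotypes place the two alleles on the
    same chromosome exactly when the true haplotypes do.
    """
    t1, t2 = true_pair
    i1, _ = inferred_pair
    # Sites where the two true haplotypes differ are heterozygous.
    het_sites = [k for k in range(len(t1)) if t1[k] != t2[k]]
    correct = 0
    total = 0
    for a, b in zip(het_sites, het_sites[1:]):
        true_same = t1[a] == t1[b]
        inferred_same = i1[a] == i1[b]
        total += 1
        if true_same == inferred_same:
            correct += 1
    return correct, total


# Hypothetical example: one phase switch between the second and third sites.
truth = ([0, 0, 1, 1], [1, 1, 0, 0])
guess = ([0, 0, 0, 0], [1, 1, 1, 1])
correct, total = neighbor_phase_accuracy(truth, guess)
print(f"{correct}/{total} neighbor relations correct")
```

Summing the correct and total counts over all individuals in a sample and dividing gives a single fraction comparable in spirit to the accuracy figures quoted above. Because the comparison uses relative phase only, swapping the labels of the two inferred haplotypes does not change the score.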
