LinkImpute: Fast and Accurate Genotype Imputation for Nonmodel Organisms

Money, Daniel; Gardner, Kyle; Migicovsky, Zoë; Schwaninger, Heidi; Zhong, Gan-Yuan; Myles, Sean

Published in

Genetics Society of America, G3, 11(5), p. 2383-2390, 2015

DOI: 10.1534/g3.115.021667

Tools

Export citation

Search in Google Scholar

LinkImpute: Fast and Accurate Genotype Imputation for Nonmodel Organisms

Journal article published in 2015 by Daniel Money, Kyle Gardner, Zoë Migicovsky

, Heidi Schwaninger, Gan-Yuan Zhong, Sean Myles

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: policy unknown

Upload

Policy details

Data provided by

Abstract

Abstract Obtaining genome-wide genotype data from a set of individuals is the first step in many genomic studies, including genome-wide association and genomic selection. All genotyping methods suffer from some level of missing data, and genotype imputation can be used to fill in the missing data and improve the power of downstream analyses. Model organisms like human and cattle benefit from high-quality reference genomes and panels of reference genotypes that aid in imputation accuracy. In nonmodel organisms, however, genetic and physical maps often are either of poor quality or are completely absent, and there are no panels of reference genotypes available. There is therefore a need for imputation methods designed specifically for nonmodel organisms in which genomic resources are poorly developed and marker order is unreliable or unknown. Here we introduce LinkImpute, a software package based on a k-nearest neighbor genotype imputation method, LD-kNNi, which is designed for unordered markers. No physical or genetic maps are required, and it is designed to work on unphased genotype data from heterozygous species. It exploits the fact that markers useful for imputation often are not physically close to the missing genotype but rather distributed throughout the genome. Using genotyping-by-sequencing data from diverse and heterozygous accessions of apples, grapes, and maize, we compare LD-kNNi with several genotype imputation methods and show that LD-kNNi is fast, comparable in accuracy to the best-existing methods, and exhibits the least bias in allele frequency estimates.

Published in

Links

Tools

LinkImpute: Fast and Accurate Genotype Imputation for Nonmodel Organisms

Abstract