Inbred Strain Variant Database (Isvdb): A Repository For Probabilistically Informed Sequence Differences Among The Collaborative Cross Strains And Their Founders

Oreper, Daniel; Cai, Yanwei; Tarantino, Lisa M.; de Villena, Fernando Pardo-Manuel; Pardo-Manuel de Villena, Fernando; Valdar, William

Published in

Genetics Society of America, G3, 6(7), p. 1623-1630, 2017

DOI: 10.1534/g3.117.041491

Tools

Export citation

Search in Google Scholar

Inbred Strain Variant Database (Isvdb): A Repository For Probabilistically Informed Sequence Differences Among The Collaborative Cross Strains And Their Founders

Journal article published in 2017 by Daniel Oreper

, Yanwei Cai

, Lisa M. Tarantino

, Fernando Pardo-Manuel de Villena

, Fernando Pardo-Manuel de Villena, William Valdar

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: policy unknown

Upload

Policy details

Data provided by

Abstract

Abstract The Collaborative Cross (CC) is a panel of recently established multiparental recombinant inbred mouse strains. For the CC, as for any multiparental population (MPP), effective experimental design and analysis benefit from detailed knowledge of the genetic differences between strains. Such differences can be directly determined by sequencing, but until now whole-genome sequencing was not publicly available for individual CC strains. An alternative and complementary approach is to infer genetic differences by combining two pieces of information: probabilistic estimates of the CC haplotype mosaic from a custom genotyping array, and probabilistic variant calls from sequencing of the CC founders. The computation for this inference, especially when performed genome-wide, can be intricate and time-consuming, requiring the researcher to generate nontrivial and potentially error-prone scripts. To provide standardized, easy-to-access CC sequence information, we have developed the Inbred Strain Variant Database (ISVdb). The ISVdb provides, for all the exonic variants from the Sanger Institute mouse sequencing dataset, direct sequence information for CC founders and, critically, the imputed sequence information for CC strains. Notably, the ISVdb also: (1) provides predicted variant consequence metadata; (2) allows rapid simulation of F1 populations; and (3) preserves imputation uncertainty, which will allow imputed data to be refined in the future as additional sequencing and genotyping data are collected. The ISVdb information is housed in an SQL database and is easily accessible through a custom online interface (http://isvdb.unc.edu), reducing the analytic burden on any researcher using the CC.

Published in

Links

Tools

Inbred Strain Variant Database (Isvdb): A Repository For Probabilistically Informed Sequence Differences Among The Collaborative Cross Strains And Their Founders

Abstract