MixupMapper: correcting sample mix-ups in genome-wide datasets increases power to detect small genetic effects

Westra, Harm-Jan; Jansen, Ritsert C.; Fehrmann, Rudolf S. N.; te Meerman, Gerard J.; van Heel, David; Meerman, Gerard J. te; Heel, David van; Wijmenga, Cisca; Franke, Lude

Published in

Oxford University Press (OUP), Bioinformatics, 15(27), p. 2104-2111

DOI: 10.1093/bioinformatics/btr323

Tools

Export citation

Search in Google Scholar

MixupMapper: correcting sample mix-ups in genome-wide datasets increases power to detect small genetic effects

Journal article published in 2011 by Harm-Jan Westra, Ritsert C. Jansen, Rudolf S. N. Fehrmann, Gerard J. te Meerman, David van Heel

, Gerard J. te Meerman, David van Heel, Cisca Wijmenga

, Lude Franke

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

MOTIVATION: Sample mix-ups can arise during sample collection, handling, genotyping or data management. It is unclear how often sample mix-ups occur in genome-wide studies, as there currently are no post hoc methods that can identify these mix-ups in unrelated samples. We have therefore developed an algorithm (MixupMapper) that can both detect and correct sample mix-ups in genome-wide studies that study gene expression levels. RESULTS: We applied MixupMapper to five publicly available human genetical genomics datasets. On average, 3% of all analyzed samples had been assigned incorrect expression phenotypes: in one of the datasets 23% of the samples had incorrect expression phenotypes. The consequences of sample mix-ups are substantial: when we corrected these sample mix-ups, we identified on average 15% more significant cis-expression quantitative trait loci (cis-eQTLs). In one dataset, we identified three times as many significant cis-eQTLs after correction. Furthermore, we show through simulations that sample mix-ups can lead to an underestimation of the explained heritability of complex traits in genome-wide association datasets. Availability and implementation: MixupMapper is freely available at http://www.genenetwork.nl/mixupmapper/

Published in

Links

Tools

MixupMapper: correcting sample mix-ups in genome-wide datasets increases power to detect small genetic effects

Abstract