Genomic prediction of complex human traits: relatedness, trait architecture and predictive meta-models

Spiliopoulou, Athina; Nagy, Reka; Bermingham, Mairead L.; Huffman, Jennifer E.; Hayward, Caroline; Vitart, Veronique; Rudan, Igor; Campbell, Harry; Wright, Alan F.; Wilson, James F.; Pong-Wong, Ricardo; Agakov, Felix; Navarro, Pau; Haley, Chris S.

Published in

Oxford University Press, Human Molecular Genetics, 24(14), p. 4167-4182, 2015

DOI: 10.1093/hmg/ddv145

Journal article published in 2015 by Athina Spiliopoulou, Reka Nagy, Mairead L. Bermingham, Jennifer E. Huffman, Caroline Hayward, Veronique Vitart, Igor Rudan, Harry Campbell, Alan F. Wright, James F. Wilson, Ricardo Pong-Wong, Felix Agakov, Pau Navarro, Chris S. Haley

This paper is made freely available by the publisher.

Preprint: archiving allowed

Postprint: archiving restricted

Published version: archiving forbidden

Abstract

We explore the prediction of individuals' phenotypes for complex traits using genomic data. We compare several widely used prediction models, including Ridge Regression, LASSO and Elastic Nets estimated from cohort data, and polygenic risk scores constructed using published summary statistics from genome-wide association meta-analyses (GWAMA). We evaluate the interplay between relatedness, trait architecture and optimal marker density, by predicting height, body mass index (BMI) and high-density lipoprotein level (HDL) in two data cohorts, originating from Croatia and Scotland. We empirically demonstrate that dense models are better when all genetic effects are small (height and BMI) and target individuals are related to the training samples, while sparse models predict better in unrelated individuals and when some effects have moderate size (HDL). For HDL, sparse models achieved good across-cohort prediction, performing similarly to the GWAMA risk score and to models trained within the same cohort, which indicates that, for predicting traits with moderately sized effects, large sample sizes and familial structure become less important, though still potentially useful. Finally, we propose a novel ensemble of whole-genome predictors with GWAMA risk scores and demonstrate that the resulting meta-model achieves higher prediction accuracy than either model on its own. We conclude that although current genomic predictors are not accurate enough for diagnostic purposes, performance can be improved without requiring access to large-scale individual-level data. Our methodologically simple meta-model is a means of performing predictive meta-analysis for optimizing genomic predictions and can be easily extended to incorporate multiple population-level summary statistics or other domain knowledge.
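
The idea of combining a whole-genome predictor with a GWAMA risk score can be illustrated with a minimal sketch. The abstract does not specify how the meta-model is fitted, so the code below assumes a simple linear stacking: the phenotype is regressed on the two scores by ordinary least squares, and the combined prediction is evaluated on held-out individuals. All function names, the simulated data and the noise levels are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
import random
from statistics import fmean


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    mx, my = fmean(x), fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def fit_meta_model(score_a, score_b, phenotype):
    """Least-squares fit of phenotype ~ intercept + w_a*score_a + w_b*score_b.

    Assumption: linear stacking of two predictors. Solves the 2x2 normal
    equations on mean-centred data and returns (intercept, w_a, w_b).
    """
    ma, mb, my = fmean(score_a), fmean(score_b), fmean(phenotype)
    saa = sum((a - ma) ** 2 for a in score_a)
    sbb = sum((b - mb) ** 2 for b in score_b)
    sab = sum((a - ma) * (b - mb) for a, b in zip(score_a, score_b))
    say = sum((a - ma) * (y - my) for a, y in zip(score_a, phenotype))
    sby = sum((b - mb) * (y - my) for b, y in zip(score_b, phenotype))
    det = saa * sbb - sab ** 2
    w_a = (say * sbb - sab * sby) / det
    w_b = (saa * sby - sab * say) / det
    return my - w_a * ma - w_b * mb, w_a, w_b


def predict_meta(weights, score_a, score_b):
    """Apply fitted meta-model weights to a pair of score vectors."""
    intercept, w_a, w_b = weights
    return [intercept + w_a * a + w_b * b for a, b in zip(score_a, score_b)]


def simulate(n, rng):
    """Hypothetical toy data: a shared genetic value g, a phenotype g + noise,
    and two imperfect predictors of g with independent errors (standing in
    for a cohort-trained whole-genome predictor and a GWAMA risk score)."""
    score_a, score_b, phenotype = [], [], []
    for _ in range(n):
        g = rng.gauss(0, 1)
        phenotype.append(g + rng.gauss(0, 1))
        score_a.append(g + rng.gauss(0, 1))
        score_b.append(g + rng.gauss(0, 1))
    return score_a, score_b, phenotype


if __name__ == "__main__":
    rng = random.Random(0)
    train = simulate(5000, rng)          # used only to fit the stacking weights
    test_a, test_b, test_y = simulate(5000, rng)
    weights = fit_meta_model(*train)
    meta = predict_meta(weights, test_a, test_b)
    print("predictor A alone:", round(pearson(test_a, test_y), 3))
    print("predictor B alone:", round(pearson(test_b, test_y), 3))
    print("meta-model:       ", round(pearson(meta, test_y), 3))
```

In this toy setting the two scores carry independent errors, so a weighted combination is expected to correlate more strongly with the phenotype than either score alone. With real data the gain would depend on how much non-overlapping information the two predictors contain.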
