Published in

Wiley, Genetic Epidemiology, 1(47), p. 45-60, 2022

DOI: 10.1002/gepi.22501

Links

Tools

Export citation

Search in Google Scholar

An empirical Bayes approach to improving population‐specific genetic association estimation by leveraging cross‐population data

Journal article published in 2022 by Li Hsu ORCID, Anna Kooperberg, Alexander P. Reiner, Charles Kooperberg
This paper was not found in any repository, but could be made available legally by the author.
This paper was not found in any repository, but could be made available legally by the author.

Full text: Unavailable

Green circle
Preprint: archiving allowed
Orange circle
Postprint: archiving restricted
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

AbstractPopulations of non‐European ancestry are substantially underrepresented in genome‐wide association studies (GWAS). As genetic effects can differ between ancestries due to possibly different causal variants or linkage disequilibrium patterns, a meta‐analysis that includes GWAS of all populations yields biased estimation in each of the populations and the bias disproportionately impacts non‐European ancestry populations. This is because meta‐analysis combines study‐specific estimates with inverse variance as the weights, which causes biases towards studies with the largest sample size, typical of the European ancestry population. In this paper, we propose two empirical Bayes (EB) estimators to borrow the strength of information across populations although accounting for between‐population heterogeneity. Extensive simulation studies show that the proposed EB estimators are largely unbiased and improve efficiency compared to the population‐specific estimator. In contrast, even though the meta‐analysis estimator has a much smaller variance, it yields significant bias when the genetic effect is heterogeneous across populations. We apply the proposed EB estimators to a large‐scale trans‐ancestry GWAS of stroke and demonstrate that the EB estimators reduce the variance of the population‐specific estimator substantially, with the effect estimates close to the population‐specific estimates.