Published in

Nature Research, Nature, 7604(533), p. 539-542

DOI: 10.1038/nature17671



Export citation

Search in Google Scholar

Genome-wide association study identifies 74 loci associated with educational attainment

Journal article published in 2016 by Sven J. van der Lee, Christiaan de Leeuw, Aysu Okbay, Jonathan P. (Jonathan) Beauchamp, Mark Alan Fontana, James J. (James J.) Lee, Tune H. (Tune) Pers, Cornelius A. (Cornelius A.) Rietveld, R. de Vlaming, Patrick Turley, Guo-Bo Chen ORCID, Valur Emilsson, S. Fleur W. Meddens, Sven Oskarsson, Joseph K. (Joseph K.) Pickrell and other authors.
This paper is available in a repository.
This paper is available in a repository.

Full text: Download

Green circle
Preprint: archiving allowed
Orange circle
Postprint: archiving restricted
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO


Educational attainment is strongly influenced by social and other environmental factors, but genetic factors are estimated to account for at least 20% of the variation across individuals. Here we report the results of a genome-wide association study (GWAS) for educational attainment that extends our earlier discovery sample of 101,069 individuals to 293,723 individuals, and a replication study in an independent sample of 111,349 individuals from the UK Biobank. We identify 74 genome-wide significant loci associated with the number of years of schooling completed. Single-nucleotide polymorphisms associated with educational attainment are disproportionately found in genomic regions regulating gene expression in the fetal brain. Candidate genes are preferentially expressed in neural tissue, especially during the prenatal period, and enriched for biological pathways involved in neural development. Our findings demonstrate that, even for a behavioural phenotype that is mostly environmentally determined, a well-powered GWAS identifies replicable associated genetic variants that suggest biologically relevant pathways. Because educational attainment is measured in large numbers of individuals, it will continue to be useful as a proxy phenotype in efforts to characterize the genetic influences of related phenotypes, including cognition and neuropsychiatric diseases.