A global analysis of matches and mismatches between human genetic and linguistic histories

Barbieri, Chiara; Blasi, Damián E.; Arango-Isaza, Epifanía; Sotiropoulos, Alexandros G.; Hammarström, Harald; Wichmann, Søren; Greenhill, Simon J.; Gray, Russell D.; Forkel, Robert; Bickel, Balthasar; Shimizu, Kentaro K.

Published in

National Academy of Sciences, Proceedings of the National Academy of Sciences, 119(47), 2022

DOI: 10.1073/pnas.2122084119

Journal article published in 2022.

This paper is made freely available by the publisher.

Preprint: archiving forbidden

Postprint: archiving allowed

Published version: archiving forbidden

Abstract

Human history is written in both our genes and our languages. The extent to which our biological and linguistic histories are congruent has been the subject of considerable debate, with clear examples of both matches and mismatches. To disentangle the patterns of demographic and cultural transmission, we need a global systematic assessment of matches and mismatches. Here, we assemble a genomic database (GeLaTo, or Genes and Languages Together) specifically curated to investigate genetic and linguistic diversity worldwide. We find that most populations in GeLaTo that speak languages of the same language family (i.e., that descend from the same ancestor language) are also genetically highly similar. However, we also identify nearly 20% mismatches in populations genetically close to linguistically unrelated groups. These mismatches, which occur within the time depth of known linguistic relatedness up to about 10,000 years, are scattered around the world, suggesting that they are a regular outcome in human history. Most mismatches result from populations shifting to the language of a neighboring population that is genetically different because of independent demographic histories. In line with the regularity of such shifts, we find that only half of the language families in GeLaTo are genetically more cohesive than expected under spatial autocorrelations. Moreover, the genetic and linguistic divergence times of population pairs match only rarely, with Indo-European standing out as the family with the most matches in our sample. Together, our database and findings pave the way for systematically disentangling demographic and cultural history and for quantifying processes of shifts in language and social identities on a global scale.
