Privacy-first health research with federated learning

Sadilek, Adam; Liu, Luyang; Nguyen, Dung; Kamruzzaman, Methun; Serghiou, Stylianos; Rader, Benjamin; Ingerman, Alex; Mellem, Stefan; Kairouz, Peter; Nsoesie, Elaine O.; MacFarlane, Jamie; Vullikanti, Anil; Marathe, Madhav; Eastham, Paul; Brownstein, John S.; Arcas, Blaise Aguera Y.; Howell, Michael D.; Hernandez, John

Published in

Nature Research, npj Digital Medicine, 1(4), 2021

DOI: 10.1038/s41746-021-00489-2

Journal article published in 2020 by Adam Sadilek, Luyang Liu, Dung Nguyen, Methun Kamruzzaman, Stylianos Serghiou, Benjamin Rader, Alex Ingerman, Stefan Mellem, Peter Kairouz, Elaine O. Nsoesie, Jamie MacFarlane, Anil Vullikanti, Madhav Marathe, Paul Eastham, John S. Brownstein and other authors.

This paper is made freely available by the publisher.

Preprint: archiving allowed

Postprint: archiving forbidden

Published version: archiving allowed

Abstract

Privacy protection is paramount in conducting health research. However, studies often rely on data stored in a centralized repository, where analysis is done with full access to the sensitive underlying content. Recent advances in federated learning enable building complex machine-learned models that are trained in a distributed fashion. These techniques facilitate the calculation of research study endpoints such that private data never leaves a given device or healthcare system. We show, on a diverse set of single- and multi-site health studies, that federated models can achieve similar accuracy, precision, and generalizability, and lead to the same interpretation as standard centralized statistical models while achieving considerably stronger privacy protections and without significantly raising computational costs. This work is the first to apply modern and general federated learning methods that explicitly incorporate differential privacy to clinical and epidemiological research, across a spectrum of units of federation, model architectures, complexity of learning tasks and diseases. As a result, it enables health research participants to remain in control of their data and still contribute to advancing science, aspects that used to be at odds with each other.
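
The general idea described in the abstract, training across sites while raw records stay local and adding noise for differential privacy, can be illustrated with a toy sketch. This is not the authors' implementation: the one-parameter model y = w * x, the function names, and the parameters `clip_bound` and `noise_std` are assumptions chosen for illustration, and the formal privacy accounting that a real study would need is omitted.

```python
import random

def local_update(w, data, lr=0.1, epochs=5):
    """Run a few gradient steps on one site's private data; return only the weight delta."""
    w_local = w
    for _ in range(epochs):
        # Gradient of the mean squared error for the toy model y = w * x.
        grad = sum(2 * (w_local * x - y) * x for x, y in data) / len(data)
        w_local -= lr * grad
    return w_local - w

def clip(delta, bound):
    """Bound each site's contribution so added noise can mask any single site."""
    return max(-bound, min(bound, delta))

def federated_round(w, sites, clip_bound=1.0, noise_std=0.0, rng=random):
    """One round: sites send clipped deltas; the server sums them, adds Gaussian noise, averages."""
    deltas = [clip(local_update(w, data), clip_bound) for data in sites]
    noisy_sum = sum(deltas) + rng.gauss(0.0, noise_std * clip_bound)
    return w + noisy_sum / len(sites)

def train(sites, rounds=50, **kwargs):
    """Repeat federated rounds starting from w = 0; raw (x, y) pairs never leave local_update."""
    w = 0.0
    for _ in range(rounds):
        w = federated_round(w, sites, **kwargs)
    return w

if __name__ == "__main__":
    # Hypothetical data: three sites whose records all follow y = 2 * x.
    sites = [[(0.5, 1.0), (1.0, 2.0)], [(1.5, 3.0)], [(2.0, 4.0), (1.0, 2.0)]]
    print(train(sites))                                             # no noise
    print(train(sites, noise_std=0.05, rng=random.Random(0)))       # with noise
```

Only the clipped weight deltas cross the site boundary in this sketch. Raising `noise_std` gives a stronger privacy effect at the cost of a less precise estimate, which is the trade-off the abstract reports as small in the studies examined.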
