Distributed Analytics on Sensitive Medical Data: The Personal Health Train

Oya Beyan; Johan van Soest; Lukas Zimmermann; Holger Stenzhorn; Luiz Olavo Bonino da Silva Santos; Ananya Choudhury; Oliver Kohlbacher; M.-D. Rezaul Karim; Michel Dumontier; Stefan Decker; Andre Dekker

Published in

Science Data Bank Datasets, 2022

DOI: 10.11922/sciencedb.j00104.00070

Massachusetts Institute of Technology Press, Data Intelligence, 2(1-2), p. 96-107, 2020

DOI: 10.1162/dint_a_00032


Journal article published in 2020 by Oya Beyan, Johan van Soest, Lukas Zimmermann, Holger Stenzhorn, Luiz Olavo Bonino da Silva Santos, Ananya Choudhury, Oliver Kohlbacher, M.-D. Rezaul Karim and other authors.

This paper is made freely available by the publisher.


Abstract

In recent years, as new technologies have emerged across the healthcare ecosystem, ever more data have been generated. Advanced analytics could harness data collected from numerous sources, whether from healthcare institutions or generated by individuals themselves via apps and devices, to drive innovations in the treatment and diagnosis of diseases, improve the care given to patients, and empower citizens to participate in decision-making about their own health and well-being. However, the sensitive nature of health data prevents healthcare organizations from sharing the data. The Personal Health Train (PHT) is a novel approach that aims to establish a distributed data analytics infrastructure enabling the (re)use of distributed healthcare data while data owners stay in control of their own data. The main principle of the PHT is that data remain in their original location, and analytical tasks visit the data sources and execute there. The PHT provides a distributed, flexible approach to using data in a network of participants, incorporating the FAIR principles. It facilitates the responsible use of sensitive and/or personal data by adopting international principles and regulations. This paper presents the concepts and main components of the PHT and demonstrates how it complies with the FAIR principles.
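
The principle that data stay in place while analytical tasks travel to them can be illustrated with a small sketch. This is not the PHT implementation: the names `Station`, `run_train`, `local_sum_and_count` and `combine_mean`, and the sample values, are all hypothetical and chosen only for illustration. The sketch assumes each data holder runs a visiting task locally and returns only an aggregate, never the underlying records.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Partial = Tuple[float, int]  # (local sum, local record count)


@dataclass
class Station:
    """Hypothetical data holder: its records are only read inside execute()."""
    name: str
    _records: List[float]

    def execute(self, task: Callable[[List[float]], Partial]) -> Partial:
        # The visiting task runs where the data live; only its
        # aggregate result is handed back to the caller.
        return task(self._records)


def local_sum_and_count(records: List[float]) -> Partial:
    """Illustrative analytical task: a per-station aggregate."""
    return (sum(records), len(records))


def run_train(stations: List[Station],
              task: Callable[[List[float]], Partial]) -> List[Partial]:
    """Send the same task to every station and collect the partial results."""
    return [station.execute(task) for station in stations]


def combine_mean(partials: List[Partial]) -> float:
    """Combine per-station aggregates into a global mean."""
    total = sum(s for s, _ in partials)
    count = sum(n for _, n in partials)
    return total / count


# Made-up measurements held by two hypothetical institutions.
stations = [
    Station("hospital_a", [120.0, 130.0, 125.0]),
    Station("hospital_b", [140.0, 135.0]),
]

partials = run_train(stations, local_sum_and_count)
global_mean = combine_mean(partials)
print(partials, global_mean)
```

In this sketch the coordinating code only ever sees per-station sums and counts. A real deployment would additionally need authentication, task vetting and disclosure control at each data source, none of which is modelled here.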
