Development of a machine learning model using electrocardiogram signals to improve acute pulmonary embolism screening

Somani, Sulaiman S.; Honarvar, Hossein; Narula, Sukrit; Landi, Isotta; Lee, Shawn; Khachatoorian, Yeraz; Rehmani, Arsalan; Kim, Andrew; De Freitas, Jessica K.; Teng, Shelly; Jaladanki, Suraj; Kumar, Arvind; Russak, Adam; Zhao, Shan P.; Freeman, Robert; Levin, Matthew A.; Nadkarni, Girish N.; Kagen, Alexander C.; Argulian, Edgar; Glicksberg, Benjamin S.

Published in

Oxford University Press, European Heart Journal – Digital Health, 1(3), p. 56-66, 2021

DOI: 10.1093/ehjdh/ztab101

Tools

Export citation

Search in Google Scholar

Development of a machine learning model using electrocardiogram signals to improve acute pulmonary embolism screening

Journal article published in 2021 by Sulaiman S. Somani

, Hossein Honarvar

, Sukrit Narula, Isotta Landi

, Shawn Lee, Yeraz Khachatoorian, Arsalan Rehmani

, Andrew Kim

, Jessica K. De Freitas

, Shelly Teng

, Suraj Jaladanki

, Arvind Kumar

, Adam Russak

, Shan P. Zhao

, Robert Freeman and other authors.

Distributing this paper is prohibited by the publisher

Full text: Unavailable

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

Abstract Aims Clinical scoring systems for pulmonary embolism (PE) screening have low specificity and contribute to computed tomography pulmonary angiogram (CTPA) overuse. We assessed whether deep learning models using an existing and routinely collected data modality, electrocardiogram (ECG) waveforms, can increase specificity for PE detection. Methods and results We create a retrospective cohort of 21 183 patients at moderate- to high suspicion of PE and associate 23 793 CTPAs (10.0% PE-positive) with 320 746 ECGs and encounter-level clinical data (demographics, comorbidities, vital signs, and labs). We develop three machine learning models to predict PE likelihood: an ECG model using only ECG waveform data, an EHR model using tabular clinical data, and a Fusion model integrating clinical data and an embedded representation of the ECG waveform. We find that a Fusion model [area under the receiver-operating characteristic curve (AUROC) 0.81 ± 0.01] outperforms both the ECG model (AUROC 0.59 ± 0.01) and EHR model (AUROC 0.65 ± 0.01). On a sample of 100 patients from the test set, the Fusion model also achieves greater specificity (0.18) and performance (AUROC 0.84 ± 0.01) than four commonly evaluated clinical scores: Wells’ Criteria, Revised Geneva Score, Pulmonary Embolism Rule-Out Criteria, and 4-Level Pulmonary Embolism Clinical Probability Score (AUROC 0.50–0.58, specificity 0.00–0.05). The model is superior to these scores on feature sensitivity analyses (AUROC 0.66–0.84) and achieves comparable performance across sex (AUROC 0.81) and racial/ethnic (AUROC 0.77–0.84) subgroups. Conclusion Synergistic deep learning of ECG waveforms with traditional clinical variables can increase the specificity of PE detection in patients at least at moderate suspicion for PE.

Published in

Links

Tools

Development of a machine learning model using electrocardiogram signals to improve acute pulmonary embolism screening

Abstract