Improving explainability of deep neural network-based electrocardiogram interpretation using variational auto-encoders

van de Leur, Rutger R.; Bos, Max N.; Taha, Karim; Sammani, Arjan; Yeung, Ming Wai; van Duijvenboden, Stefan; Lambiase, Pier D.; Hassink, Rutger J.; van der Harst, Pim; Doevendans, Pieter A.; Gupta, Deepak K.; van Es, René

Published in

Oxford University Press, European Heart Journal – Digital Health, 3(3), p. 390-404, 2022

DOI: 10.1093/ehjdh/ztac038

Journal article published in 2022 by Rutger R. van de Leur, Max N. Bos, Karim Taha, Arjan Sammani, Ming Wai Yeung, Stefan van Duijvenboden, Pier D. Lambiase, Rutger J. Hassink, Pim van der Harst, Pieter A. Doevendans, Deepak K. Gupta, René van Es

This paper is made freely available by the publisher.

Preprint: archiving allowed

Postprint: archiving allowed

Published version: archiving allowed

Abstract

Aims: Deep neural networks (DNNs) perform excellently in interpreting electrocardiograms (ECGs), both for conventional ECG interpretation and for novel applications such as detection of reduced ejection fraction (EF). Despite these promising developments, implementation is hampered by the lack of trustworthy techniques to explain the algorithms to clinicians. In particular, currently employed heatmap-based methods have been shown to be inaccurate.

Methods and results: We present a novel pipeline consisting of a variational auto-encoder (VAE) that learns the underlying factors of variation of the median beat ECG morphology (the FactorECG), which are subsequently used in common and interpretable prediction models. As the ECG factors can be made explainable by generating and visualizing ECGs on both the model and individual level, the pipeline provides improved explainability over heatmap-based methods. By training on a database with 1.1 million ECGs, the VAE can compress the ECG into 21 generative ECG factors, most of which are associated with physiologically valid underlying processes. Performance of the explainable pipeline was similar to that of ‘black box’ DNNs in conventional ECG interpretation [area under the receiver operating characteristic curve (AUROC) 0.94 vs. 0.96], detection of reduced EF (AUROC 0.90 vs. 0.91), and prediction of 1-year mortality (AUROC 0.76 vs. 0.75). In contrast to the ‘black box’ DNNs, our pipeline showed which morphological ECG changes were important for prediction. Results were confirmed in a population-based external validation dataset.

Conclusions: Future studies on DNNs for ECGs should employ explainable pipelines, which facilitate clinical implementation by building confidence in artificial intelligence and making it possible to identify biased models.
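The abstract does not give the VAE architecture or loss, so the following is only a minimal sketch of the generic VAE building blocks that such a pipeline relies on, not the authors' implementation. It assumes a standard diagonal-Gaussian latent space with a standard normal prior. The function names and the choice of factor index are hypothetical. Only the number of factors (21) comes from the abstract. The sketch shows the KL regularisation term, the reparameterisation step, and a "factor traversal", in which one factor is swept while the others stay fixed, so that a trained decoder (not shown here) could generate the ECGs used for visual explanation.

```python
import math
import random

N_FACTORS = 21  # number of generative ECG factors reported in the abstract


def kl_to_standard_normal(mu, log_var):
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ), the usual VAE regulariser."""
    return 0.5 * sum(m * m + math.exp(lv) - 1.0 - lv for m, lv in zip(mu, log_var))


def sample_latent(mu, log_var, rng):
    """Reparameterisation trick: z = mu + sigma * eps, with eps ~ N(0, 1)."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0) for m, lv in zip(mu, log_var)]


def factor_traversal(z, index, values):
    """Return copies of z in which only factor `index` is swept over `values`."""
    sweeps = []
    for v in values:
        z_new = list(z)  # copy so the input vector is left untouched
        z_new[index] = v
        sweeps.append(z_new)
    return sweeps


# Hypothetical encoder output for one median beat (all zeros for illustration).
rng = random.Random(0)
mu = [0.0] * N_FACTORS
log_var = [0.0] * N_FACTORS

z = sample_latent(mu, log_var, rng)  # one stochastic latent sample
grid = factor_traversal(mu, index=4, values=[-3.0, -1.5, 0.0, 1.5, 3.0])
# Each vector in `grid` would be passed to a trained decoder to draw one ECG.
```

In the second stage described in the abstract, the 21 factor values (here `mu`) would serve as inputs to a common interpretable model, so that each model coefficient can be paired with a traversal of the corresponding factor.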
