Prediction of short-term atrial fibrillation risk using primary care electronic health records

Nadarajah, Ramesh; Wu, Jianhua; Hogg, David; Raveendra, Keerthenan; Nakao, Yoko M.; Nakao, Kazuhiro; Arbel, Ronen; Haim, Moti; Zahger, Doron; Parry, John; Bates, Chris; Cowan, Campbel; Gale, Chris P.

Published in

BMJ Publishing Group, Heart, 14(109), p. 1072-1079, 2023

DOI: 10.1136/heartjnl-2022-322076

Tools

Export citation

Search in Google Scholar

Prediction of short-term atrial fibrillation risk using primary care electronic health records

Journal article published in 2023 by Ramesh Nadarajah

, Jianhua Wu

, David Hogg, Keerthenan Raveendra, Yoko M. Nakao

, Kazuhiro Nakao, Ronen Arbel

, Moti Haim, Doron Zahger, John Parry, Chris Bates

, Campbel Cowan, Chris P. Gale

This paper was not found in any repository, but could be made available legally by the author.

Full text: Unavailable

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

ObjectiveAtrial fibrillation (AF) screening by age achieves a low yield and misses younger individuals. We aimed to develop an algorithm in nationwide routinely collected primary care data to predict the risk of incident AF within 6 months (Future Innovations in Novel Detection of Atrial Fibrillation (FIND-AF)).MethodsWe used primary care electronic health record data from individuals aged ≥30 years without known AF in the UK Clinical Practice Research Datalink-GOLD dataset between 2 January 1998 and 30 November 2018, randomly divided into training (80%) and testing (20%) datasets. We trained a random forest classifier using age, sex, ethnicity and comorbidities. Prediction performance was evaluated in the testing dataset with internal bootstrap validation with 200 samples, and compared against the CHA₂DS₂-VASc (Congestive heart failure, Hypertension, Age >75 (2 points), Stroke/transient ischaemic attack/thromboembolism (2 points), Vascular disease, Age 65–74, Sex category) and C₂HEST (Coronary artery disease/Chronic obstructive pulmonary disease (1 point each), Hypertension, Elderly (age ≥75, 2 points), Systolic heart failure, Thyroid disease (hyperthyroidism)) scores. Cox proportional hazard models with competing risk of death were fit for incident longer-term AF between higher and lower FIND-AF-predicted risk.ResultsOf 2 081 139 individuals in the cohort, 7386 developed AF within 6 months. FIND-AF could be applied to all records. In the testing dataset (n=416 228), discrimination performance was strongest for FIND-AF (area under the receiver operating characteristic curve 0.824, 95% CI 0.814 to 0.834) compared with CHA₂DS₂-VASc (0.784, 0.773 to 0.794) and C₂HEST (0.757, 0.744 to 0.770), and robust by sex and ethnic group. The higher predicted risk cohort, compared with lower predicted risk, had a 20-fold higher 6-month incidence rate for AF and higher long-term hazard for AF (HR 8.75, 95% CI 8.44 to 9.06).ConclusionsFIND-AF, a machine learning algorithm applicable at scale in routinely collected primary care data, identifies people at higher risk of short-term AF.

Published in

Links

Tools

Prediction of short-term atrial fibrillation risk using primary care electronic health records

Abstract