Wiley, Alzheimer's & Dementia: The Journal of the Alzheimer's Association, 8(19), p. 3506-3518, 2023
DOI: 10.1002/alz.12967
Full text: Unavailable
AbstractIntroductionThis study aims to explore machine learning (ML) methods for early prediction of Alzheimer's disease (AD) and related dementias (ADRD) using the real‐world electronic health records (EHRs).MethodsA total of 23,835 ADRD and 1,038,643 control patients were identified from the OneFlorida+ Research Consortium. Two ML methods were used to develop the prediction models. Both knowledge‐driven and data‐driven approaches were explored. Four computable phenotyping algorithms were tested.ResultsThe gradient boosting tree (GBT) models trained with the data‐driven approach achieved the best area under the curve (AUC) scores of 0.939, 0.906, 0.884, and 0.854 for early prediction of ADRD 0, 1, 3, or 5 years before diagnosis, respectively. A number of important clinical and sociodemographic factors were identified.DiscussionWe tested various settings and showed the predictive ability of using ML approaches for early prediction of ADRD with EHRs. The models can help identify high‐risk individuals for early informed preventive or prognostic clinical decisions.